<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" dtd-version="1.2" xml:lang="en">
    <front>
        <journal-meta>
            <journal-id journal-id-type="pmc">F1000Research</journal-id>
            <journal-title-group>
                <journal-title>F1000Research</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2046-1402</issn>
            <publisher>
                <publisher-name>F1000 Research Limited</publisher-name>
                <publisher-loc>London, UK</publisher-loc>
            </publisher>
        </journal-meta>
        <article-meta>
            <article-id pub-id-type="doi">10.12688/f1000research.9495.2</article-id>
            <article-categories>
                <subj-group subj-group-type="heading">
                    <subject>Research Article</subject>
                </subj-group>
                <subj-group>
                    <subject>Articles</subject>
                    <subj-group>
                        <subject>Medical Genetics</subject>
                    </subj-group>
                    <subj-group>
                        <subject>Neurogenetics</subject>
                    </subj-group>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>Human brain harbors single nucleotide somatic variations in functionally relevant genes possibly mediated by oxidative stress</article-title>
                <fn-group content-type="pub-status">
                    <fn>
                        <p>[version 2; peer review: 2 approved with reservations]</p>
                    </fn>
                </fn-group>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Sharma</surname>
                        <given-names>Anchal</given-names>
                    </name>
                    <xref ref-type="aff" rid="a1">1</xref>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no" equal-contrib="yes">
                    <name>
                        <surname>Ansari</surname>
                        <given-names>Asgar Hussain</given-names>
                    </name>
                    <xref ref-type="aff" rid="a1">1</xref>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no" equal-contrib="yes">
                    <name>
                        <surname>Kumari</surname>
                        <given-names>Renu</given-names>
                    </name>
                    <xref ref-type="aff" rid="a1">1</xref>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Pandey</surname>
                        <given-names>Rajesh</given-names>
                    </name>
                    <xref ref-type="aff" rid="a3">3</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Rehman</surname>
                        <given-names>Rakhshinda</given-names>
                    </name>
                    <xref ref-type="aff" rid="a2">2</xref>
                    <xref ref-type="aff" rid="a4">4</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Mehani</surname>
                        <given-names>Bharati</given-names>
                    </name>
                    <xref ref-type="aff" rid="a1">1</xref>
                    <xref ref-type="aff" rid="a2">2</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Varma</surname>
                        <given-names>Binuja</given-names>
                    </name>
                    <xref ref-type="aff" rid="a3">3</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Desiraju</surname>
                        <given-names>Bapu K.</given-names>
                    </name>
                    <xref ref-type="aff" rid="a2">2</xref>
                    <xref ref-type="aff" rid="a4">4</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Mabalirajan</surname>
                        <given-names>Ulaganathan</given-names>
                    </name>
                    <xref ref-type="aff" rid="a2">2</xref>
                    <xref ref-type="aff" rid="a4">4</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Agrawal</surname>
                        <given-names>Anurag</given-names>
                    </name>
                    <xref ref-type="aff" rid="a2">2</xref>
                    <xref ref-type="aff" rid="a4">4</xref>
                </contrib>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>Mukhopadhyay</surname>
                        <given-names>Arijit</given-names>
                    </name>
                    <xref ref-type="corresp" rid="c1">a</xref>
                    <xref ref-type="aff" rid="a1">1</xref>
                    <xref ref-type="aff" rid="a2">2</xref>
                    <xref ref-type="aff" rid="a5">5</xref>
                </contrib>
                <aff id="a1">
                    <label>1</label>Genomics &amp; Molceular Medicine Unit, CSIR-Institute of Genomics &amp; Integrative Biology, Delhi, 110020, India</aff>
                <aff id="a2">
                    <label>2</label>Academy of Scientific and Innovative Research, CSIR-Institute of Genomics &amp; Integrative Biology (AcSIR-IGIB), Delhi, 110020, India</aff>
                <aff id="a3">
                    <label>3</label>CSIR Ayurgenomics Unit- TRISUTRA, CSIR-Institute of Genomics &amp; Integrative Biology, Delhi, 110020, India</aff>
                <aff id="a4">
                    <label>4</label>Molecular Immunogenetics Unit, CSIR-Institute of Genomics &amp; Integrative Biology, Delhi, 110020, India</aff>
                <aff id="a5">
                    <label>5</label>School of Environment and Life Sciences, University of Salford, Manchester, UK</aff>
            </contrib-group>
            <author-notes>
                <corresp id="c1">
                    <label>a</label>
                    <email xlink:href="mailto:arijit.mukhopadhyay@gmail.com">arijit.mukhopadhyay@gmail.com</email>
                </corresp>
                <fn id="fn1">
                    <p>&#x2020;Equal contribution</p>
                </fn>
                <fn fn-type="con">
                    <p>Anchal Sharma, Asgar Hussain Ansari, Bharati Mehani and Bapu K. Desiraju has performed the entire analysis of exome data and subsequent analysis. Renu Kumari has performed the sample preparation for the NGS experiments. Rajesh Pandey has played crucial role in Mi-Seq validation of the somatic sites. Rakhshinda Rehman has helped in immunofluoscent and ELISA experiments with supetvision from Ulaganathan Mabalirajan and Anurag Agrawal. Arijit Mukhopadhyay has conceptualised the study, acquired the necessary funding and wrote the manuscript along with Anchal Sharma and Asgar Ansari.</p>
                </fn>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>12</day>
                <month>12</month>
                <year>2016</year>
            </pub-date>
            <pub-date pub-type="collection">
                <year>2016</year>
            </pub-date>
            <volume>5</volume>
            <elocation-id>2520</elocation-id>
            <history>
                <date date-type="accepted">
                    <day>8</day>
                    <month>12</month>
                    <year>2016</year>
                </date>
            </history>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2016 Sharma A et al.</copyright-statement>
                <copyright-year>2016</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <self-uri content-type="pdf" xlink:href="https://f1000research.com/articles/5-2520/pdf"/>
            <abstract>
                <p>Somatic variation in DNA can cause cells to deviate from the preordained genomic path in both disease and healthy conditions. Here, using exome sequencing of paired tissue samples, we show that the normal human brain harbors somatic single base variations measuring up to 0.48% of the total variations. Interestingly, about 64% of these somatic variations in the brain are expected to lead to non-synonymous changes, and as much as 87% of these represent G:C&gt;T:A transversion events. Further, the transversion events in the brain were mostly found in the frontal cortex, whereas the corpus callosum from the same individuals harbors the reference genotype. We found a significantly higher amount of 8-OHdG (oxidative stress marker) in the frontal cortex compared to the corpus callosum of the same subjects (p&lt;0.01), correlating with the higher G:C&gt;T:A transversions in the cortex. We found significant enrichment for axon guidance and related pathways for genes harbouring somatic variations. This could represent either a directed selection of genetic variations in these pathways or increased susceptibility of some loci towards oxidative stress. This study highlights that oxidative stress possibly influence single nucleotide somatic variations in normal human brain.</p>
            </abstract>
            <kwd-group kwd-group-type="author">
                <kwd>somatic variations</kwd>
                <kwd>exome sequencing</kwd>
                <kwd>oxidative stress</kwd>
                <kwd>8-OHdG</kwd>
                <kwd>brain</kwd>
                <kwd>SNVs</kwd>
                <kwd>axon guidance</kwd>
                <kwd>G:C&gt;T:A transversions.</kwd>
            </kwd-group>
            <funding-group>
                <funding-statement>The work is funded by the Council of Scientific and Industrial Research (CSIR), Government of India (grant number BSC-0123, Arijit Mukhopadhyay is a PI). In addition, BSC-0403 (imaging facility) and BSC-0121 (computing facility) are also acknowledged for central facilities. Anchal Sharma acknowledges the Department of Science and Technology (DST), Govt. of India for her INSPIRE fellowship. Funding for open access charge will be from BSC-0123 funded by the Council of Scientific and Industrial Research (CSIR), Government of India.  </funding-statement>
                <funding-statement>
                    <italic>The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.</italic>
                </funding-statement>
            </funding-group>
        </article-meta>
        <notes>
            <sec sec-type="version-changes">
                <label>Revised</label>
                <title>Amendments from Version 1</title>
                <p>In this new version we have added a couple of sentences in the discussion section taking into consideration the comments from the reviewers. Also, as there was a problem in the dataset 1 files, we have freshly uploaded them.</p>
            </sec>
        </notes>
    </front>
    <body>
        <sec sec-type="intro">
            <title>Introduction</title>
            <p>Somatic variations are an inevitable consequence of continuous cell divisions in multicellular complex organisms and can lead to genomic heterogeneity. Somatic variations can arise due to replication error with or without external environmental factors like mutagens, exposure to UV rays etc &#x2013; and accumulate over time as the organism ages. Depending on how early a somatic variation occurs in a particular cell lineage and the rate of division for that cell, somatic variations may clonally expand and cross the threshold of detection by genome sequencing technology. As reviewed by De, somatic variations can range from single nucleotides to whole chromosomes and can be found in both &#x2018;healthy&#x2019; and &#x2018;diseased&#x2019; tissues &#x2013; cancer being a unanimously accepted example
                <sup>
                    <xref ref-type="bibr" rid="ref-1">1</xref>
                </sup>. The contribution of somatic variations is widely reported in cases where the DNA from affected tissue was found to harbor causal mutations whereas they were absent in the DNA from peripheral blood
                <sup>
                    <xref ref-type="bibr" rid="ref-2">2</xref>&#x2013;
                    <xref ref-type="bibr" rid="ref-4">4</xref>
                </sup>. Mutation reversal due to somatic variation has also been reported in Mendelian diseases, indicating the stochastic nature of these variations
                <sup>
                    <xref ref-type="bibr" rid="ref-5">5</xref>
                </sup>. The rates of somatic variations have been a matter of debate, ranging between 10
                <sup>-4</sup> to 10
                <sup>-8</sup> per base-pair, per generation, depending on whether these estimates were genome-wide (lower estimates) or locus-specific (higher estimates)
                <sup>
                    <xref ref-type="bibr" rid="ref-6">6</xref>,
                    <xref ref-type="bibr" rid="ref-7">7</xref>
                </sup>. In addition, it has been speculated that rates of somatic variations might differ for different tissue-types and different developmental times but this remains to be clarified
                <sup>
                    <xref ref-type="bibr" rid="ref-1">1</xref>
                </sup>.</p>
            <p>Somatic variations acquired and accumulated during the course of development have been modelled, predicting a higher risk of cancer and neurodegenerative diseases. The multitude of possible outcomes would increase with increased complexity of the tissue type
                <sup>
                    <xref ref-type="bibr" rid="ref-8">8</xref>&#x2013;
                    <xref ref-type="bibr" rid="ref-10">10</xref>
                </sup>. Thus, somatic variations could be of great importance for an organ like mammalian brain, which has complex structural and functional organization, high plasticity, and limited regenerative capabilities. The extreme interconnectivity of cortical neurons permits a disproportionately large impact of small changes while retaining robust adaptive mechanisms. Recently, there have been attempts to explore the extent of somatic variations in diverse healthy human tissues at the level of microarray or deep sequencing based copy number changes
                <sup>
                    <xref ref-type="bibr" rid="ref-11">11</xref>,
                    <xref ref-type="bibr" rid="ref-12">12</xref>
                </sup>. Interestingly, normal human brain, especially in the neuron rich regions, has been shown to harbor a wide variety of somatic variations &#x2013; ranging from whole chromosomes
                <sup>
                    <xref ref-type="bibr" rid="ref-13">13</xref>,
                    <xref ref-type="bibr" rid="ref-14">14</xref>
                </sup>, large-scale retro transpositions
                <sup>
                    <xref ref-type="bibr" rid="ref-15">15</xref>,
                    <xref ref-type="bibr" rid="ref-16">16</xref>
                </sup>, and copy number variations at the single neuron level
                <sup>
                    <xref ref-type="bibr" rid="ref-17">17</xref>
                </sup>. Recent reports have also started revealing the importance of such variations in neurological disorders
                <sup>
                    <xref ref-type="bibr" rid="ref-10">10</xref>
                </sup>. But there have been very few systematic studies so far exploring the nature, extent and impact of somatic variations at the single nucleotide resolution in neuron-rich parts of the healthy human brain
                <sup>
                    <xref ref-type="bibr" rid="ref-18">18</xref>,
                    <xref ref-type="bibr" rid="ref-19">19</xref>
                </sup>.</p>
            <p>In this study, we have analyzed single nucleotide level somatic variations between frontal cortex (rich in neurons) and corpus callosum (lack neurons) from healthy individuals and correlated the findings with markers of oxidative stress.</p>
        </sec>
        <sec sec-type="methods">
            <title>Methods</title>
            <sec>
                <title>Sample collection and DNA isolation</title>
                <p>Paired tissues were taken from a total of nine individuals, with age ranging from 23 years to 45 years (
                    <xref ref-type="other" rid="SM1">Supplementary table S1</xref>). For four individuals  (post-mortem, road accident victims), tissue sections from two different parts of the brain viz. frontal cortex (FC) and corpus callosum (CC), were procured from NIMHANS Brain Bank, Bangalore, India. For the other five individuals, peripheral blood and saliva was taken from healthy volunteers representing circulatory cell types with high turnover and therefore high likelihood of spontaneous somatic variations. The project was approved by institutional human ethics committee  of CSIR-Institute of Genomics and Integrative Biology and adhered to the international ethical guidelines (Declaration of Helsinki). For brain tissues, DNA was isolated using Omniprep Genomic DNA isolation kit (G-Biosciences, USA) and DNA from two other cell types viz. leukocytes (from blood) and epithelial cells (saliva) were isolated using Qiagen DNA kit (Qiagen, USA) and Oragene Saliva kit (DNA Genotek, Canada), respectively using manufacturer recommended protocols.</p>
            </sec>
            <sec>
                <title>Exome sequencing of different tissues and data analysis</title>
                <p>Exome capture for the isolated DNA was done using Illumina TruSeq Exome capture kit (62 Mb) and further exome libraries were sequenced (100 bp paired end) using Illumina Hiseq 2000 (Illumina, USA). Exome sequencing was done using manufacturer recommended protocols. A total of ~1.5 billion reads were generated for all the 18 samples. 
                    <xref ref-type="other" rid="SM1">Supplementary figure S1</xref> represents the overall pipeline followed for the analysis of the data and further text explains each step in detail. Raw data was checked for per base quality score and reads having 80% bases with phred quality score 30 (Q30) and greater were carried forward for downstream analysis and rest were discarded. Also, last few low quality bases were trimmed from all the reads (4&#x2013;10 bases, depending upon the sample quality). This was done using Fastx (version 0.0.13) and FastQC (version 0.10.0). About 9&#x2013;14% of data was removed from each sample. After checking quality of the data in various aspects, reads (Read 1 and Read 2 for each sample) were aligned to the 
                    <ext-link ext-link-type="uri" xlink:href="http://hgdownload.cse.ucsc.edu/goldenPath/hg19/bigZips/chromFa.tar.gz">reference genome (hg19)</ext-link> using BWA (version 0.6.1)
                    <sup>
                        <xref ref-type="bibr" rid="ref-20">20</xref>
                    </sup> allowing for maximum 2 mismatches. More than 98% percent of the data was aligned to reference for each sample. Data was also checked for PCR duplicates and the same were removed. Only reads with mapping quality (MQ) more than 40 were taken forward for further analysis. The sequence depth for the samples ranged between 91&#x00d7;&#x2013;120&#x00d7; (average 100&#x00d7;) for the FC-CC samples and 25&#x00d7;&#x2013;86&#x00d7; (average 51&#x00d7;) for the blood saliva samples (
                    <xref ref-type="other" rid="SM1">Supplementary figure S2</xref>). Data has been deposited in the NCBI Sequence Read Archive and can be found at accession number SRP045655.</p>
            </sec>
            <sec>
                <title>Somatic variation analysis</title>
                <p>Varscan2
                    <sup>
                        <xref ref-type="bibr" rid="ref-21">21</xref>
                    </sup> (version 2.3.5) somatic module along with Samtools (version 0.1.18)
                    <sup>
                        <xref ref-type="bibr" rid="ref-22">22</xref>
                    </sup> was used to call somatic variations from all the paired samples (CC vs. FC and blood vs. saliva). Firstly .bam files were processed using Samtools for making .mpileup files which were further processed through Varscan2 to call somatic variations using somatic SNP module. Following parameters were considered while calling variations: 
                    <list list-type="bullet">
                        <list-item>
                            <p>Minimum coverage to call somatic variation at a locus was kept at 8 reads</p>
                        </list-item>
                        <list-item>
                            <p>Minimum variant frequency to call a heterozygote was kept at 0.1 of total reads for that position.</p>
                        </list-item>
                        <list-item>
                            <p>Minimum variant frequency to call a homozygote was kept at 0.90 of total reads for that position.</p>
                        </list-item>
                        <list-item>
                            <p>Variants with more than 90% strand bias were removed</p>
                        </list-item>
                        <list-item>
                            <p>Minimum base quality score was kept at 20</p>
                        </list-item>
                        <list-item>
                            <p>Minimum mapping quality of a read was kept at 40</p>
                        </list-item>
                    </list>
				</p>
                <p>After calling variations, the data was checked for read bias. In this filter we excluded variations for which all the reads with the variant allele were read only from one direction (either F1R2 or F2R1) using in house developed perl scripts. Variations with more than 90% reads from one strand were removed. Annotation of all the variations called was done using Annovar (version 2012-03-08)
                    <sup>
                        <xref ref-type="bibr" rid="ref-23">23</xref>
                    </sup> and VcfCodingSnps (v1.5). Sites falling in regions with 87% and above identity with another genomic region were also removed from the data. Sites were termed as somatic sites if they had different genotype in two tissues and somatic p-value was less than 0.05. Overall out of a total 371 somatic sites (with p&lt;0.05) in brain 93.8% had at least 4 reads for the variant allele. All the sites confidently called as same genotype in both tissue types (p&lt;0.05) were termed as germline variations. Further, somatic variations with supporting reads for the variant allele &gt;10% of total reads were considered as heterozygotes and those with &lt;5% reads supporting variant allele were called homozygotes. All the somatic sites with their details are provided as CSV files under &#x2018;Data availability&#x2019;. Percentage of somatic variations across different sample-pairs was calculated by the following formula: (number of somatic variations/number of total variations) &#x00d7; 100. Number of somatic sites were defined as sites with different genotype between two tissues of the same individual. and Nnumber of total variations was refers to, all sites that had varying genotype from the reference genome (hg19) in two tissues of the same individual.  We also used MuTect
                    <sup>
                        <xref ref-type="bibr" rid="ref-24">24</xref>
                    </sup> (version 1.4.4) variant caller to call somatic variations from exome sequencing data for the brain samples. Details of the concordance between the two platforms is described in 
                    <xref ref-type="other" rid="SM1">Supplementary table S2</xref>. It was found that up to 78.8% of the somatic sites called by Varscan2 were also called as somatic sites by MuTect. Concordant sites between the two software also showed an enrichment of G:C&gt;T:A transversions with FC harboring heterozygotes. Pathway analysis of somatic variations was done using Gene Set Enrichment Analysis
                    <sup>
                        <xref ref-type="bibr" rid="ref-25">25</xref>
                    </sup> (release 2.2.0). The total 359 genes harbouring somatic variations from all brain samples (from varscan2 dataset) were included for pathway analysis. Pathway analysis for the concordant sites also showed similar results.</p>
            </sec>
            <sec>
                <title>Detection of neurons in brain samples</title>
                <p>
                    <underline>Western blotting for NeuN:</underline> Total tissue lysates from FC and CC were made using radio-immuno precipitation assay (RIPA) buffer from 5 samples (3 samples viz. Brain_152, Brain_156 and Brain_202 that were sequenced and extra 2 samples viz. Brain_174 and Brain_119 that were not sequenced). Two extra samples were considered to emphasize that this contrast in NeuN between FC and CC tissue is true across all samples and is not specific to only the ones that were sequenced. Protein estimation was done using bicinchoninic acid assay method (BCA method) and 30 micro gm (ug) of protein was used for western blotting for NeuN, which was selected as marker for detecting neuronal cell bodies in FC and CC. Anti-NeuN monoclonal antibody (ab104224, Abcam), raised in mouse was used at a dilution of 1:1000. This antibody gives 2&#x2013;3 bands between 46&#x2013;48 kDa as per manufacturer data-sheet, which we also observed. GAPDH (2118L- Cell Signalling, 1:2000) was used as loading control.   For this experiment a 5% stacking gel and a 12% resolving gel were used (both with 30% acrylamide, all reagent were from Sigma-Aldrich, USA). Images were analyzed by Image J 1.48 version.</p>
                <p>
                    <underline>Immuno-Fluorescence for NeuN:</underline> The same antibody was used to do immuno-fluorescence in both the tissues. Immunofluorescence for NeuN was performed on 8 &#x00b5;m sections of FC or CC. Briefly, tissue sections were fixed in chilled acetone, permeabilized with 0.1% Triton X-100, blocked in 1.5% FBS, incubated with Anti-NeuN monoclonal antibody (1:500, 16&#x2013;18hrs at 4&#x00b0;C) and anti mouse Alexa 488-conjugated secondary antibody (1:200, 1hr at RT), mounted with DAPI containing solution before taking images using a confocal microscope (at 63&#x00d7;)  [Zeiss-510 Meta, Carl Zeiss, Zen-2009 software  (December 2010 release)].</p>
            </sec>
            <sec>
                <title>ELISA for 8-OHdG</title>
                <p>8-OHdG, oxidative DNA damage marker, was measured in lysates of FC and CC by competitive ELISA (Cayman, USA). Briefly, 3 &#x00b5;g lysates were incubated with conjugate and antibody for 18 hrs and developed using Ellman&#x2019;s reagent and absorbance readings was taken at 405 nm and results were expressed in pg/3 &#x00b5;g for each lysate.</p>
            </sec>
            <sec>
                <title>Amplicon sequencing for validation of somatic sites</title>
                <p>We randomly selected 20 sites from Brain_156 (15% of the total sites) due to availability of sufficient DNA and this pair having the highest number of somatic sites, for validation using Illumina MiSeq Low Sample protocol (LS). Amplicons ranging from 200 bp to 350 bp for 14 sites were generated using PCR. The PCRs were done for 35 cyles with annealing temperatures at either 62 or 58 degrees C (primer details are provided under &#x2018;Data availability&#x2019;). Further all the amplicons from each tissue-type were pooled together in equimolar ratios, resulting in 2 freshly prepared libraries. These pooled amplicons were then processed using LS protocol of MiSeq sample preparation kit. About 12 million total reads were obtained from the two libraries. Quality check of trimming 3&#x2019; ends of the reads and filtering reads with less than 80% bases with Q30 phred quality score was applied on the raw data. Further, frequency of the variant allele (as observed in HiSeq data) was checked in both the tissues (FC and CC) in MiSeq data. A site was considered validated if the variant allele was supported by minimum of 100 reads along with at least two fold differences in the variant allele frequencies between the two tissues. On an average read depth of 5000&#x00d7; was obtained across all sites captured.</p>
            </sec>
        </sec>
        <sec sec-type="results">
            <title>Results</title>
            <supplementary-material id="DS0" orientation="portrait" position="float" xlink:href="https://f1000researchdata.s3.amazonaws.com/datasets/9495/692cc83f-ebcb-4cc6-9d68-7e8f0a93039b_e57821d6-b359-407a-9e36-1bb500ca63f5_New_Dataset_1.zip">
                <label>Raw data for &#x2018;Human brain harbors single nucleotide somatic variations in functionally relevant genes possibly mediated by oxidative stress&#x2019;</label>
                <caption>
                    <p>README.txt contains a description of the files.</p>
                </caption>
            </supplementary-material>
            <p>We analyzed about 1.5 billion reads from whole exome sequencing of 18 samples each having ~60 Mb coverage and identified 2,49,607 single nucleotide variations (SNVs), with an average of ~27,700 sites per sample (
                <xref ref-type="other" rid="SM1">Supplementary figure S3</xref>). For technical confidence of the genotype calls all the samples were genotyped on Illumina Infinium 660W-Quad microarrays and we observed 99.8%&#x2013;99.9% genotype concordance between the NGS and the microarray data (
                <xref ref-type="other" rid="SM1">Supplementary table S3</xref>).</p>
            <sec>
                <title>Human brain harbors higher somatic variations compared to circulatory tissue with a bias for non-synonymous changes</title>
                <p>The number of high-confidence somatic variation calls seen for each pair ranged between 32&#x2013;132 for the FC-CC pairs while it varied between 13&#x2013;35 for the blood-saliva pairs (
                    <xref ref-type="fig" rid="f1">Figure 1</xref>). The percentage of somatic variations ranged between 0.1%&#x2013;0.48% for the brain sample-pairs (FC-CC) and between 0.03%&#x2013;0.17% for the blood-saliva pairs. The number of germline variations across all samples was comparable (
                    <xref ref-type="other" rid="SM1">Supplementary figure S3</xref>). The observed somatic variations do not show any bias for the position of the variant loci. The distribution was 37&#x2013;65% in CDS, up to 57% in 3&#x2019; and 5&#x2019; UTRs and a minor proportion from other regions such as non-coding RNAs, splice sites etc. A comparison of these distributions between germline and somatic variations is represented in 
                    <xref ref-type="other" rid="SM1">Supplementary figure S4</xref>.</p>
                <fig fig-type="figure" id="f1" orientation="portrait" position="float">
                    <label>Figure 1. </label>
                    <caption>
                        <title>Number of somatic variations in brain and blood saliva samples.</title>
                        <p>Blue bars represent brain samples and red bars represent blood-saliva samples.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/11221/dac1ea89-70b2-4c85-9421-09586747a719_figure1.gif"/>
                </fig>
                <p>As much as 64% of the somatic variations in the brain samples would lead to non-synonymous changes at the protein level (
                    <xref ref-type="fig" rid="f2">Figure 2A</xref>). Whereas, for the germline variations the trend was towards higher synonymous variations as expected (
                    <xref ref-type="fig" rid="f2">Figure 2B</xref>). For the blood-saliva pairs such a consistent trend towards non-synonymous variations for the somatic sites was not observed (
                    <xref ref-type="other" rid="SM1">Supplementary figure S5</xref>). We analyzed the possible effect of the somatic sites at the amino-acid level for all samples and did not find a consistent bias for any particular amino acid change (
                    <xref ref-type="other" rid="SM1">Supplementary figure S6</xref>).</p>
                <fig fig-type="figure" id="f2" orientation="portrait" position="float">
                    <label>Figure 2. </label>
                    <caption>
                        <p>Distribution of synonymous and non-synonymous variations in brain samples 
                            <bold>A</bold>) for somatic variations and 
                            <bold>B</bold>) for germline variations.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/11221/dac1ea89-70b2-4c85-9421-09586747a719_figure2.gif"/>
                </fig>
            </sec>
            <sec>
                <title>Enrichment of G:C&gt;T:A somatic transversions in the brain</title>
                <p>Having observed somatic variation in the brain with a trend towards higher non-synonymous changes, we analyzed the level of transversion over transition amongst the somatic sites. Up to 87% of the total somatic variations found in FC-CC pairs were G:C&gt;T:A transversions (
                    <xref ref-type="fig" rid="f3">Figure 3A</xref>)! The germline variations (for all samples) matched the expected and reported distribution where A:T&gt;G:C (36%) and G:C&gt;A:T (38%) transitions were the most common types of changes (
                    <xref ref-type="fig" rid="f3">Figure 3B</xref>). In 
                    <xref ref-type="fig" rid="f3">Figure 3C</xref> we have represented the enrichment of G:C&gt;T:A transversion events as a proportion of somatic variations, with respect to the germline variations. The blood-saliva pairs did not show such consistent positive enrichment for the same class of transversion events (
                    <xref ref-type="other" rid="SM1">Supplementary figure S7</xref>).</p>
                <fig fig-type="figure" id="f3" orientation="portrait" position="float">
                    <label>Figure 3. </label>
                    <caption>
                        <title>Enrichment of G:C&gt;T:A somatic transversion events in brain.</title>
                        <p>For each pair of FC-CC samples the absolute numbers of different types of changes are plotted for somatic (
                            <bold>A</bold>) and germline variations (
                            <bold>B</bold>). (
                            <bold>C</bold>) The difference in proportion between somatic SNVs to germline SNVs is plotted for the brain samples. The positive values on the vertical axis denote enrichment of the type of variation in the somatic subset while a negative value indicates enrichment of the type of variation in the germline subset. As evident in the figure only the G:C&gt;T:A transversions show enrichment for the somatic variations in brain.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/11221/dac1ea89-70b2-4c85-9421-09586747a719_figure3.gif"/>
                </fig>
                <p>Surprisingly, for the G:C&gt;T:A transversion sites, 70&#x2013;100% of GT and CA heterozygotes were present in the frontal cortex and the corresponding homozygotes were observed in the corpus callosum (
                    <xref ref-type="fig" rid="f4">Figure 4</xref>). For all other sub-classes of somatic variations in brain and blood-saliva pairs the distribution of reference homozygotes versus heterozygotes did not show such a consistent and overriding bias (
                    <xref ref-type="other" rid="SM1">Supplementary figure S8</xref>).</p>
                <fig fig-type="figure" id="f4" orientation="portrait" position="float">
                    <label>Figure 4. </label>
                    <caption>
                        <title>Enrichment of GT and CA heterozygotes in the frontal cortex:</title>
                        <p>The figure shows that across the four pairs, for the somatic G:C&gt;T:A transversion sites majority of the heterozygotes (variant alleles) were present in the frontal cortex while the corresponding corpus callosum DNA had the homozygote (reference) genotype.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/11221/dac1ea89-70b2-4c85-9421-09586747a719_figure4.gif"/>
                </fig>
                <p>The genes harboring the somatic variations in all the brain samples were analyzed for canonical pathways. It was observed that the genes were enriched for axon guidance pathways and neuronal processes related pathways (
                    <xref ref-type="fig" rid="f5">Figure 5</xref>) with p-values ranging from 0.04 to 4.9&#x00d7;10
                    <sup>-8</sup> (FDR corrected). When analyzed by individual samples, three out of four samples also showed similar enrichment &#x2013; but for Brain_152 although the relevant genes were found in the dataset but due to small number of variations, enrichment could not be established. It has been shown that DSBs in neuronal cells tend to occur in long genes involved in neuronal functions
                    <sup>
                        <xref ref-type="bibr" rid="ref-26">26</xref>
                    </sup>. We also find a similar trend in our data: 46% of the pathway genes harboring somatic variations were long (&gt;100kb) compared to total genes (18% are &gt;100kb, p&lt;0.002).</p>
                <fig fig-type="figure" id="f5" orientation="portrait" position="float">
                    <label>Figure 5. </label>
                    <caption>
                        <title>Pathway enrichment analysis of genes harboring somatic variations:</title>
                        <p>Enrichment of genes harboring somatic variations in brain for pathways related to neuronal function. In the figure the horizontal axis shows the negative logarithm of FDR corrected p-value. Different biological processes are indicated on the left.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/11221/dac1ea89-70b2-4c85-9421-09586747a719_figure5.gif"/>
                </fig>
            </sec>
            <sec>
                <title>Increased accumulation of oxidative stress mediated modified guanine in frontal cortex</title>
                <p>The most common cause of G:C&gt;T:A transversions is mis-pairing of G to A (instead of G to C) due to modification of deoxy guanosine (dG) to 8-hydroxy-2&#x2019;-deoxy-Guanosine (8-OHdG) mediated by oxidative/metabolic stress
                    <sup>
                        <xref ref-type="bibr" rid="ref-27">27</xref>&#x2013;
                        <xref ref-type="bibr" rid="ref-29">29</xref>
                    </sup>. We found significantly higher levels of 8-OHdG in the frontal cortex samples, when compared to the corpus callosum of the same individuals (
                    <xref ref-type="fig" rid="f6">Figure 6</xref>), which corresponded with the abundance of neuronal cells in the frontal cortex (
                    <xref ref-type="fig" rid="f7">Figure 7</xref>). Thus an increased accumulation of 8-OHdG in the frontal cortex (compared to the corpus callosum of the same individual) DNA might result in localized DNA variations with a bias towards G:C&gt;T:A transversions. Using immunoblotting and immuno fluorescence staining on tissues against the neuron specific marker NeuN we confirmed that the majority of the cells in the FC samples were neurons whereas, CC samples were almost devoid of neurons (
                    <xref ref-type="fig" rid="f7">Figure 7</xref>) &#x2013; implicating a direct correlation of abundance of neuronal cells with accumulation of 8-OHdG leading to G to T somatic transversions in the FC samples. Further effect of postmortem could be ruled out as one of the contributing factors, as time of collection of tissues, storage conditions and DNA isolation protocol were exactly the same for both the tissues.</p>
                <fig fig-type="figure" id="f6" orientation="portrait" position="float">
                    <label>Figure 6. </label>
                    <caption>
                        <title>Estimation of 8-OHdG levels in different brain tissues using ELISA:</title>
                        <p>Frontal cortex samples show significantly higher 8-OHdG accumulation. The figure represents concentration of 8-OHdG in the frontal cortex and corpus callosum samples. For each tissue type five independent samples were analyzed by ELISA. The error bars represent standard deviation (SD for FC, 57.5 and SD for CC, 24.4). The p value is calculated using paired T-test.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/11221/dac1ea89-70b2-4c85-9421-09586747a719_figure6.gif"/>
                </fig>
                <fig fig-type="figure" id="f7" orientation="portrait" position="float">
                    <label>Figure 7. </label>
                    <caption>
                        <title>Frontal cortex is enriched for neuronal cells:</title>
                        <p>(
                            <bold>A</bold>): Western blot for NeuN in five pairs of FC-CC samples (for three of them sequence data is presented) shows distinct abundance of neurons in the FC in contrast to CC. GAPDH expression is used as a control. (
                            <bold>B</bold>): The same antibody is used in immuno-fluorescence to show the neuronal abundance in FC. Note the near absence of signal for NeuN in the corpus callosum panel.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/11221/dac1ea89-70b2-4c85-9421-09586747a719_figure7.gif"/>
                </fig>
            </sec>
            <sec>
                <title>Validation of somatic variations using targeted amplicon resequencing</title>
                <p>Amplicon sequencing using MiSeq (Illumina) was performed for validation of the somatic variations. Out of the total somatic sites across all brain samples 20 sites from sample number Brain_156 were chosen for validation. We have randomly selected 20 sites from 156 (15% of the total sites) due to availability of DNA and this pair having the highest number of somatic sites. Data for 14 sites was obtained and 10 out of these 14 sites showed difference in variant allele frequencies between the two tissues (CC vs FC) in accordance with the HiSeq data (71% validation rate) (
                    <xref ref-type="other" rid="SM1">Supplementary Table S4</xref>). Rows marked in red shade are the ones that got validated between two platforms. It should be noted that although the allele frequencies of the variations in the validation set were not exactly the same as in the HiSeq data but the trend (same tissue harboring the higher amount of variant allele) was same. In general, low variant allele frequencies were obtained in MiSeq data as compared to HiSeq data. This could be attributed to very high read depth per site in MiSeq data. A trend of varying allele frequencies with read depth was observed, as in, lower the read depth higher the allele frequencies (and vice versa), thereby explaining the high frequencies in HiSeq data (with average read depth of 100&#x00d7;) as compared to MiSeq data (with average read depth of 5000&#x00d7;).</p>
            </sec>
        </sec>
        <sec sec-type="discussion">
            <title>Discussion</title>
            <p>Here, we report somatic variations in normal human brain at single base resolution in whole exomes. Earlier studies have reported somatic genomic rearrangements such as, aneuploidy, insertions/deletions, CNVs etc in the neurons as part of the normal brain development and neurogenesis
                <sup>
                    <xref ref-type="bibr" rid="ref-9">9</xref>,
                    <xref ref-type="bibr" rid="ref-13">13</xref>&#x2013;
                    <xref ref-type="bibr" rid="ref-17">17</xref>,
                    <xref ref-type="bibr" rid="ref-30">30</xref>&#x2013;
                    <xref ref-type="bibr" rid="ref-32">32</xref>
                </sup>. In our data the percentage of somatic variants in brain is 0.1%&#x2013;0.48%, which was significantly enriched for genes in axon guidance (details below). It has been reported that somatic events present in low abundance of the cells can bring about striking phenotypic consequences in the brain
                <sup>
                    <xref ref-type="bibr" rid="ref-10">10</xref>,
                    <xref ref-type="bibr" rid="ref-33">33</xref>
                </sup>. We observed an enrichment of somatic non-synonymous variations, which was unique and was not found in variants common to both tissues (germline variations, 
                <xref ref-type="fig" rid="f2">Figure 2</xref>) &#x2013; implying a functional neutrality or advantage of such variants. In absence of additional tissues and parental information for the individuals, we cannot definitively distinguish between inherited and acquired genotypes.</p>
            <p>	We have estimated the error rates to rule out that most of our variations can arise due to errors in sequencing experiments. Towards this we have performed genome-wide genotyping of the same samples and genotype discordance (error rates) between the microarray and the sequencing data varied from 0.002 to 0.0008 (
                <xref ref-type="other" rid="SM1">Supplementary table S3</xref>). Our observed somatic site frequencies were higher than that. In addition, to further strengthen the criteria, we have chosen to only accept those sites as variants, where the variant allele was represented by at least 10% of the total reads for that position. We also performed  targeted amplicon sequencing to validate a subset of the somatic sites.</p>
            <p>We observed a higher proportion of somatic variations between the FC-CC samples compared with the proportion found between blood-saliva samples. A recent report studied somatic sites between brain and the blood of the same individual and found higher somatic sites in the blood
                <sup>
                    <xref ref-type="bibr" rid="ref-34">34</xref>
                </sup>. This apparent contrast in the two findings is perhaps due to the fact that in our study we did not compare between blood and brain of the same individuals. In addition, a lower proportion of somatic variations between DNA from blood and saliva can be due to, inter-mixing of the two cell types and/or faster regeneration and circulatory nature of these cells resulting in dilution of clonal populations of cells harboring the somatic sites.</p>
            <p>Our data shows all possible types of nucleotide changes amongst the somatic SNVs, as would be expected for a random event, but with an unexpected bias (up to 87%) for G:C&gt;T:A transversions (
                <xref ref-type="fig" rid="f3">Figure 3A</xref>). Moreover, more than 70% of these transversion events were found in the frontal cortex while DNA from the corpus callosum harbored the homozygotes for the reference alleles (
                <xref ref-type="fig" rid="f4">Figure 4</xref>). Recently, two studies
                <sup>
                    <xref ref-type="bibr" rid="ref-18">18</xref>,
                    <xref ref-type="bibr" rid="ref-19">19</xref>
                </sup> have looked into somatic SNVs at the single neuron level. Although, neither of these studies report enrichment of G:C&gt;T:A transversions, but these transversions are the second most abundant class of somatic variations as reported by Hazen 
                <italic toggle="yes">et al.</italic>
                <sup>
                    <xref ref-type="bibr" rid="ref-19">19</xref>
                </sup> (16% of total somatic variations) in their dataset. Moreover, the choice of tissue in these two studies (single neurons) and ours (frontal cortex v/s corpus callosum) is different, which might be the reason for differences between the classes of somatic variations observed.</p>
            <p>It is well known that G:C&gt;T:A transversions are mediated primarily by oxidative stress which modifies deoxy-guanine (dG) to 8-hydroxy-2&#x2019;-deoxy-Guanosine (8-OHdG)
                <sup>
                    <xref ref-type="bibr" rid="ref-27">27</xref>
                </sup>. Towards this when probed for oxidative stress levels in two tissues, we also found significantly higher levels of 8-OHdG in the frontal cortex compared to the same individual&#x2019;s corpus callosum correlating the enriched transversion events with increased oxidative stress (
                <xref ref-type="fig" rid="f6">Figure 6</xref>). A recent study reported G:C&gt;T:A transversions arising in sequencing data as an artefact due to DNA shearing stress
                <sup>
                    <xref ref-type="bibr" rid="ref-35">35</xref>
                </sup>. We have tested for this bias in both the germline and somatic datasets. A major fraction of the somatic sites initially called by our pipeline was observed to have this bias and were removed in the modified analysis workflow. However, it is still possible that in the remaining data-set presented here, some of the G:C&gt;T:A transversions are actually artefacts. Interestingly, even in such a possible scenario, these artefacts are not randomly observed in all tissue types analysed unlike the earlier report
                <sup>
                    <xref ref-type="bibr" rid="ref-35">35</xref>
                </sup>. Instead, we observed that almost exclusively the GT and CA heterozygotes were in the FC samples. Further, the observation of higher 8-OHdG in FC was independent of the shearing stress as the experiments were performed on lysates isolated from fresh sections of the same tissue samples. Whether the specific cell types in FC make them more prone to either 
                <italic toggle="yes">in-vivo</italic> (biological) or 
                <italic toggle="yes">in-vitro</italic> (artefactual) stress mediated variations needs to be explored further. </p>
            <p>It is known that having an Adenine (A) 3&#x2019; to the oxidized G significantly reduces the efficiency of the repair process and thereby enhancing the possibility of a G&gt;T transversion
                <sup>
                    <xref ref-type="bibr" rid="ref-36">36</xref>
                </sup>. We also find a bias for 3&#x2019; Adenine for the somatic G:C&gt;T:A sites in our data (
                <xref ref-type="other" rid="SM1">Supplementary figure S9</xref>). As reported in the above-mentioned study, the artefactual C&gt;A transversions have an enrichment of C
                <underline>C</underline>G motif, which perhaps makes the base (underlined) more susceptible for oxidation
                <sup>
                    <xref ref-type="bibr" rid="ref-35">35</xref>
                </sup>. We did not observe any enrichment for this motif for the somatic sites found in our study.</p>
            <p>Our data indicates that normal brain accumulates single nucleotide somatic variations with age during the lifetime of an individual. This might happen due to various mechanisms, high oxidative stress generated during normal physiological brain activity, being one of them. Physiological levels of oxidative free radicals are essential in various key cellular processes such as cellular differentiation, proliferation and survival
                <sup>
                    <xref ref-type="bibr" rid="ref-37">37</xref>
                </sup> though pathological level is detrimental for cellular health. From our observed results, it seems that most of the variations appeared around the time of birth when neurons are rapidly dividing. It is known in the literature that the process of neurogenesis spans from E13 to E108
                <sup>
                    <xref ref-type="bibr" rid="ref-38">38</xref>
                </sup> and the number of neurons in an infant brain is in the order of 10
                <sup>10</sup> &#x2013; 10
                <sup>11</sup>. With our analysis threshold of at least 10% abundance of the variant allele, the variation should be present in 10
                <sup>6</sup>&#x2013;10
                <sup>7</sup> cells &#x2013; which is in order with the developmental time-frame. However, studies with more tissues (from brain and outside brain) from the same individuals would be needed to rule out other possible reasons. Oxidative stress is also induced during normal neurogenesis in adults
                <sup>
                    <xref ref-type="bibr" rid="ref-39">39</xref>
                </sup> and oxidative stress susceptible genetic alleles in drosophila are connected to axon guidance
                <sup>
                    <xref ref-type="bibr" rid="ref-40">40</xref>
                </sup>. A recent study showed that physiological levels of H
                <sub>2</sub>O
                <sub>2 </sub>are essential for neurogenesis. Their data revealed that exposure to H
                <sub>2</sub>O
                <sub>2</sub> mediated oxidative stress promoted neurogenesis of neural progenitor cells (NPC) in rat
                <sup>
                    <xref ref-type="bibr" rid="ref-41">41</xref>
                </sup>. In this context, interestingly, the somatic transversion events we found in brain samples were enriched for genes involved in processes like axonal guidance, neurogenesis etc. (
                <xref ref-type="fig" rid="f5">Figure 5</xref>). These indicate that the accumulation of somatic variations could be the possible molecular explanation for physiological oxidative stress mediated enhancement of neuronal differentiation from NPCs. Other linked processes like interaction between L1 and ankyrins, NCAM signaling, long term potentiation etc. were also found to be significantly enriched. These evidences indicate that the acquired somatic variations might provide required functional diversity in the growing neurons during development as well as during adult neurogenesis.</p>
        </sec>
        <sec sec-type="conclusions">
            <title>Conclusions</title>
            <p>Our study shows presence of somatic SNVs in functionally relevant genes in different parts of the brain possibly influenced by oxidative stress along with other known contributing factors. Recent reviews suggested that local somatic events could strike a balance between the plasticity and robustness of the genome indicating a continuum of normal-through-disease scenario
                <sup>
                    <xref ref-type="bibr" rid="ref-1">1</xref>
                </sup>. A study showed that oxidative stress mediated double-strand breaks (DSBs) in DNA of neuronal cells and its delayed repair was a feature of normal mice brain related with its learning ability
                <sup>
                    <xref ref-type="bibr" rid="ref-42">42</xref>
                </sup>. On similar lines our study also indicates that the acquired somatic variations might provide required functional diversity during development as well as during adult neurogenesis.</p>
        </sec>
        <sec>
            <title>Data availability</title>
            <p>The data referenced by this article are under copyright with the following copyright statement: Copyright: &#x00ef;&#x00bf;&#x00bd; 2016 Sharma A et al.</p>
            <p>Data associated with the article are available under the terms of the Creative Commons Zero "No rights reserved" data waiver (CC0 1.0 Public domain dedication).
                <ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/publicdomain/zero/1.0/"/>
            </p>
            <p>Data has been deposited in the NCBI Sequence Read Archive under accession number 
                <ext-link ext-link-type="uri" xlink:href="http://trace.ncbi.nlm.nih.gov/Traces/study/?acc=SRP045655">SRP045655</ext-link>.</p>
            <p>
				
                <italic toggle="yes">F1000Research</italic>: Dataset 1. Raw data for &#x2018;Human brain harbors single nucleotide somatic variations in functionally relevant genes possibly mediated by oxidative stress&#x2019;. 
                <ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.5256/f1000research.9495.d135507">10.5256/f1000research.9495.d135507</ext-link>
                <sup>
                    <xref ref-type="bibr" rid="ref-43">43</xref>
                </sup>
            </p>
        </sec>
    </body>
    <back>
        <ack>
            <title>Acknowledgement</title>
            <p>We acknowledge the Human Brain Tissue Repository at the National Institute of Mental Health and Neuro Sciences (NIMHANS), Bengaluru, India for providing the brain samples. We thank Dr. Mohammed Faruq, Ms. Kiran Narta and Mr. Pradeep Tiwari for their technical help in the next generation sequencing experiments and statistical analyses. Mr. Manish Kumar is acknowledged for technical help in imaging and the high-performance computing facility at CSIR-IGIB is also acknowledged for the computational infrastructure. We thank Dr. Nidhan K. Biswas from the National Institute of Bio-Medical Genomics, West Bengal, India for critical comments on the analysis of the exome data. Dr. Sheetal Gandotra and Ms. Mansi Vishal are acknowledged for their critical comments on the manuscript.</p>
        </ack>
        <sec id="SM1" sec-type="supplementary-material">
            <title>Supplementary tables and figures</title>
            <p>
                <ext-link ext-link-type="uri" xlink:href="https://f1000researchdata.s3.amazonaws.com/supplementary/9495/995053d4-b9c3-4ed5-bd6f-e2799d80c632.docx">Click here to access the data</ext-link>.</p>
        </sec>
        <ref-list>
            <ref id="ref-1">
                <label>1</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>De</surname>
                            <given-names>S</given-names>
                        </name>
					</person-group>:
                    <article-title>Somatic mosaicism in healthy human tissues.</article-title>
                    <source>
						
                        <italic toggle="yes">Trends Genet.</italic>
					</source>
                    <year>2011</year>;<volume>27</volume>(<issue>6</issue>):<fpage>217</fpage>&#x2013;<lpage>223</lpage>.
                    <pub-id pub-id-type="pmid">21496937</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.tig.2011.03.002</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-2">
                <label>2</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Ellis</surname>
                            <given-names>NA</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Ciocci</surname>
                            <given-names>S</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>German</surname>
                            <given-names>J</given-names>
                        </name>
					</person-group>:
                    <article-title>Back mutation can produce phenotype reversion in Bloom syndrome somatic cells.</article-title>
                    <source>
						
                        <italic toggle="yes">Hum Genet.</italic>
					</source>
                    <year>2001</year>;<volume>108</volume>(<issue>2</issue>):<fpage>167</fpage>&#x2013;<lpage>173</lpage>.
                    <pub-id pub-id-type="pmid">11281456</pub-id>
                    <pub-id pub-id-type="doi">10.1007/s004390000447</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-3">
                <label>3</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Gottlieb</surname>
                            <given-names>B</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Chalifour</surname>
                            <given-names>LE</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Mitmaker</surname>
                            <given-names>B</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>BAK1 gene variation and abdominal aortic aneurysms.</article-title>
                    <source>
						
                        <italic toggle="yes">Hum Mutat.</italic>
					</source>
                    <year>2009</year>;<volume>30</volume>(<issue>7</issue>):<fpage>1043</fpage>&#x2013;<lpage>1047</lpage>.
                    <pub-id pub-id-type="pmid">19514060</pub-id>
                    <pub-id pub-id-type="doi">10.1002/humu.21046</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-4">
                <label>4</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Lindhurst</surname>
                            <given-names>MJ</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Sapp</surname>
                            <given-names>JC</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Teer</surname>
                            <given-names>JK</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>A mosaic activating mutation in 
                        <italic toggle="yes">AKT1</italic> associated with the Proteus syndrome.</article-title>
                    <source>
						
                        <italic toggle="yes">N Engl J Med.</italic>
					</source>
                    <year>2011</year>;<volume>365</volume>(<issue>7</issue>):<fpage>611</fpage>&#x2013;<lpage>619</lpage>.
                    <pub-id pub-id-type="pmid">21793738</pub-id>
                    <pub-id pub-id-type="doi">10.1056/NEJMoa1104017</pub-id>
                    <pub-id pub-id-type="pmcid">3170413</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-5">
                <label>5</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Gregory JJ</surname>
                            <given-names>Jr</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Wagner</surname>
                            <given-names>JE</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Verlander</surname>
                            <given-names>PC</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Somatic mosaicism in Fanconi anemia: evidence of genotypic reversion in lymphohematopoietic stem cells.</article-title>
                    <source>
						
                        <italic toggle="yes">Proc Natl Acad Sci U S A.</italic>
					</source>
                    <year>2001</year>;<volume>98</volume>(<issue>5</issue>):<fpage>2532</fpage>&#x2013;<lpage>2537</lpage>.
                    <pub-id pub-id-type="pmid">11226273</pub-id>
                    <pub-id pub-id-type="doi">10.1073/pnas.051609898</pub-id>
                    <pub-id pub-id-type="pmcid">30172</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-6">
                <label>6</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Araten</surname>
                            <given-names>DJ</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Golde</surname>
                            <given-names>DW</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Zhang</surname>
                            <given-names>RH</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>A quantitative measurement of the human somatic mutation rate.</article-title>
                    <source>
						
                        <italic toggle="yes">Cancer Res.</italic>
					</source>
                    <year>2005</year>;<volume>65</volume>(<issue>18</issue>):<fpage>8111</fpage>&#x2013;<lpage>8117</lpage>.
                    <pub-id pub-id-type="pmid">16166284</pub-id>
                    <pub-id pub-id-type="doi">10.1158/0008-5472.CAN-04-1198</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-7">
                <label>7</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Baer</surname>
                            <given-names>CF</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Miyamoto</surname>
                            <given-names>MM</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Denver</surname>
                            <given-names>DR</given-names>
                        </name>
					</person-group>:
                    <article-title>Mutation rate variation in multicellular eukaryotes: causes and consequences.</article-title>
                    <source>
						
                        <italic toggle="yes">Nat Rev Genet.</italic>
					</source>
                    <year>2007</year>;<volume>8</volume>(<issue>8</issue>):<fpage>619</fpage>&#x2013;<lpage>631</lpage>.
                    <pub-id pub-id-type="pmid">17637734</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nrg2158</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-8">
                <label>8</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Kennedy</surname>
                            <given-names>SR</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Loeb</surname>
                            <given-names>LA</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Herr</surname>
                            <given-names>AJ</given-names>
                        </name>
					</person-group>:
                    <article-title>Somatic mutations in aging, cancer and neurodegeneration.</article-title>
                    <source>
						
                        <italic toggle="yes">Mech Ageing Dev.</italic>
					</source>
                    <year>2012</year>;<volume>133</volume>(<issue>4</issue>):<fpage>118</fpage>&#x2013;<lpage>126</lpage>.
                    <pub-id pub-id-type="pmid">22079405</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.mad.2011.10.009</pub-id>
                    <pub-id pub-id-type="pmcid">3325357</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-9">
                <label>9</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Kim</surname>
                            <given-names>J</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Shin</surname>
                            <given-names>JY</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Kim</surname>
                            <given-names>JI</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Somatic deletions implicated in functional diversity of brain cells of individuals with schizophrenia and unaffected controls.</article-title>
                    <source>
						
                        <italic toggle="yes">Sci Rep.</italic>
					</source>
                    <year>2014</year>;<volume>4</volume>:<fpage>3807</fpage>.
                    <pub-id pub-id-type="pmid">24448323</pub-id>
                    <pub-id pub-id-type="doi">10.1038/srep03807</pub-id>
                    <pub-id pub-id-type="pmcid">3897957</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-10">
                <label>10</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Poduri</surname>
                            <given-names>A</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Evrony</surname>
                            <given-names>GD</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Cai</surname>
                            <given-names>X</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Somatic mutation, genomic variation, and neurological disease.</article-title>
                    <source>
						
                        <italic toggle="yes">Science.</italic>
					</source>
                    <year>2013</year>;<volume>341</volume>(<issue>6141</issue>):<fpage>1237758</fpage>.
                    <pub-id pub-id-type="pmid">23828942</pub-id>
                    <pub-id pub-id-type="doi">10.1126/science.1237758</pub-id>
                    <pub-id pub-id-type="pmcid">3909954</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-11">
                <label>11</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>O'Huallachain</surname>
                            <given-names>M</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Karczewski</surname>
                            <given-names>KJ</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Weissman</surname>
                            <given-names>SM</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Extensive genetic variation in somatic human tissues.</article-title>
                    <source>
						
                        <italic toggle="yes">Proc Natl Acad Sci U S A.</italic>
					</source>
                    <year>2012</year>;<volume>109</volume>(<issue>44</issue>):<fpage>18018</fpage>&#x2013;<lpage>18023</lpage>.
                    <pub-id pub-id-type="pmid">23043118</pub-id>
                    <pub-id pub-id-type="doi">10.1073/pnas.1213736109</pub-id>
                    <pub-id pub-id-type="pmcid">3497787</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-12">
                <label>12</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Piotrowski</surname>
                            <given-names>A</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Bruder</surname>
                            <given-names>CE</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Andersson</surname>
                            <given-names>R</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Somatic mosaicism for copy number variation in differentiated human tissues.</article-title>
                    <source>
						
                        <italic toggle="yes">Hum Mutat.</italic>
					</source>
                    <year>2008</year>;<volume>29</volume>(<issue>9</issue>):<fpage>1118</fpage>&#x2013;<lpage>1124</lpage>.
                    <pub-id pub-id-type="pmid">18570184</pub-id>
                    <pub-id pub-id-type="doi">10.1002/humu.20815</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-13">
                <label>13</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Rehen</surname>
                            <given-names>SK</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Yung</surname>
                            <given-names>YC</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>McCreight</surname>
                            <given-names>MP</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Constitutional aneuploidy in the normal human brain.</article-title>
                    <source>
						
                        <italic toggle="yes">J Neurosci.</italic>
					</source>
                    <year>2005</year>;<volume>25</volume>(<issue>9</issue>):<fpage>2176</fpage>&#x2013;<lpage>2180</lpage>.
                    <pub-id pub-id-type="pmid">15745943</pub-id>
                    <pub-id pub-id-type="doi">10.1523/JNEUROSCI.4560-04.2005</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-14">
                <label>14</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Yurov</surname>
                            <given-names>YB</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Iourov</surname>
                            <given-names>IY</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Vorsanova</surname>
                            <given-names>SG</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Aneuploidy and confined chromosomal mosaicism in the developing human brain.</article-title>
                    <source>
						
                        <italic toggle="yes">PLoS One.</italic>
					</source>
                    <year>2007</year>;<volume>2</volume>(<issue>6</issue>):<fpage>e558</fpage>.
                    <pub-id pub-id-type="pmid">17593959</pub-id>
                    <pub-id pub-id-type="doi">10.1371/journal.pone.0000558</pub-id>
                    <pub-id pub-id-type="pmcid">1891435</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-15">
                <label>15</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Coufal</surname>
                            <given-names>NG</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Garcia-Perez</surname>
                            <given-names>JL</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Peng</surname>
                            <given-names>GE</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>L1 retrotransposition in human neural progenitor cells.</article-title>
                    <source>
						
                        <italic toggle="yes">Nature.</italic>
					</source>
                    <year>2009</year>;<volume>460</volume>(<issue>7259</issue>):<fpage>1127</fpage>&#x2013;<lpage>1131</lpage>.
                    <pub-id pub-id-type="pmid">19657334</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nature08248</pub-id>
                    <pub-id pub-id-type="pmcid">2909034</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-16">
                <label>16</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Kano</surname>
                            <given-names>H</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Godoy</surname>
                            <given-names>I</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Courtney</surname>
                            <given-names>C</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>L1 retrotransposition occurs mainly in embryogenesis and creates somatic mosaicism.</article-title>
                    <source>
						
                        <italic toggle="yes">Genes Dev.</italic>
					</source>
                    <year>2009</year>;<volume>23</volume>(<issue>11</issue>):<fpage>1303</fpage>&#x2013;<lpage>1312</lpage>.
                    <pub-id pub-id-type="pmid">19487571</pub-id>
                    <pub-id pub-id-type="doi">10.1101/gad.1803909</pub-id>
                    <pub-id pub-id-type="pmcid">2701581</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-17">
                <label>17</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>McConnell</surname>
                            <given-names>MJ</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Lindberg</surname>
                            <given-names>MR</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Brennand</surname>
                            <given-names>KJ</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Mosaic copy number variation in human neurons.</article-title>
                    <source>
						
                        <italic toggle="yes">Science.</italic>
					</source>
                    <year>2013</year>;<volume>342</volume>(<issue>6158</issue>):<fpage>632</fpage>&#x2013;<lpage>637</lpage>.
                    <pub-id pub-id-type="pmid">24179226</pub-id>
                    <pub-id pub-id-type="doi">10.1126/science.1243472</pub-id>
                    <pub-id pub-id-type="pmcid">3975283</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-18">
                <label>18</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Lodato</surname>
                            <given-names>MA</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Woodworth</surname>
                            <given-names>MB</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Lee</surname>
                            <given-names>S</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Somatic mutation in single human neurons tracks developmental and transcriptional history.</article-title>
                    <source>
						
                        <italic toggle="yes">Science.</italic>
					</source>
                    <year>2015</year>;<volume>350</volume>(<issue>6256</issue>):<fpage>94</fpage>&#x2013;<lpage>98</lpage>.
                    <pub-id pub-id-type="pmid">26430121</pub-id>
                    <pub-id pub-id-type="doi">10.1126/science.aab1785</pub-id>
                    <pub-id pub-id-type="pmcid">4664477</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-19">
                <label>19</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Hazen</surname>
                            <given-names>JL</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Faust</surname>
                            <given-names>GG</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Rodriguez</surname>
                            <given-names>AR</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>The Complete Genome Sequences, Unique Mutational Spectra, and Developmental Potency of Adult Neurons Revealed by Cloning.</article-title>
                    <source>
						
                        <italic toggle="yes">Neuron.</italic>
					</source>
                    <year>2016</year>;<volume>89</volume>(<issue>6</issue>):<fpage>1223</fpage>&#x2013;<lpage>1236</lpage>.
                    <pub-id pub-id-type="pmid">26948891</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.neuron.2016.02.004</pub-id>
                    <pub-id pub-id-type="pmcid">4795965</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-20">
                <label>20</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Li</surname>
                            <given-names>H</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Durbin</surname>
                            <given-names>R</given-names>
                        </name>
					</person-group>:
                    <article-title>Fast and accurate short read alignment with Burrows-Wheeler transform.</article-title>
                    <source>
						
                        <italic toggle="yes">Bioinformatics.</italic>
					</source>
                    <year>2009</year>;<volume>25</volume>(<issue>14</issue>):<fpage>1754</fpage>&#x2013;<lpage>1760</lpage>.
                    <pub-id pub-id-type="pmid">19451168</pub-id>
                    <pub-id pub-id-type="doi">10.1093/bioinformatics/btp324</pub-id>
                    <pub-id pub-id-type="pmcid">2705234</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-21">
                <label>21</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Koboldt</surname>
                            <given-names>DC</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Zhang</surname>
                            <given-names>Q</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Larson</surname>
                            <given-names>DE</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>VarScan 2: somatic mutation and copy number alteration discovery in cancer by exome sequencing.</article-title>
                    <source>
						
                        <italic toggle="yes">Genome Res.</italic>
					</source>
                    <year>2012</year>;<volume>22</volume>(<issue>3</issue>):<fpage>568</fpage>&#x2013;<lpage>576</lpage>.
                    <pub-id pub-id-type="pmid">22300766</pub-id>
                    <pub-id pub-id-type="doi">10.1101/gr.129684.111</pub-id>
                    <pub-id pub-id-type="pmcid">3290792</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-22">
                <label>22</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Li</surname>
                            <given-names>H</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Handsaker</surname>
                            <given-names>B</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Wysoker</surname>
                            <given-names>A</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>The Sequence Alignment/Map format and SAMtools.</article-title>
                    <source>
						
                        <italic toggle="yes">Bioinformatics.</italic>
					</source>
                    <year>2009</year>;<volume>25</volume>(<issue>16</issue>):<fpage>2078</fpage>&#x2013;<lpage>2079</lpage>.
                    <pub-id pub-id-type="pmid">19505943</pub-id>
                    <pub-id pub-id-type="doi">10.1093/bioinformatics/btp352</pub-id>
                    <pub-id pub-id-type="pmcid">2723002</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-23">
                <label>23</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Wang</surname>
                            <given-names>K</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Li</surname>
                            <given-names>M</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Hakonarson</surname>
                            <given-names>H</given-names>
                        </name>
					</person-group>:
                    <article-title>ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data.</article-title>
                    <source>
						
                        <italic toggle="yes">Nucleic Acids Res.</italic>
					</source>
                    <year>2010</year>;<volume>38</volume>(<issue>16</issue>):<fpage>e164</fpage>.
                    <pub-id pub-id-type="pmid">20601685</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gkq603</pub-id>
                    <pub-id pub-id-type="pmcid">2938201</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-24">
                <label>24</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Cibulskis</surname>
                            <given-names>K</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Lawrence</surname>
                            <given-names>MS</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Carter</surname>
                            <given-names>SL</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples.</article-title>
                    <source>
						
                        <italic toggle="yes">Nat Biotechnol.</italic>
					</source>
                    <year>2013</year>;<volume>31</volume>(<issue>3</issue>):<fpage>213</fpage>&#x2013;<lpage>219</lpage>.
                    <pub-id pub-id-type="pmid">23396013</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nbt.2514</pub-id>
                    <pub-id pub-id-type="pmcid">3833702</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-25">
                <label>25</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Subramanian</surname>
                            <given-names>A</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Tamayo</surname>
                            <given-names>P</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Mootha</surname>
                            <given-names>VK</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.</article-title>
                    <source>
						
                        <italic toggle="yes">Proc Natl Acad Sci U S A.</italic>
					</source>
                    <year>2005</year>;<volume>102</volume>(<issue>43</issue>):<fpage>15545</fpage>&#x2013;<lpage>15550</lpage>.
                    <pub-id pub-id-type="pmid">16199517</pub-id>
                    <pub-id pub-id-type="doi">10.1073/pnas.0506580102</pub-id>
                    <pub-id pub-id-type="pmcid">1239896</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-26">
                <label>26</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Wei</surname>
                            <given-names>PC</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Chang</surname>
                            <given-names>AN</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Kao</surname>
                            <given-names>J</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Long Neural Genes Harbor Recurrent DNA Break Clusters in Neural Stem/Progenitor Cells.</article-title>
                    <source>
						
                        <italic toggle="yes">Cell.</italic>
					</source>
                    <year>2016</year>;<volume>164</volume>(<issue>4</issue>):<fpage>644</fpage>&#x2013;<lpage>655</lpage>.
                    <pub-id pub-id-type="pmid">26871630</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.cell.2015.12.039</pub-id>
                    <pub-id pub-id-type="pmcid">4752721</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-27">
                <label>27</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Cheng</surname>
                            <given-names>KC</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Cahill</surname>
                            <given-names>DS</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Kasai</surname>
                            <given-names>H</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>8-Hydroxyguanine, an abundant form of oxidative DNA damage, causes G----T and A----C substitutions.</article-title>
                    <source>
						
                        <italic toggle="yes">J Biol Chem.</italic>
					</source>
                    <year>1992</year>;<volume>267</volume>(<issue>1</issue>):<fpage>166</fpage>&#x2013;<lpage>172</lpage>.
                    <pub-id pub-id-type="pmid">1730583</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-28">
                <label>28</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Ohno</surname>
                            <given-names>M</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Sakumi</surname>
                            <given-names>K</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Fukumura</surname>
                            <given-names>R</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>8-oxoguanine causes spontaneous 
                        <italic toggle="yes">de novo</italic> germline mutations in mice.</article-title>
                    <source>
						
                        <italic toggle="yes">Sci Rep.</italic>
					</source>
                    <year>2014</year>;<volume>4</volume>: 4689.
                    <pub-id pub-id-type="pmid">24732879</pub-id>
                    <pub-id pub-id-type="doi">10.1038/srep04689</pub-id>
                    <pub-id pub-id-type="pmcid">3986730</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-29">
                <label>29</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Shibutani</surname>
                            <given-names>S</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Takeshita</surname>
                            <given-names>M</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Grollman</surname>
                            <given-names>AP</given-names>
                        </name>
					</person-group>:
                    <article-title>Insertion of specific bases during DNA synthesis past the oxidation-damaged base 8-oxodg.</article-title>
                    <source>
						
                        <italic toggle="yes">Nature.</italic>
					</source>
                    <year>1991</year>;<volume>349</volume>(<issue>6308</issue>):<fpage>431</fpage>&#x2013;<lpage>434</lpage>.
                    <pub-id pub-id-type="pmid">1992344</pub-id>
                    <pub-id pub-id-type="doi">10.1038/349431a0</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-30">
                <label>30</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Faggioli</surname>
                            <given-names>F</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Wang</surname>
                            <given-names>T</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Vijg</surname>
                            <given-names>J</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Chromosome-specific accumulation of aneuploidy in the aging mouse brain.</article-title>
                    <source>
						
                        <italic toggle="yes">Hum Mol Genet.</italic>
					</source>
                    <year>2012</year>;<volume>21</volume>(<issue>24</issue>):<fpage>5246</fpage>&#x2013;<lpage>5253</lpage>.
                    <pub-id pub-id-type="pmid">22962300</pub-id>
                    <pub-id pub-id-type="doi">10.1093/hmg/dds375</pub-id>
                    <pub-id pub-id-type="pmcid">3510757</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-31">
                <label>31</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Mkrtchyan</surname>
                            <given-names>H</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Gross</surname>
                            <given-names>M</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Hinreiner</surname>
                            <given-names>S</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>The human genome puzzle - the role of copy number variation in somatic mosaicism.</article-title>
                    <source>
						
                        <italic toggle="yes">Curr Genomics.</italic>
					</source>
                    <year>2010</year>;<volume>11</volume>(<issue>6</issue>):<fpage>426</fpage>&#x2013;<lpage>431</lpage>.
                    <pub-id pub-id-type="pmid">21358987</pub-id>
                    <pub-id pub-id-type="doi">10.2174/138920210793176047</pub-id>
                    <pub-id pub-id-type="pmcid">3018723</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-32">
                <label>32</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Baillie</surname>
                            <given-names>JK</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Barnett</surname>
                            <given-names>MW</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Upton</surname>
                            <given-names>KR</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Somatic retrotransposition alters the genetic landscape of the human brain.</article-title>
                    <source>
						
                        <italic toggle="yes">Nature.</italic>
					</source>
                    <year>2011</year>;<volume>479</volume>(<issue>7374</issue>):<fpage>534</fpage>&#x2013;<lpage>537</lpage>.
                    <pub-id pub-id-type="pmid">22037309</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nature10531</pub-id>
                    <pub-id pub-id-type="pmcid">3224101</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-33">
                <label>33</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Poduri</surname>
                            <given-names>A</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Evrony</surname>
                            <given-names>GD</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Cai</surname>
                            <given-names>X</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Somatic activation of 
                        <italic toggle="yes">AKT3</italic> causes hemispheric developmental brain malformations.</article-title>
                    <source>
						
                        <italic toggle="yes">Neuron.</italic>
					</source>
                    <year>2012</year>;<volume>74</volume>(<issue>1</issue>):<fpage>41</fpage>&#x2013;<lpage>48</lpage>.
                    <pub-id pub-id-type="pmid">22500628</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.neuron.2012.03.010</pub-id>
                    <pub-id pub-id-type="pmcid">3460551</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-34">
                <label>34</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Holstege</surname>
                            <given-names>H</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Pfeiffer</surname>
                            <given-names>W</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Sie</surname>
                            <given-names>D</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Somatic mutations found in the healthy blood compartment of a 115-yr-old woman demonstrate oligoclonal hematopoiesis.</article-title>
                    <source>
						
                        <italic toggle="yes">Genome Res.</italic>
					</source>
                    <year>2014</year>;<volume>24</volume>(<issue>5</issue>):<fpage>733</fpage>&#x2013;<lpage>742</lpage>.
                    <pub-id pub-id-type="pmid">24760347</pub-id>
                    <pub-id pub-id-type="doi">10.1101/gr.162131.113</pub-id>
                    <pub-id pub-id-type="pmcid">4009603</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-35">
                <label>35</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Costello</surname>
                            <given-names>M</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Pugh</surname>
                            <given-names>TJ</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Fennell</surname>
                            <given-names>TJ</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Discovery and characterization of artifactual mutations in deep coverage targeted capture sequencing data due to oxidative DNA damage during sample preparation.</article-title>
                    <source>
						
                        <italic toggle="yes">Nucleic acids research.</italic>
					</source>
                    <year>2013</year>;<volume>41</volume>(<issue>6</issue>):<fpage>e67</fpage>.
                    <pub-id pub-id-type="pmid">23303777</pub-id>
                    <pub-id pub-id-type="doi">10.1093/nar/gks1443</pub-id>
                    <pub-id pub-id-type="pmcid">3616734</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-36">
                <label>36</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Hatahet</surname>
                            <given-names>Z</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Zhou</surname>
                            <given-names>M</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Reha-Krantz</surname>
                            <given-names>LJ</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>In search of a mutational hotspot.</article-title>
                    <source>
						
                        <italic toggle="yes">Proc Natl Acad Sci U S A.</italic>
					</source>
                    <year>1998</year>;<volume>95</volume>(<issue>15</issue>):<fpage>8556</fpage>&#x2013;<lpage>8561</lpage>.
                    <pub-id pub-id-type="pmid">9671716</pub-id>
                    <pub-id pub-id-type="pmcid">21114</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-37">
                <label>37</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Ray</surname>
                            <given-names>PD</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Huang</surname>
                            <given-names>BW</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Tsuji</surname>
                            <given-names>Y</given-names>
                        </name>
					</person-group>:
                    <article-title>Reactive oxygen species (ros) homeostasis and redox regulation in cellular signaling.</article-title>
                    <source>
						
                        <italic toggle="yes">Cell Signal.</italic>
					</source>
                    <year>2012</year>;<volume>24</volume>(<issue>5</issue>):<fpage>981</fpage>&#x2013;<lpage>990</lpage>.
                    <pub-id pub-id-type="pmid">22286106</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.cellsig.2012.01.008</pub-id>
                    <pub-id pub-id-type="pmcid">3454471</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-38">
                <label>38</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Stiles</surname>
                            <given-names>J</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Jernigan</surname>
                            <given-names>TL</given-names>
                        </name>
						</person-group>:
                    <article-title>The basics of brain development.</article-title>
                    <source>
						
                        <italic toggle="yes">Neuropsychol Rev.</italic>
					</source>
                    <year>2010</year>;<volume>20</volume>(<issue>4</issue>):<fpage>327</fpage>&#x2013;<lpage>48</lpage>.
                    <pub-id pub-id-type="pmid">21042938</pub-id>
                    <pub-id pub-id-type="doi">10.1007/s11065-010-9148-4</pub-id>
                    <pub-id pub-id-type="pmcid">2989000</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-39">
                <label>39</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Walton</surname>
                            <given-names>NM</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Shin</surname>
                            <given-names>R</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Tajinda</surname>
                            <given-names>K</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Adult neurogenesis transiently generates oxidative stress.</article-title>
                    <source>
						
                        <italic toggle="yes">PLoS One.</italic>
					</source>
                    <year>2012</year>;<volume>7</volume>(<issue>4</issue>):<fpage>e35264</fpage>.
                    <pub-id pub-id-type="pmid">22558133</pub-id>
                    <pub-id pub-id-type="doi">10.1371/journal.pone.0035264</pub-id>
                    <pub-id pub-id-type="pmcid">3340368</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-40">
                <label>40</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Jordan</surname>
                            <given-names>KW</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Craver</surname>
                            <given-names>KL</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Magwire</surname>
                            <given-names>MM</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Genome-wide association for sensitivity to chronic oxidative stress in 
                        <italic toggle="yes">Drosophila melanogaster.</italic>
                    </article-title>
                    <source>
						
                        <italic toggle="yes">PLoS One.</italic>
					</source>
                    <year>2012</year>;<volume>7</volume>(<issue>6</issue>):<fpage>e38722</fpage>.
                    <pub-id pub-id-type="pmid">22715409</pub-id>
                    <pub-id pub-id-type="doi">10.1371/journal.pone.0038722</pub-id>
                    <pub-id pub-id-type="pmcid">3371005</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-41">
                <label>41</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Perez Estrada</surname>
                            <given-names>C</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Covacu</surname>
                            <given-names>R</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Sankavaram</surname>
                            <given-names>SR</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Oxidative stress increases neurogenesis and oligodendrogenesis in adult neural progenitor cells.</article-title>
                    <source>
						
                        <italic toggle="yes">Stem Cells Dev.</italic>
					</source>
                    <year>2014</year>;<volume>23</volume>(<issue>19</issue>):<fpage>2311</fpage>&#x2013;<lpage>2327</lpage>.
                    <pub-id pub-id-type="pmid">24773127</pub-id>
                    <pub-id pub-id-type="doi">10.1089/scd.2013.0452</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-42">
                <label>42</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Suberbielle</surname>
                            <given-names>E</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Sanchez</surname>
                            <given-names>PE</given-names>
                        </name>
						
                        <name name-style="western">
                            <surname>Kravitz</surname>
                            <given-names>AV</given-names>
                        </name>
						
                        <etal/>
					</person-group>:
                    <article-title>Physiologic brain activity causes DNA double-strand breaks in neurons, with exacerbation by amyloid-&#x03b2;.</article-title>
                    <source>
						
                        <italic toggle="yes">Nat Neurosci.</italic>
					</source>
                    <year>2013</year>;<volume>16</volume>(<issue>5</issue>):<fpage>613</fpage>&#x2013;<lpage>621</lpage>.
                    <pub-id pub-id-type="pmid">23525040</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nn.3356</pub-id>
                    <pub-id pub-id-type="pmcid">3637871</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-43">
                <label>43</label>
                <mixed-citation publication-type="data">
                    <person-group person-group-type="author">
						
                        <name name-style="western">
                            <surname>Sharma</surname>
                            <given-names>A</given-names>
                        </name>
						
                        <collab>Genomics &amp; Molecular Medicine Unit, CSIR-Institute of Genomics &amp; Integrative Biology, Delhi - 110020, India, Academy of Scientific and Innovative Research, CSIR-Institute of Genomics &amp; Integrative Biology (AcSIR-IGIB), Delhi - 110020, India</collab>
						
                        <etal/>
					</person-group>:
                    <article-title>Dataset 1 in: Human brain harbors single nucleotide somatic variations in functionally relevant genes possibly mediated by oxidative stress.</article-title>
                    <source>
						
                        <italic toggle="yes">F1000Research.</italic>
					</source>
                    <year>2016</year>.
                    <ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.5256/f1000research.9495.d135507">Data Source</ext-link>
                </mixed-citation>
            </ref>
        </ref-list>
    </back>
    <sub-article article-type="reviewer-report" id="report18425">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.11221.r18425</article-id>
            <title-group>
                <article-title>Reviewer response for version 2</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Sakumi</surname>
                        <given-names>Kunihiko</given-names>
                    </name>
                    <xref ref-type="aff" rid="r18425a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r18425a1">
                    <label>1</label>Division of Neurofunctional Genomics, Kyushu University, Fukuoka, Japan</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>13</day>
                <month>12</month>
                <year>2016</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2016 Sakumi K</copyright-statement>
                <copyright-year>2016</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport18425" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.9495.2"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The revised version of manuscript has been modified almost appropriately.</p>
            <p> But, still I could not read the 9 somatic variation .csv files in Dataset 1, probably due to font problem. I recommend that the authors convert the csv file into pdf (or ppt) files for worldwide readers.</p>
            <p> </p>
            <p> 
                <ext-link ext-link-type="uri" xlink:href="https://f1000researchdata.s3.amazonaws.com/supplementary/9495/d337a99d-c16f-4498-89ef-afe06ae7ca6c.png">See attached image</ext-link>
            </p>
            <p>Reviewer Expertise:</p>
            <p>NA</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
        <sub-article article-type="response" id="comment2393-18425">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>mukhopadhyay</surname>
                            <given-names>arijit</given-names>
                        </name>
                        <aff>Institute of Genomics and Integrative Biology, Delhi, India</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No Competing Interests.</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>31</day>
                    <month>12</month>
                    <year>2016</year>
                </pub-date>
            </front-stub>
            <body>
                <p>We are sorry about the technical difficulty in reading the file. There seems to be a problem of file conversion. We have now uploaded a new .csv version of the file.</p>
            </body>
        </sub-article>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report17435">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.10226.r17435</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Maitra</surname>
                        <given-names>Arindam</given-names>
                    </name>
                    <xref ref-type="aff" rid="r17435a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0002-0249-8856</uri>
                </contrib>
                <aff id="r17435a1">
                    <label>1</label>National Institute of Biomedical Genomics, Kalyani, India</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>7</day>
                <month>11</month>
                <year>2016</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2016 Maitra A</copyright-statement>
                <copyright-year>2016</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport17435" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.9495.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The manuscript is well written. However, I would like to point out that oxoG artefacts can also be generated during fragmentation of DNA by sonication (standard step in the method followed by authors). Given that only 71% of these mutations could be verified (the verification was not by an orthogonal method but the same sequencing technology), I am not sure if there are some residual artefacts in the final data set (even after all filteration and taking into account the fact that both tissue and blood DNA were subjected to the same method).</p>
            <p>Reviewer Expertise:</p>
            <p>NA</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
        <sub-article article-type="response" id="comment2348-17435">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>mukhopadhyay</surname>
                            <given-names>arijit</given-names>
                        </name>
                        <aff>Institute of Genomics and Integrative Biology, Delhi, India</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>5</day>
                    <month>12</month>
                    <year>2016</year>
                </pub-date>
            </front-stub>
            <body>
                <p>Rev Comment: The manuscript is well written. However, I would like to point out that oxoG artefacts can also be generated during fragmentation of DNA by sonication (standard step in the method followed by authors). Given that only 71% of these mutations could be verified (the verification was not by an orthogonal method but the same sequencing technology), I am not sure if there are some residual artefacts in the final data set (even after all filteration and taking into account the fact that both tissue and blood DNA were subjected to the same method).</p>
                <p> </p>
                <p> Ans:&#x00a0;We understand the reviewer&#x2019;s concern and in general agree with him. It is possible that if we were able to validate all the sites reported in the study we might get a good proportion (at least 30% by the validation rate) of the sites to be false positives for oxoG artefacts as well as other less known reasons. It is also true that given our stringent analysis thresholds we would also have some false negatives. Recently published estimates of somatic variations in single neurons in the whole genomes also points to this possibility. Given the scope of the present study we will not be able to accurately quantify either type of errors. However, we would like to point out that the main point of our study is not the exact locations of the somatic variations but the fact that physiological oxidative stress, in moderation, can generate such variants which have a biological significance (in terms of pathway enrichment). We hope that the reviewer will see merit in the argument.</p>
                <p> The reviewer is right to point out that our validation platform was not an orthogonal technique. We chose Mi-Seq over other methods considering not only the orthogonality but also robustness of the platform and error rates. Comparing all parameters Mi-Seq was our method of choice as we reasoned that even with the same sequencing chemistry, we should be able to validate true positives given a very high depth of reads obtained.</p>
            </body>
        </sub-article>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report17217">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.10226.r17217</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Sakumi</surname>
                        <given-names>Kunihiko</given-names>
                    </name>
                    <xref ref-type="aff" rid="r17217a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r17217a1">
                    <label>1</label>Division of Neurofunctional Genomics, Kyushu University, Fukuoka, Japan</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>26</day>
                <month>10</month>
                <year>2016</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2016 Sakumi K</copyright-statement>
                <copyright-year>2016</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport17217" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.9495.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>In the manuscript, Sharma et al. reported that adult human brains harbor single nucleotide somatic variations in the genome. Using whole exome sequencing by Illumina Truseq and Hiseq2000 system, authors compared single nucleotide variations between frontal cortex (FC) and corpus callosum (CC) of human brains. These variations were partially validated by Amplicon sequencing using MiSeq. The authors found that G to T transversion and 8-OHdG levels were increased in FC. They observed that the somatic variations were enriched in genes belonging to axon guidance pathways and neuronal processes related pathways. They propose that oxidative stress influences single nucleotide somatic variations in normal human brain.</p>
            <p> </p>
            <p> There is a problem with the interpretation of the data. Standard protocol of Truseq uses 100 ng of DNA, which is equivalent to 10
                <sup>7</sup> human cells. To get 10% of alt sequence, 2 x 10
                <sup>6</sup> cells should carry the same variation in hetero. In the adult brain tissue, it is hardly to occur the 8-OHdG caused mutation at the same site of 2 x 10
                <sup>6</sup> cell genome. If this observation is correct, at least 21 times DNA replication events are required to expand the cell population carrying the same somatic variation. Such mosaicism is well known in development as described by the authors in Introduction. Probably, the seed of the observed single nucleotide variation caused by 8-OHdG was generated around birth, rapidly neuronal cells replicating stage, and not after adulthood.</p>
            <p> </p>
            <p> Another possibility to explain the 10% of alt sequence is contamination of blood cells in brain tissue sample. There is no quantitative purity analysis of tissue (source of DNA). If blood or other type of cell were contaminated in the FC sample, somatic mosaicism might be detectable in this system. If the author could provide the SNV of blood DNA from the same brain donor individual, we can get the answer.&#x00a0;&#x00a0;</p>
            <p> </p>
            <p> I recommend the authors to reconsider the data, and to prepare the model including time scale of mutation accumulation reflecting these results.&#x00a0;</p>
            <p> </p>
            <p> Comments: 
                <list list-type="bullet">
                    <list-item>
                        <p>Figure 4 -</p>
                        <p> In this figure, proportion of genotype exhibits completely symmetrical pattern between CC and FC.&#x00a0;I cannot understand the reason.&#x00a0;The authors should discuss this observation with any possible mechanism.&#x00a0;&#x00a0;</p>
                    </list-item>
                </list> 
                <list list-type="bullet">
                    <list-item>
                        <p>p9, 1
                            <sup>st</sup> paragraph -</p>
                        <p> To avoid inappropriate bias, authors should use high fidelity DNA polymerase for PCR, and Sanger sequencing method to validate the variant ratio, quantitatively.&#x00a0;By showing the chromatogram pattern, we can discuss the variant ratio quantitatively.</p>
                    </list-item>
                </list> 
                <list list-type="bullet">
                    <list-item>
                        <p>Dataset 1 -</p>
                        <p> I could not read the 9 somatic variation.csv files in Dataset 1.</p>
                    </list-item>
                </list>
            </p>
            <p>Reviewer Expertise:</p>
            <p>NA</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
        <sub-article article-type="response" id="comment2349-17217">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>mukhopadhyay</surname>
                            <given-names>arijit</given-names>
                        </name>
                        <aff>Institute of Genomics and Integrative Biology, Delhi, India</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No Competing interests</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>5</day>
                    <month>12</month>
                    <year>2016</year>
                </pub-date>
            </front-stub>
            <body>
                <p>Rev Comment:&#x00a0;In the manuscript, Sharma et al. reported that adult human brains harbor single nucleotide somatic variations in the genome. Using whole exome sequencing by Illumina Truseq and Hiseq2000 system, authors compared single nucleotide variations between frontal cortex (FC) and corpus callosum (CC) of human brains. These variations were partially validated by Amplicon sequencing using MiSeq. The authors found that G to T transversion and 8-OHdG levels were increased in FC. They observed that the somatic variations were enriched in genes belonging to axon guidance pathways and neuronal processes related pathways. They propose that oxidative stress influences single nucleotide somatic variations in normal human brain.</p>
                <p> </p>
                <p> There is a problem with the interpretation of the data. Standard protocol of Truseq uses 100 ng of DNA, which is equivalent to 10
                    <sup>7</sup>&#x00a0;human cells. To get 10% of alt sequence, 2 x 10
                    <sup>6</sup>&#x00a0;cells should carry the same variation in hetero. In the adult brain tissue, it is hardly to occur the 8-OHdG caused mutation at the same site of 2 x 10
                    <sup>6</sup>&#x00a0;cell genome. If this observation is correct, at least 21 times DNA replication events are required to expand the cell population carrying the same somatic variation. Such mosaicism is well known in development as described by the authors in Introduction. Probably, the seed of the observed single nucleotide variation caused by 8-OHdG was generated around birth, rapidly neuronal cells replicating stage, and not after adulthood.</p>
                <p> Ans: We agree with the reviewer. As it is known in the literature, for human brain the neurogenesis process starts at E13 and finishes by E108. The cell division is asymmetric, as after a certain point from each division results one mitotic and one post-mitotic neuron &#x2013; and hence accurate prediction of the number of dividing neurons within this time-frame is not possible. According to the estimates an infant brain has neurons in the order of 10
                    <sup>10</sup> &#x2013; 10
                    <sup>11</sup>. As pointed by the reviewer, following our criteria at least 10
                    <sup>6</sup> cells needs to have the genomic variation. So, we also hypothesize that the variations we are observing had occurred before birth or within the first couple of years after birth. Looking at the specific type of majority of the changes (G&gt;T transversions), it is even more compelling that this is most likely as the high metabolic demand of the cortical region would create a local oxidative stress and such &#x2018;errors&#x2019; might have a rapid clonal expansion if these would enhance the &#x2018;fitness&#x2019; of the local population.</p>
                <p> </p>
                <p> Rev Comment:&#x00a0;Another possibility to explain the 10% of alt sequence is contamination of blood cells in brain tissue sample. There is no quantitative purity analysis of tissue (source of DNA). If blood or other type of cell were contaminated in the FC sample, somatic mosaicism might be detectable in this system. If the author could provide the SNV of blood DNA from the same brain donor individual, we can get the answer.&#x00a0;&#x00a0;</p>
                <p> Ans: We understand the reviewer&#x2019;s concern. However, we would like to argue that the observed results cannot come from contamination for the following reasons: 
                    <list list-type="order">
                        <list-item>
                            <p>We see a specific pattern in the frontal cortex of heterozygous sites which is very different from that of corpus callosum obtained from the same brain. Such a pattern would not appear &#x00a0;for each sample if it was due to contamination</p>
                        </list-item>
                        <list-item>
                            <p>We see enrichment of relevant biological processes for each individual only for the relevant tissue type. Such a pattern is also not expected for a random contamination event</p>
                        </list-item>
                    </list> We agree that data on blood SNV for the same samples would be useful in addressing it in a more robust way &#x2013; but unfortunately, such samples were not available.</p>
                <p> Rev Comment:&#x00a0;I recommend the authors to reconsider the data, and to prepare the model including time scale of mutation accumulation reflecting these results.&#x00a0;</p>
                <p> </p>
                <p> Ans: We attempted to devise a model but finally could not do it as the neuronal division is asymmetric. It starts at E13 initially in a symmetric manner (i.e. two mitotic daughter cells from one mother cell) but soon becomes asymmetric (one mitotic and one post-mitotic daughter cells) which relies on a milieu of chemical cues &#x2013; both inducing as well as inhibitory. Thus any model would be highly inaccurate. Instead, in the revised version we have now added the following sentences under the discussion section. We hope the reviewer will find it acceptable.</p>
                <p> &#x201c;From our observed results, it seems that most of the variations appeared around the time of birth when neurons are rapidly dividing. It is known in the literature that the process of neurogenesis spans from E13 to E108 and the number of neurons in an infant brain is in the order of 10
                    <sup>10</sup> &#x2013; 10
                    <sup>11</sup>. With our analysis threshold of at least 10% abundance of the variant allele, the variation should be present in 10
                    <sup>6</sup>-10
                    <sup>7</sup> cells &#x2013; which is in order with the developmental time-frame. However, studies with more tissues (from brain and outside brain) from the same individuals would be needed to rule out other possible reasons.&#x201d;</p>
                <p> </p>
                <p> Rev Comments: 
                    <list list-type="bullet">
                        <list-item>
                            <p>Figure 4 -</p>
                            <p> In this figure, proportion of genotype exhibits completely symmetrical pattern between CC and FC.&#x00a0;I cannot understand the reason.&#x00a0;The authors should discuss this observation with any possible mechanism.&#x00a0;&#x00a0;</p>
                        </list-item>
                    </list> Ans: The reason is our analysis pipeline. As described in the manuscript any variation qualifies as a somatic variation where genotype of one tissue (e.g. frontal cortex) varies from that of the other pair (e.g. corpus callosum). Hence, by definition if at a locus FC genotype is heterozygous then CC would have the homozygous genotype &#x2013; and vice versa. This is why we see a mirror image of genotype distribution in figure 4. 
                    <list list-type="bullet">
                        <list-item>
                            <p>p9, 1
                                <sup>st</sup>&#x00a0;paragraph -</p>
                            <p> To avoid inappropriate bias, authors should use high fidelity DNA polymerase for PCR, and Sanger sequencing method to validate the variant ratio, quantitatively.&#x00a0;By showing the chromatogram pattern, we can discuss the variant ratio quantitatively.</p>
                        </list-item>
                    </list> Ans: High-fidelity DNA polymerase was used in the amplification steps. For Sanger sequencing, majority of these sites would not be picked up due to the low abundance of variant allele. Sanger sequencing is more efficient when the variant allele frequency is 20% and above. This is why we opted for a very deep sequencing approach like Mi-Seq. 
                    <list list-type="bullet">
                        <list-item>
                            <p>Dataset 1 -</p>
                            <p> I could not read the 9 somatic variation.csv files in Dataset 1.</p>
                        </list-item>
                    </list> Ans: We are sorry for the technical difficulty. We have freshly uploaded the files (New Dataset 1.zip) and hope that it is readable now.</p>
            </body>
        </sub-article>
    </sub-article>
</article>
