<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article" dtd-version="1.2" xml:lang="en">
    <front>
        <journal-meta>
            <journal-id journal-id-type="pmc">F1000Research</journal-id>
            <journal-title-group>
                <journal-title>F1000Research</journal-title>
            </journal-title-group>
            <issn pub-type="epub">2046-1402</issn>
            <publisher>
                <publisher-name>F1000 Research Limited</publisher-name>
                <publisher-loc>London, UK</publisher-loc>
            </publisher>
        </journal-meta>
        <article-meta>
            <article-id pub-id-type="doi">10.12688/f1000research.11547.1</article-id>
            <article-categories>
                <subj-group subj-group-type="heading">
                    <subject>Research Article</subject>
                </subj-group>
                <subj-group>
                    <subject>Articles</subject>
                    <subj-group>
                        <subject>Bioinformatics</subject>
                    </subj-group>
                </subj-group>
            </article-categories>
            <title-group>
                <article-title>Investigation of chimeric reads using the MinION</article-title>
                <fn-group content-type="pub-status">
                    <fn>
                        <p>[version 1; peer review: 1 approved, 1 approved with reservations]</p>
                    </fn>
                </fn-group>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>White</surname>
                        <given-names>Ruby</given-names>
                    </name>
                    <uri content-type="orcid">https://orcid.org/0000-0002-2503-9169</uri>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Pellefigues</surname>
                        <given-names>Christophe</given-names>
                    </name>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Ronchese</surname>
                        <given-names>Franca</given-names>
                    </name>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="no">
                    <name>
                        <surname>Lamiable</surname>
                        <given-names>Olivier</given-names>
                    </name>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <contrib contrib-type="author" corresp="yes">
                    <name>
                        <surname>Eccles</surname>
                        <given-names>David</given-names>
                    </name>
                    <uri content-type="orcid">https://orcid.org/0000-0003-4634-4995</uri>
                    <xref ref-type="corresp" rid="c1">a</xref>
                    <xref ref-type="aff" rid="a1">1</xref>
                </contrib>
                <aff id="a1">
                    <label>1</label>Malaghan Institute of Medical Research, Wellington, 6242, New Zealand</aff>
            </contrib-group>
            <author-notes>
                <corresp id="c1">
                    <label>a</label>
                    <email xlink:href="mailto:bioinformatics@gringene.org">bioinformatics@gringene.org</email>
                </corresp>
                <fn fn-type="con">
                    <p>RW: Sample preparation and QC; CP: Mouse injections, RNA extraction; FR: Project oversight; OL: Sample preparation, project design and oversight; DE: DNA sequencing and bioinformatics analysis. All authors contributed towards the preparation of the manuscript.</p>
                </fn>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>The R9.4 flow cell and sequencing kit (SQK-LSK208) used for this experiment were provided free of charge by ONT as replacements for a purchased kit and flow cell where the phenomena of chimeric reads was initially discovered. ONT provided advice regarding the sample preparation protocols, including the suggestion of a slow overnight ligation step.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>5</day>
                <month>5</month>
                <year>2017</year>
            </pub-date>
            <pub-date pub-type="collection">
                <year>2017</year>
            </pub-date>
            <volume>6</volume>
            <elocation-id>631</elocation-id>
            <history>
                <date date-type="accepted">
                    <day>5</day>
                    <month>5</month>
                    <year>2017</year>
                </date>
            </history>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2017 White R et al.</copyright-statement>
                <copyright-year>2017</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <self-uri content-type="pdf" xlink:href="https://f1000research.com/articles/6-631/pdf"/>
            <abstract>
                <p>Following a nanopore sequencing run of PCR products of three amplicons less than 1kb, an abundance of reads failed quality control due to template/complement mismatch. A BLAST search demonstrated that some of the failed reads mapped to two different genes &#x2013; an unexpected observation, given that PCR was carried out separately for each amplicon.</p>
                <p>A further investigation was carried out specifically to search for chimeric reads, using separate barcodes for each amplicon and trying two different ligation methods prior to sample loading. Despite the separation of ligation products, chimeric reads formed from different amplicons were still observed in the base-called sequence. The long-read nature of nanopore sequencing presents an effective tool for the discovery and filtering of chimeric reads. We have found that at least 1.7% of reads prepared using the Nanopore LSK002 2D Ligation Kit include post-amplification chimeric elements. This finding has potential implications for other amplicon sequencing technologies, as the process is unlikely to be specific to the sample preparation used for nanopore sequencing.</p>
            </abstract>
            <kwd-group kwd-group-type="author">
                <kwd>chimerism</kwd>
                <kwd>MinION</kwd>
                <kwd>interferon</kwd>
                <kwd>amplicon</kwd>
                <kwd>R9.4</kwd>
                <kwd>signal</kwd>
                <kwd>nanopore</kwd>
            </kwd-group>
            <funding-group>
                <award-group id="fund-1">
                    <funding-source>Health Research Council of New Zealand</funding-source>
                    <award-id>HRC14/1003</award-id>
                </award-group>
                <funding-statement>This work was funded in full by Health Research Council of NZ Independent Research Organization (IRO) funding to the Malaghan Institute of Medical Research (grant number HRC14/1003).</funding-statement>
                <funding-statement>
                    <italic>The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.</italic>
                </funding-statement>
            </funding-group>
        </article-meta>
    </front>
    <body>
        <sec sec-type="intro">
            <title>Introduction</title>
            <p>High-throughput DNA sequencing is a rapidly evolving field with new methods and applications introduced almost weekly
                <sup>
                    <xref ref-type="bibr" rid="ref-1">1</xref>
                </sup>. One of the most recent sequencing technologies available on the market is the MinION sequencing device from Oxford Nanopore Technologies (ONT)
                <sup>
                    <xref ref-type="bibr" rid="ref-2">2</xref>
                </sup>. A brief overview of MinION sequencing technology is discussed in our previous study on mitochondrial genome assembly
                <sup>
                    <xref ref-type="bibr" rid="ref-3">3</xref>
                </sup>.</p>
            <p>Instead of exploiting base-pairing as in the sequencing by synthesis approach used by Illumina and others, nanopore sequencing uses an electronic sensor to detect DNA via a change in electric current (reviewed in 
                <xref ref-type="bibr" rid="ref-4">4</xref>). The MinION&#x2019;s flow cell is comprised of 2048 wells containing a membrane perforated by nanopores. Ligated with a molecular motor, a single stranded DNA molecule passes through the pore, altering the recorded current. After the electronic sequencing is carried out, a software basecalling algorithm transforms the current trace into a modelled DNA sequence. The advantages of the MinION are rapid library preparation, portability
                <sup>
                    <xref ref-type="bibr" rid="ref-5">5</xref>,
                    <xref ref-type="bibr" rid="ref-6">6</xref>
                </sup>, long molecule sequencing
                <sup>
                    <xref ref-type="bibr" rid="ref-7">7</xref>
                </sup>, and sequencing of non-model modifications of the DNA strand
                <sup>
                    <xref ref-type="bibr" rid="ref-8">8</xref>
                </sup>. With the recent improvement in the chemistry of the MinION, ONT has overcome the majority of issues associated with low yield and high error rates that have limited the range of its application. The MinION sequencing device has now been successfully applied to sequence genomes of a wide range of sizes, from bacterial and viral genomes
                <sup>
                    <xref ref-type="bibr" rid="ref-9">9</xref>,
                    <xref ref-type="bibr" rid="ref-10">10</xref>
                </sup>, amplicon sequencing like bacterial 16S rRNA sequencing
                <sup>
                    <xref ref-type="bibr" rid="ref-11">11</xref>
                </sup>, and more recently a human genome
                <sup>
                    <xref ref-type="bibr" rid="ref-12">12</xref>
                </sup>. The MinION has also been used for cDNA sequencing
                <sup>
                    <xref ref-type="bibr" rid="ref-13">13</xref>
                </sup>, for detecting DNA methylation patterns without chemical treatment
                <sup>
                    <xref ref-type="bibr" rid="ref-8">8</xref>,
                    <xref ref-type="bibr" rid="ref-14">14</xref>
                </sup>, and for direct RNA sequencing with detection of modified 16S rRNA nucleotides
                <sup>
                    <xref ref-type="bibr" rid="ref-15">15</xref>
                </sup>.</p>
            <p>Using the most recent R9.4 flow cells, we have evaluated the MinION technology for the amplicon sequencing of highly similar genes. Since we have an interest in interferon response during large parasitic infection
                <sup>
                    <xref ref-type="bibr" rid="ref-16">16</xref>
                </sup>, we sequenced the type I Interferon (IFN) family. Type I IFNs are a family of intronless antiviral response genes comprised, in mice, of 14 highly homologous 
                <italic toggle="yes">Ifna</italic> members, as well as the genes 
                <italic toggle="yes">Ifnb</italic>, 
                <italic toggle="yes">Ifnk</italic> and 
                <italic toggle="yes">Ifne</italic>
                <sup>
                    <xref ref-type="bibr" rid="ref-17">17</xref>
                </sup>. In humans, sequence similarity across the 14 members of the 
                <italic toggle="yes">Ifna</italic> genes is 70&#x2013;80%, with a further 35% sequence similarity between 
                <italic toggle="yes">Ifna</italic> and 
                <italic toggle="yes">Ifnb</italic>. Type I IFN has both an important role in innate antiviral immunity and in mounting adaptive T helper cell responses
                <sup>
                    <xref ref-type="bibr" rid="ref-16">16</xref>,
                    <xref ref-type="bibr" rid="ref-18">18</xref>
                </sup>. Based on previous observations, we aimed to identify which type I IFN member(s) were responsible for driving the type I IFN signalling in our infection model.</p>
            <p>Due to the high homology between the 
                <italic toggle="yes">Ifna</italic> family, accurately detecting quantitative expression of the different gene members by Sanger sequencing or next generation sequencing is difficult. We instead employed nanopore sequencing, which allowed us to acquire full-length reads from each individual sequence that were amplified by the PCR reaction. We aimed to determine the relative quantities of the various 
                <italic toggle="yes">Ifna</italic> family and 
                <italic toggle="yes">Ifnb</italic> transcripts, in 
                <italic toggle="yes">Nb</italic> treated mouse ear tissue using MinION; therefore enabling both the differentiation between the various 
                <italic toggle="yes">Ifna</italic> genes, and the potential to perform quantitative analysis.</p>
        </sec>
        <sec sec-type="methods">
            <title>Methods</title>
            <p>

                <italic toggle="yes">Nippostrongylus brasiliensis</italic> was originally sourced from Lindsey Dent of the University of Adelaide, South Australia and has been maintained for 22 years by serial passage at the Malaghan Institute. Female Lewis rats were bred and used for maintenance of the 
                <italic toggle="yes">N. brasiliensis</italic> life cycle when 4 months of age (and weight over 150g), as outlined in Camberis 
                <italic toggle="yes">et al.</italic>
                <sup>
                    <xref ref-type="bibr" rid="ref-19">19</xref>
                </sup>.</p>
            <p>Two 8-week-old C57BL6/J male mice (Jackson Laboratories, approx 23g), housed and bred at the Malaghan Institute of Medical Research under specific pathogen free conditions, respecting the local and New Zealand ethics guidelines, were chosen for the investigation. 300 dead infective 
                <italic toggle="yes">N. brasiliensis</italic> L3 larvae (Nb) were injected intradermally in each ear of one mouse in 30uL PBS after anaesthesia with an intraperitoneal injection of 200ul ketamine/xylazine. Another mouse was similarly anaesthetised and injected intradermally in each ear with 30uL PBS. The mice were euthanised in a 
                <italic toggle="yes">CO</italic>
                <sub>2</sub> chamber 3h post injection and ears (approx 27&#x2013;30mg in weight) were immediately harvested and conserved in RNALater at 4C for &lt;1h. RNA extraction of each whole ear (30mg) was done in 1mL of Trizol following the products&#x2019; guidelines (Thermofisher). cDNA was synthesised using the High Capacity RNA-to-cDNA kit (Applied Biosystems), according to the manufacturer&#x2019;s instructions. Only the cDNA from the 
                <italic toggle="yes">N. brasiliensis</italic>-treated mouse was used for this investigation. 
                <italic toggle="yes">Ifna</italic>, 
                <italic toggle="yes">Ifnb</italic>, and 
                <italic toggle="yes">Actb</italic> amplicons were generated using specific primers: 
                <italic toggle="yes">IfnaF</italic> (ATGGCTAGRCTCTGTGCTTTCCT) and 
                <italic toggle="yes">IfnaR</italic> (AGGGCTCTCCAGAYTTCTGCTCTG)
                <sup>
                    <xref ref-type="bibr" rid="ref-20">20</xref>
                </sup>; 
                <italic toggle="yes">IfnbF</italic> (CTGGCTTCCATCATGAACAA) and 
                <italic toggle="yes">IfnbR</italic> (GCAACCACCACTCATTCTGA); and 
                <italic toggle="yes">ActbF</italic> (AGGGAAATCGTGCGTGACAT) and 
                <italic toggle="yes">ActbR</italic> (ACGCAGCTCAGTAACAGTCC), which were purchased from Integrated DNA Technologies. PCR amplification was performed using Phusion High-Fidelity PCR Kit (Thermo Scientific) with 25ng cDNA, see 
                <xref ref-type="fig" rid="f1">Figure 1</xref>. The cycling conditions were as follows: denaturation at 98 for 30 seconds; 35 cycles of 98 degrees for 10 seconds, 61 degrees for 30 seconds, 72 degrees for 30 seconds; final extension of 72 degrees for 10 minutes. Samples were held at 4 degrees until use for PCR clean up and gel electrophoresis. PCR products were cleaned using QIAquick PCR Purification Kit (QIAGEN) and verified by gel electrophoresis.</p>
            <fig fig-type="figure" id="f1" orientation="portrait" position="float">
                <label>Figure 1. </label>
                <caption>
                    <title>Gel-electrophoresis image of PCR products amplified for this investigation.</title>
                    <p>(
                        <bold>A</bold>) Amplicons were observed for Ifna and Actb from both PBS treated (1&amp;2, not sequenced in this investigation) and 
                        <italic toggle="yes">Nb</italic>-treated (3) samples at the expected sizes of 535 bp, and 524 bp respectively. The Ifnb gene from the 
                        <italic toggle="yes">Nb</italic>-treated sample (3) failed to amplify during this first attempt. (
                        <bold>B</bold>) A repeat amplification of Ifnb from 
                        <italic toggle="yes">Nb</italic>-treated sample was carried out, producing a single band of approximately 600bp. This was run alongside amplicons of Ifna, Ifnb and Actb from genomic DNA; however genomic amplicons were not used for subsequent MinION sequencing.</p>
                </caption>
                <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/12473/5fad195b-3555-4482-9bb8-2e1be7addd18_figure1.gif"/>
            </fig>
            <p>

                <italic toggle="yes">Ifna</italic> cDNA were amplified by PCR using primers designed across a highly-conserved region of all 
                <italic toggle="yes">Ifna</italic> coding sequences, which resulted in a mixed PCR product containing all 14 
                <italic toggle="yes">Ifna</italic> genes. cDNAs of 
                <italic toggle="yes">Ifnb</italic> and 
                <italic toggle="yes">Actb</italic> were amplified separately and used as quantification controls. Altogether, the three pooled amplicons were loaded into a flow cell and sequenced. Among the reads that we obtained, we noticed long chimeric reads comprising of two or more sequences from different amplicons. We decided to further examine this phenomenon.</p>
            <p>Ethics approval for maintenance of the 
                <italic toggle="yes">N. brasiliensis</italic> life cycle is overseen and approved by the Victoria University of Wellington Animal Ethics Committee. C57BL/6J mice were originally obtained from The Jackson Laboratory, Bar Harbour, Maine, USA, and maintained at the Biomedical Research Unit of the Malaghan Institute of Medical Research by brother X sister mating. Breeding pairs were refreshed regularly to maintain the genetic integrity of the strain. Mice were maintained in specific pathogen-free conditions, and housed and cared for according to the concepts of &#x201c;A Culture of Care&#x201d; of the Ministry of Primary Industries, NZ. All mouse experiments were approved by the Victoria University Animal Ethics Committee (permit number 23907) and carried out according to institutional guidelines.</p>
            <sec>
                <title>Library preparation</title>
                <p>The ONT Native Barcoding Kit (EXP-NBD002) and 2D Ligation Sequencing Kit (SQK-LSK208) were used to prepare the samples for sequencing, as per the manufacturer&#x2019;s protocol. Briefly, purified PCR amplicon products were bluntended, ligated with barcode sequences, pooled in approximately equimolar amounts, then ligated with flow cell adapters and a hairpin linker. In order to explore the effect of ligation method on the degree of chimerism, two different adapter/hairpin ligation reactions were carried out: one using the standard quick (10-minute) ligation, and the other using an overnight ligation at 4&#x00b0; Celsius. No additional adapter-free controls were used; it has been our prior experience that sequencing does not proceed in a callable fashion unless adapter sequences are present. The barcoding scheme used in the library preparation is shown in 
                    <xref ref-type="fig" rid="f2">Figure 2</xref>. Samples were quantified after barcoding for overnight ligation (2.14 
                    <italic toggle="yes">ng/&#x00b5;l</italic>, 2.54 
                    <italic toggle="yes">ng/&#x00b5;l</italic> and 2.56 
                    <italic toggle="yes">ng/&#x00b5;l</italic> for 
                    <italic toggle="yes">Ifna</italic>, 
                    <italic toggle="yes">Ifnb</italic>, and 
                    <italic toggle="yes">Actb</italic> respectively) and for quick ligation (2.13 
                    <italic toggle="yes">ng/&#x00b5;l</italic>, 2.68 
                    <italic toggle="yes">ng/&#x00b5;l</italic> and 2.45 
                    <italic toggle="yes">ng/&#x00b5;l</italic> for 
                    <italic toggle="yes">Ifna</italic>, 
                    <italic toggle="yes">Ifnb</italic>, and 
                    <italic toggle="yes">Actb</italic> respectively). These samples were normalised and pooled together to give 26.6ng each in 33.1
                    <italic toggle="yes">&#x00b5;l</italic> distilled water for ligation. After adapter ligation, the quick ligation method had no detectable nucleic acid, as seen using a fluorescence quantitation with the Quantus fluorometer (Promega), while the overnight ligation quantified at 0.239ng/
                    <italic toggle="yes">&#x00b5;l</italic>. We decided to pool the samples together anyway, and were pleasantly surprised to discover a substantial proportion of reads from quick-barcoded sequences.</p>
                <fig fig-type="figure" id="f2" orientation="portrait" position="float">
                    <label>Figure 2. </label>
                    <caption>
                        <title>Sample preparation workflow demonstrating the steps used to aid in the identification of the stage at which chimeric reads were formed.</title>
                        <p>Mouse cDNA was extracted and separately amplified for three different amplicons. The amplified product was then separated and barcoded based on the intended ligation process. Barcoded products were pooled and ligated to adapters via the overnight or the quick ligation method, then finally pooled together for sequencing.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/12473/5fad195b-3555-4482-9bb8-2e1be7addd18_figure2.gif"/>
                </fig>
            </sec>
            <sec>
                <title>Base calling</title>
                <p>Reads were initially basecalled during the sequencing runs in January 2017 using Metrichor 2D basecalling, from MinKNOW v1.3.25. An initial analysis of called reads demonstrated substantial disagreement between base calls and the raw signal (e.g. hairpin adapter sequences matching multiple times when the signal showed only one present), so reads were recalled as in March 2017 using Albacore v0.7.5.</p>
            </sec>
        </sec>
        <sec sec-type="results | discussion">
            <title>Results and discussion</title>
            <p>During the initial MinION sequencing run to investigate the expression of 
                <italic toggle="yes">Ifna</italic>-family members in mice (comparing with 
                <italic toggle="yes">Ifnb</italic> and 
                <italic toggle="yes">Actb</italic> transcripts), we encountered issues with 2D basecalling through the Metrichor web service, which seemed to be due to failed alignment of component 1D strands. A BLAST search on some of the longest basecalled 1D reads led to a discovery that some reads had multiple mappings to our target 
                <italic toggle="yes">Ifna</italic>-family members. Further exploration of the data demonstrated a situation in which both 
                <italic toggle="yes">Ifna</italic> and 
                <italic toggle="yes">Actb</italic> sequences were present in the same read (see 
                <xref ref-type="fig" rid="f3">Figure 3</xref>). This was an unexpected result; we had carried out separate PCR reactions for each transcript, so were not expecting reads to appear that mapped to different transcripts. Our conclusion was that chimeric ligation of input DNA was occurring at some stage during the sample preparation process, but all we were able to determine at the time was that this chimerism was happening some time after the PCR, but before the sequencing. The present experiment was designed in light of these prior results to more easily quantify the degree of ligation that was happening.</p>
            <fig fig-type="figure" id="f3" orientation="portrait" position="float">
                <label>Figure 3. </label>
                <caption>
                    <title>A chimeric read that was discovered during the preliminary investigation of interferon expression.</title>
                    <p>This read mapped to both beta-actin and interferon alpha, suggesting that a ligation of sequence had occurred, either during sample preparation or 
                        <italic toggle="yes">in-silico</italic>.</p>
                </caption>
                <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/12473/5fad195b-3555-4482-9bb8-2e1be7addd18_figure3.gif"/>
            </fig>
            <sec>
                <title>Read counts</title>
                <p>Despite using a 2D ligation chemistry in the sample preparation, and selecting out hairpin-containing reads using streptavidin beads, the majority of reads could not be called as an aligned 2D sequence: of 329,591 sequenced reads, 299,124 were basecalled by Albacore, and 1005 (0.3%) of these basecalled reads had an aligned 2D sequence (see 
                    <xref ref-type="other" rid="SF1">Supplementary File 1</xref>). The reasons behind this basecall failure were not investigated. Any called reads that were not called as 2D were processed further as 1D sequence, i.e. the remaining 298,119 (99.7%) of called reads.</p>
            </sec>
            <sec>
                <title>Read mapping</title>
                <p>Called 1D reads were mapped to 
                    <italic toggle="yes">Actb</italic>, 
                    <italic toggle="yes">Ifnb1</italic>, an 
                    <italic toggle="yes">Ifna</italic> consensus sequence, additional interferon sequences, the ONT control strand sequence, and known ONT adapter sequences (see 
                    <xref ref-type="other" rid="SF2">Supplementary File 2</xref>) using LAST v833
                    <sup>
                        <xref ref-type="bibr" rid="ref-21">21</xref>
                    </sup>. A total of 261,183 reads (87.6% of called 1D reads) were discovered that mapped to at least one known amplicon and/or barcode sequence.</p>
            </sec>
            <sec>
                <title>Categorisation of chimeric reads</title>
                <p>Using a process of elimination, a total of 4563 reads (1.7% of amplicon or barcode-mappable 1D reads) were discovered with basecalled sequences that were definitively chimeric (see 
                    <xref ref-type="other" rid="SF5">Supplementary File 5</xref>). These reads mapped at least once to either one of the three amplicon sequences, or at least once to one of the six barcode sequences. These were broken into four categories (with some overlap) based on the observed combinations of barcode and amplicon sequences (see 
                    <xref ref-type="fig" rid="f4">Figure 4</xref>):</p>
                <list list-type="bullet">
                    <list-item>
                        <label>1.</label>
                        <p>Repeated identical amplicons aligned in the same direction</p>
                    </list-item>
                    <list-item>
                        <label>2.</label>
                        <p>At least two distinct amplicons</p>
                    </list-item>
                    <list-item>
                        <label>3.</label>
                        <p>At least two distinct barcode</p>
                    </list-item>
                    <list-item>
                        <label>4.</label>
                        <p>Disagreement between barcode and amplicon</p>
                    </list-item>
                </list>
                <fig fig-type="figure" id="f4" orientation="portrait" position="float">
                    <label>Figure 4. </label>
                    <caption>
                        <title>Definitively chimeric reads mapped during the sequencing run.</title>
                        <p>Chimeric read categories are not disjointed: different categories may intersect with each other. Reads that mapped to repeated identical, but reverse-complemented sequences, are not included in these chimeric results, as it was not possible to distinguish at the base sequence level between such a duplicated sequence fragment and a 2D read with hairpin.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/12473/5fad195b-3555-4482-9bb8-2e1be7addd18_figure4.gif"/>
                </fig>
                <p>The highest proportion of chimeric reads were associated with repeated identical amplicons, with 3441 reads seen (75% of all definitively chimeric reads). This suggests that an amplicon sequencing procedure will be particularly susceptible to read chimerism, as the same sequence will appear in increased abundance compared to an untargeted sequencing approach. The low-temperature overnight ligation had a much higher proportion of repeated amplicons than the quick ligation; in this case it appears that the quick ligation was better at reducing the occurrence of chimeric reads, despite prior expectations. Of the definitively chimeric reads, 2869 included at least one overnight barcode (1.8% of 159,188 amplicon-mapped reads with an overnight barcode), and 1203 included at least one quick barcode (2.6% of 45,850 amplicon-mapped reads with a quick barcode). While it appears that the use of overnight ligation has helped somewhat to reduce chimeric reads, a substantial proportion of chimeric reads still remain.</p>
                <p>If a cassette of adjacent 
                    <italic toggle="yes">Ifna</italic> genes were transcribed together, it is possible that this cassette could be amplified together as a single sequence. These sequences would appear to be chimeric (and fall into the "Repeated amplicons" category), but wouldn&#x2019;t have any intermediate barcodes. The count similarities for repeated 
                    <italic toggle="yes">Ifna</italic>, 
                    <italic toggle="yes">Ifnb1</italic> and 
                    <italic toggle="yes">Actb</italic> genes in 
                    <xref ref-type="table" rid="T1">Table 1</xref> suggest that this is not happening at any significant level.</p>
                <table-wrap id="T1" orientation="portrait" position="anchor">
                    <label>Table 1. </label>
                    <caption>
                        <title>Chimeric read counts split into categories depending on the number of amplicons and barcodes seen.</title>
                        <p> Only categories with a count of 5 or more are displayed.</p>
                    </caption>
                    <table content-type="article-table" frame="hsides">
                        <thead>
                            <tr>
                                <th align="center" colspan="3" rowspan="1">Amplicon count</th>
                                <th align="center" colspan="3" rowspan="1">Barcodes seen</th>
                                <th align="center" colspan="1" rowspan="1">Read</th>
                            </tr>
                            <tr>
                                <th align="center" colspan="1" rowspan="1">Ifna</th>
                                <th align="center" colspan="1" rowspan="1">Ifnb1</th>
                                <th align="center" colspan="1" rowspan="1">Actb</th>
                                <th align="center" colspan="1" rowspan="1">overnight</th>
                                <th align="center" colspan="1" rowspan="1">quick</th>
                                <th align="center" colspan="1" rowspan="1">disagreement</th>
                                <th align="center" colspan="1" rowspan="1">count</th>
                            </tr>
                        </thead>
                        <tbody>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">2</td>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td colspan="1" rowspan="1"/>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">876</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">2</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td colspan="1" rowspan="1"/>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">803</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">2</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td colspan="1" rowspan="1"/>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">704</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">2</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">378</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">1</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td align="center" colspan="1" rowspan="1">246</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">2</td>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">224</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">2</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td colspan="1" rowspan="1"/>
                                <td colspan="1" rowspan="1"/>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">201</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">1</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">1</td>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td align="center" colspan="1" rowspan="1">140</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">2</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td colspan="1" rowspan="1"/>
                                <td colspan="1" rowspan="1"/>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">125</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">1</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">1</td>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td align="center" colspan="1" rowspan="1">108</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">1</td>
                                <td align="center" colspan="1" rowspan="1">1</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td align="center" colspan="1" rowspan="1">99</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">1</td>
                                <td align="center" colspan="1" rowspan="1">1</td>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td align="center" colspan="1" rowspan="1">79</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">2</td>
                                <td colspan="1" rowspan="1"/>
                                <td colspan="1" rowspan="1"/>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">67</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">1</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td align="center" colspan="1" rowspan="1">64</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">1</td>
                                <td align="center" colspan="1" rowspan="1">1</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td align="center" colspan="1" rowspan="1">48</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">1</td>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td align="center" colspan="1" rowspan="1">42</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">1</td>
                                <td align="center" colspan="1" rowspan="1">1</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td colspan="1" rowspan="1"/>
                                <td colspan="1" rowspan="1"/>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">41</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">1</td>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td align="center" colspan="1" rowspan="1">37</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">1</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">1</td>
                                <td colspan="1" rowspan="1"/>
                                <td colspan="1" rowspan="1"/>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">36</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">1</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td align="center" colspan="1" rowspan="1">34</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">1</td>
                                <td align="center" colspan="1" rowspan="1">1</td>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td align="center" colspan="1" rowspan="1">31</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">1</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td align="center" colspan="1" rowspan="1">27</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">1</td>
                                <td align="center" colspan="1" rowspan="1">1</td>
                                <td colspan="1" rowspan="1"/>
                                <td colspan="1" rowspan="1"/>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">25</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">19</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">3</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td colspan="1" rowspan="1"/>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">15</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">2</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">15</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">2</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td align="center" colspan="1" rowspan="1">7</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">3</td>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td colspan="1" rowspan="1"/>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">7</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">3</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td colspan="1" rowspan="1"/>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">6</td>
                            </tr>
                            <tr>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">0</td>
                                <td align="center" colspan="1" rowspan="1">&#x25cb;</td>
                                <td colspan="1" rowspan="1"/>
                                <td colspan="1" rowspan="1"/>
                                <td align="center" colspan="1" rowspan="1">5</td>
                            </tr>
                        </tbody>
                    </table>
                </table-wrap>
            </sec>
            <sec>
                <title>Categorisation of non-chimeric reads</title>
                <p>After elimination of definitively chimeric reads, 256,620 reads remained that appeared to map uniquely to single sequences (see 
                    <xref ref-type="fig" rid="f5">Figure 5</xref>). A small proportion of these sequences (14,223; or 5.5%) had detectable barcode sequences, but did not map to any amplicons (i.e. mappable to an overnight or quick barcode sequence only). It is expected that these unmapped barcoded sequences were unamplified mouse cDNA sequences.</p>
                <fig fig-type="figure" id="f5" orientation="portrait" position="float">
                    <label>Figure 5. </label>
                    <caption>
                        <title>Amplicons mapped from basecalled non-chimeric reads.</title>
                        <p>(
                            <bold>A</bold>) Amplicon counts split by barcode type. (
                            <bold>B</bold>) Sequence only, quick barcode, and overnight barcode counts for amplicon-mapped sequences.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/12473/5fad195b-3555-4482-9bb8-2e1be7addd18_figure5.gif"/>
                </fig>
                <p>A difference in read counts was observed between overnight-barcoded sequences and quick-barcoded sequences (77.8% overnight, 22.2% quick), which was consistent with the difference in input amount observed during sample preparation. An attempt was made during sample preparation to add in the three different amplicon preparations in equimolar quantities, which was more successful for the 
                    <italic toggle="yes">Actb</italic> preparation (33.6%) than it was for the 
                    <italic toggle="yes">Ifna</italic> and 
                    <italic toggle="yes">Ifnb</italic> preparations (42.7% and 23.7%, respectively).</p>
                <p>An additional categorisation of 
                    <italic toggle="yes">Ifna</italic> family members (see 
                    <xref ref-type="other" rid="SF3">Supplementary File 3</xref>) was attempted, but is not presented here as it detracts from the main chimeric read investigation. Intermediate results and a processing script from this categorisation are available in verbose form as and 
                    <xref ref-type="other" rid="SF4">Supplementary File 4</xref>.</p>
            </sec>
            <sec>
                <title>Read signal confirmation of chimerism</title>
                <p>A few of the reads were investigated at the raw signal level to make sure that the electrical trace was in agreement with the base-called signal. A demonstrative signal trace for a non-chimeric 2D read comprising of a single barcode-adapted amplicon is shown in 
                    <xref ref-type="fig" rid="f6">Figure 6</xref>. Read traces typically began with a high-current (but relatively uniform) open pore state, followed by an intermediate stall signal (also fairly uniform), after which the highly variable sequence trace begins. Hairpin adapters could be easily identified in the raw signal as a bridge structure a little over halfway through a 2D sequence.</p>
                <fig fig-type="figure" id="f6" orientation="portrait" position="float">
                    <label>Figure 6. </label>
                    <caption>
                        <title>Demonstrative raw signal for a non-chimeric read (from an 
                            <italic toggle="yes">Ifnb1</italic> amplicon).</title>
                        <p>The recorded signal for this read starts with a very short open pore blip, followed by a long stall of 2.5s, then an NB05-flanked coding 
                            <italic toggle="yes">Ifnb1</italic> sequence that took just under 3s to transition through the pore, then an NB05-flanked non-coding 
                            <italic toggle="yes">Ifnb1</italic> sequence that took 2.5s seconds to transition through the pore. 
                            <italic toggle="yes">Note: These figures have been annotated with approximate region boundaries based on the order of hits to the base-called sequence</italic>.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/12473/5fad195b-3555-4482-9bb8-2e1be7addd18_figure6.gif"/>
                </fig>
                <p>A number of situations were observed in the basecalled sequence where ligation during sample prep seems to have occurred, and in some cases this ligation resulted in multiple hairpin adapters being ligated in the same sequence. One such occurrence of this is seen in 
                    <xref ref-type="fig" rid="f7">Figure 7</xref>, where two barcoded overnight sequences from two different amplicons (
                    <italic toggle="yes">Ifnb1</italic> and 
                    <italic toggle="yes">Ifna2</italic>) were joined together. Because two amplicons were concatenated, this ligation must have happened after the barcoding step of sample preparation (i.e. during adapter ligation).</p>
                <fig fig-type="figure" id="f7" orientation="portrait" position="float">
                    <label>Figure 7. </label>
                    <caption>
                        <title>Demonstrative raw signal for a chimeric read (containing a strand dissociation event and two separate hairpin events within the same base-called sequence).</title>
                        <p>The recorded signal begins with a very short open pore state (0.1s), followed by a long stall (2.5s), then an NB04-flanked 
                            <italic toggle="yes">Ifna2</italic> non-coding sequence with a transition time of 2.5s. At this stage there appears to be the beginning of a hairpin sequence that is finished by a pore stall. This was followed by a coding 
                            <italic toggle="yes">Ifnb1</italic> sequence with a transition time of 2s, then a hairpin, then an NB05-flanked non-coding 
                            <italic toggle="yes">Ifnb1</italic> sequence (2.5s), and finally an NB04-flanked coding 
                            <italic toggle="yes">Ifna2</italic> sequence (2.5s). Barcodes detected from this read (NB04/NB05) suggest that the chimeric sequence was likely formed during overnight ligation.</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/12473/5fad195b-3555-4482-9bb8-2e1be7addd18_figure7.gif"/>
                </fig>
                <p>This finding has potential implications for other sequencing technologies, as the ligation process used for sample preparation is unlikely to be specific for nanopore sequencing. The formation of chimeric reads during sample preparation may be one explanation for the index switching phenomenon seen in Illumina-sequenced reads (e.g. see
                    <sup>
                        <xref ref-type="bibr" rid="ref-22">22</xref>&#x2013;
                        <xref ref-type="bibr" rid="ref-24">24</xref>
                    </sup>), and presents a substantial problem for dual-indexed reads where identical indexes are used for different samples. Where dual-indexed reads are not used, ligation of reads with the same index may still be problematic depending on the particular sequencing application.</p>
            </sec>
            <sec>
                <title>

                    <italic toggle="yes">In-silico</italic> chimerism (1D
                    <sup>2</sup>)</title>
                <p>There were 8 instances where both an overnight and a quick barcode were observed in the basecalled sequence. In all such cases, there appears to have been a very short pore-protein dissociation between the sequencing of the two sequences (i.e. these were chimeric reads formed from 
                    <italic toggle="yes">in-silico</italic> ligation). The dissociation was only noticeable after inspecting the raw signal: a very short blip in the signal that matched the open pore current (e.g. see 
                    <xref ref-type="fig" rid="f8">Figure 8</xref>).</p>
                <fig fig-type="figure" id="f8" orientation="portrait" position="float">
                    <label>Figure 8. </label>
                    <caption>
                        <title>Demonstrative raw signal for a read (base-called as chimeric) that appeared to be from two different ligation preparations.</title>
                        <p>The recorded signal begins with a long open pore period (2.9s), and a short stall (0.1s), followed by NB11-flanked coding and non-coding 
                            <italic toggle="yes">Actb</italic> sequences (transition time of 2.5s for each). There is a very short open-pore blip at around 8s, followed by a short stall (0.1s), then NB06-flanked coding and non-coding 
                            <italic toggle="yes">Actb</italic> sequences (transition time of 2.5s for each).</p>
                    </caption>
                    <graphic orientation="portrait" position="float" xlink:href="https://f1000research-files.f1000.com/manuscripts/12473/5fad195b-3555-4482-9bb8-2e1be7addd18_figure8.gif"/>
                </fig>
                <p>It is likely to be the case that similar situations involving fast pore reloading are present in other reads, but not easily detectable from the called sequence because other barcode/amplicon combinations fit the expected base calling pattern. Considering that this situation can happen with non-identical sequences, software that is able to flag the presence of dissociation and/or stall events that are not at the start of the raw signal would be useful, as these features suggest that the base call is not likely to be a correct single sequence.</p>
                <p>The imminent release of ONT&#x2019;s R9.5 flow cells and 1D
                    <sup>2</sup> base calling will exploit this phenomenon of fast sequence loading into pores in order to produce high-accuracy reads derived from a combined template/complement base call (i.e. replacing the current hairpin-based 2D call). This replaces the 2D sample preparation process that we used for this investigation (see 
                    <xref ref-type="bibr" rid="ref-25">25</xref>).</p>
            </sec>
        </sec>
        <sec sec-type="conclusions">
            <title>Conclusions</title>
            <p>It is apparent from our investigation that chimeric reads can exist in the output of sequencing runs, and we recommend that researchers consider this possibility when interpreting their own results. As a result, it is a good idea to include easily-detectable adapters when sequencing DNA. These adapters, particularly if present at both ends of a sequence, will help substantially in the identification (and if necessary, filtering) of concatenated sequences that are not native to the sample.</p>
            <p>Although a non-negligible 1.7% of reads were found to have post-amplification chimeric elements, careful quality control of reads after long-read sequencing should be able to identify and exclude the majority of chimeric reads that are produced during a sequencing run.</p>
        </sec>
        <sec>
            <title>Data availability</title>
            <p>The data referenced by this article are under copyright with the following copyright statement: Copyright: &#x00ef;&#x00bf;&#x00bd; 2017 White R et al.</p>
            <p>Data associated with the article are available under the terms of the Creative Commons Zero "No rights reserved" data waiver (CC0 1.0 Public domain dedication).
                <ext-link ext-link-type="uri" xlink:href="http://creativecommons.org/publicdomain/zero/1.0/"/>
            </p>
            <p>Raw read signal and basecalled reads have been uploaded to ENA under accession number 
                <ext-link ext-link-type="uri" xlink:href="http://www.ebi.ac.uk/ena/data/view/PRJEB20601">PRJEB20601</ext-link>. Additional supplementary scripts used for FASTQ file filtering, mapping, and raw signal investigation are available as part of David Eccles&#x2019; bioinformatics script repository (doi, 
                <ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.5281/zenodo.556966">10.5281/zenodo.556966</ext-link>)
                <sup>
                    <xref ref-type="bibr" rid="ref-26">26</xref>
                </sup>. The following scripts from that repository were used for intermediate discovery and result generation:</p>
            <p>
                <bold>maf_bcsplit.pl</bold> Converting MAF format to machine-readable CSV with forward-oriented location information</p>
            <p>
                <bold>pos_aggregate.pl</bold> Merging adjacent MAF matches to the same target sequence in the same orientation</p>
            <p>
                <bold>fastx-fetch.pl</bold> Retrieving sequences from a FASTQ/FASTA file given a a list of identifiers (possibly as a text file)</p>
            <p>
                <bold>fastx-length.pl</bold> Generating length information and aggregate statistics for a FASTQ/FASTA file</p>
            <p>
                <bold>length_plot.r</bold> Generating "digital electrophoresis" image and read density plots given a file containing length information</p>
            <p>

                <bold>porejuicer.py</bold> Extracting raw data and called FASTQ files from FAST5 files</p>
            <p>A rough shell command script (including additional dead-end attempts at discovery &amp; analysis) is provided for reproduction and/or extension of these findings to other investigations (see 
                <xref ref-type="other" rid="SF6">Supplementary File 6</xref>).</p>
        </sec>
    </body>
    <back>
        <ack>
            <title>Acknowledgements</title>
            <p>We would like to thank the Nanopore Community for providing help and insightful discussion for this investigation.</p>
        </ack>
        <sec sec-type="supplementary-material">
            <title>Supplementary material</title>
            <p id="SF1">

                <bold>Supplementary File 1:</bold> Base calling summary from Albacore v0.7.5.</p>
            <p>
                <ext-link ext-link-type="uri" xlink:href="https://f1000researchdata.s3.amazonaws.com/supplementary/11547/ec920861-8317-4f25-90f8-61ebfde983fd.txt">Click here to access the data</ext-link>
            </p>
            <p id="SF2">

                <bold>Supplementary File 2:</bold> Reference sequences used for the initial amplicon mapping.</p>
            <p>
                <ext-link ext-link-type="uri" xlink:href="https://f1000researchdata.s3.amazonaws.com/supplementary/11547/721cb622-4393-489b-98ff-cb950e3eef85.fa">Click here to access the data</ext-link>
            </p>
            <p id="SF3">

                <bold>Supplementary File 3:</bold> Reference sequences used for 
                <italic toggle="yes">Ifna</italic> paralog mapping.</p>
            <p>
                <ext-link ext-link-type="uri" xlink:href="https://f1000researchdata.s3.amazonaws.com/supplementary/11547/1af9453a-88a3-4ed8-a4eb-1194918070d1.fa">Click here to access the data</ext-link>
            </p>
            <p id="SF4">

                <bold>Supplementary File 4:</bold> R script and intermediate data files used for 
                <italic toggle="yes">Ifna</italic>-family gene counting.</p>
            <p>
                <ext-link ext-link-type="uri" xlink:href="https://f1000researchdata.s3.amazonaws.com/supplementary/11547/67a33160-3885-4a40-b963-82a4e88cd52a.gz">Click here to access the data</ext-link>
            </p>
            <p id="SF5">

                <bold>Supplementary File 5:</bold> R script and intermediate data files used for chimeric read filtering.</p>
            <p>
                <ext-link ext-link-type="uri" xlink:href="https://f1000researchdata.s3.amazonaws.com/supplementary/11547/1db4c575-53f8-4ee5-870d-b3d9f38eae4b.gz">Click here to access the data</ext-link>
            </p>
            <p id="SF6">

                <bold>Supplementary File 6:</bold> Shell/process script for reproducing the data analysis.</p>
            <p>
                <ext-link ext-link-type="uri" xlink:href="https://f1000researchdata.s3.amazonaws.com/supplementary/11547/f7ba0e4e-0a9f-4008-8cd3-b3f55641a45e.txt">Click here to access the data</ext-link>
            </p>
        </sec>
        <ref-list>
            <ref id="ref-1">
                <label>1</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Levy</surname>
                            <given-names>SE</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Myers</surname>
                            <given-names>RM</given-names>
                        </name>
</person-group>:
                    <article-title>Advancements in Next-Generation Sequencing.</article-title>
                    <source>

                        <italic toggle="yes">Annu Rev Genomics Hum Genet.</italic>
</source>
                    <year>2016</year>;<volume>17</volume>(<issue>1</issue>):<fpage>95</fpage>&#x2013;<lpage>115</lpage>.
                    <pub-id pub-id-type="pmid">27362342</pub-id>
                    <pub-id pub-id-type="doi">10.1146/annurev-genom-083115-022413</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-2">
                <label>2</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Mikheyev</surname>
                            <given-names>AS</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Tin</surname>
                            <given-names>MM</given-names>
                        </name>
</person-group>:
                    <article-title>A first look at the Oxford Nanopore MinION sequencer.</article-title>
                    <source>

                        <italic toggle="yes">Mol Ecol Resour.</italic>
</source>
                    <year>2014</year>;<volume>14</volume>(<issue>6</issue>):<fpage>1097</fpage>&#x2013;<lpage>1102</lpage>.
                    <pub-id pub-id-type="pmid">25187008</pub-id>
                    <pub-id pub-id-type="doi">10.1111/1755-0998.12324</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-3">
                <label>3</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Chandler</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Camberis</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bouchery</surname>
                            <given-names>T</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Annotated mitochondrial genome with nanopore r9 signal for 
                        <italic toggle="yes">nippostrongylus brasiliensis</italic> [version 1; referees: 1 approved, 2 approved with reservations].</article-title>
                    <source>

                        <italic toggle="yes">F1000Res.</italic>
</source>
                    <year>2017</year>;<volume>6</volume>:<fpage>56</fpage>.
                    <pub-id pub-id-type="doi">10.12688/f1000research.10545.1</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-4">
                <label>4</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Reuter</surname>
                            <given-names>JA</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Spacek</surname>
                            <given-names>DV</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Snyder</surname>
                            <given-names>MP</given-names>
                        </name>
</person-group>:
                    <article-title>High-throughput sequencing technologies.</article-title>
                    <source>

                        <italic toggle="yes">Mol Cell.</italic>
</source>
                    <year>2015</year>;<volume>58</volume>(<issue>4</issue>):<fpage>586</fpage>&#x2013;<lpage>597</lpage>.
                    <pub-id pub-id-type="pmid">26000844</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.molcel.2015.05.004</pub-id>
                    <pub-id pub-id-type="pmcid">4494749</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-5">
                <label>5</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Walter</surname>
                            <given-names>MC</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Zwirglmaier</surname>
                            <given-names>K</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Vette</surname>
                            <given-names>P</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>MinION as part of a biomedical rapidly deployable laboratory.</article-title>
                    <source>

                        <italic toggle="yes">J Biotechnol.</italic>
</source>
                    <year>2016</year>; pii: S0168-1656(16)31640-6.
                    <pub-id pub-id-type="pmid">27939320</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.jbiotec.2016.12.006</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-6">
                <label>6</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Castro-Wallace</surname>
                            <given-names>SL</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Chiu</surname>
                            <given-names>CY</given-names>
                        </name>

                        <name name-style="western">
                            <surname>John</surname>
                            <given-names>KK</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Nanopore dna sequencing and genome assembly on the international space station.</article-title>
                    <source>

                        <italic toggle="yes">bioRxiv.</italic>
</source>
                    <year>2016</year>.
                    <pub-id pub-id-type="doi">10.1101/077651</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-7">
                <label>7</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Urbanc</surname>
                            <given-names>JM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Bliss</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Lawrence</surname>
                            <given-names>CE</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Sequencing ultra-long dna molecules with the oxford nanopore minion.</article-title>
                    <source>

                        <italic toggle="yes">bioRxiv.</italic>
</source>
                    <year>2015</year>.
                    <pub-id pub-id-type="doi">10.1101/019281</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-8">
                <label>8</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Simpson</surname>
                            <given-names>JT</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Workman</surname>
                            <given-names>RE</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Zuzarte</surname>
                            <given-names>PC</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Detecting DNA cytosine methylation using nanopore sequencing.</article-title>
                    <source>

                        <italic toggle="yes">Nat Methods.</italic>
</source>
                    <year>2017</year>;<volume>14</volume>(<issue>4</issue>):<fpage>407</fpage>&#x2013;<lpage>410</lpage>.
                    <pub-id pub-id-type="pmid">28218898</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nmeth.4184</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-9">
                <label>9</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Deschamps</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mudge</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Cameron</surname>
                            <given-names>C</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Characterization, correction and 
                        <italic toggle="yes">de novo</italic> assembly of an Oxford Nanopore genomic dataset from 
                        <italic toggle="yes">Agrobacterium tumefaciens</italic>.</article-title>
                    <source>

                        <italic toggle="yes">Sci Rep.</italic>
</source>
                    <year>2016</year>;<volume>6</volume>(<issue>1</issue>): 28625.
                    <pub-id pub-id-type="pmid">27350167</pub-id>
                    <pub-id pub-id-type="doi">10.1038/srep28625</pub-id>
                    <pub-id pub-id-type="pmcid">4923883</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-10">
                <label>10</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Quick</surname>
                            <given-names>J</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Grubaugh</surname>
                            <given-names>ND</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Pullan</surname>
                            <given-names>ST</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Multiplex pcr method for minion and illumina sequencing of zika and other virus genomes directly from clinical samples.</article-title>
                    <source>

                        <italic toggle="yes">bioRxiv.</italic>
</source>
                    <year>2017</year>.
                    <pub-id pub-id-type="doi">10.1101/098913</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-11">
                <label>11</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Ben&#x00ed;tez-P&#x00e1;ez</surname>
                            <given-names>A</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Portune</surname>
                            <given-names>KJ</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Sanz</surname>
                            <given-names>Y</given-names>
                        </name>
</person-group>:
                    <article-title>Species-level resolution of 16s rrna gene amplicons sequenced through the minion&#x2122; portable nanopore sequencer.</article-title>
                    <source>

                        <italic toggle="yes">Gigascience.</italic>
</source>
                    <year>2016</year>;<volume>5</volume>(<issue>1</issue>):<fpage>4</fpage>.
                    <pub-id pub-id-type="pmid">26823973</pub-id>
                    <pub-id pub-id-type="doi">10.1186/s13742-016-0111-z</pub-id>
                    <pub-id pub-id-type="pmcid">4730766</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-12">
                <label>12</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Jain</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Koren</surname>
                            <given-names>S</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Quick</surname>
                            <given-names>J</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Nanopore sequencing and assembly of a human genome with ultra-long read.</article-title>
                    <source>

                        <italic toggle="yes">bioRxiv.</italic>
</source>
                    <year>2017</year>.
                    <pub-id pub-id-type="doi">10.1101/128835</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-13">
                <label>13</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Hargreaves</surname>
                            <given-names>AD</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mulley</surname>
                            <given-names>JF</given-names>
                        </name>
</person-group>:
                    <article-title>Assessing the utility of the Oxford Nanopore MinION for snake venom gland cDNA sequencing.</article-title>
                    <source>

                        <italic toggle="yes">PeerJ.</italic>
</source>
                    <year>2015</year>;<volume>3</volume>:<fpage>e1441</fpage>.
                    <pub-id pub-id-type="pmid">26623194</pub-id>
                    <pub-id pub-id-type="doi">10.7717/peerj.1441</pub-id>
                    <pub-id pub-id-type="pmcid">4662598</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-14">
                <label>14</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Rand</surname>
                            <given-names>AC</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Jain</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Eizenga</surname>
                            <given-names>JM</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Mapping DNA methylation with high-throughput nanopore sequencing.</article-title>
                    <source>

                        <italic toggle="yes">Nat Methods.</italic>
</source>
                    <year>2017</year>;<volume>14</volume>(<issue>4</issue>):<fpage>411</fpage>&#x2013;<lpage>413</lpage>.
                    <pub-id pub-id-type="pmid">28218897</pub-id>
                    <pub-id pub-id-type="doi">10.1038/nmeth.4189</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-15">
                <label>15</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Smith</surname>
                            <given-names>AM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Jain</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Mulroney</surname>
                            <given-names>L</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Reading canonical and modified nucleotides in 16s ribosomal rna using nanopore direct rna sequencing.</article-title>
                    <source>

                        <italic toggle="yes">bioRxiv.</italic>
</source>
                    <year>2017</year>.
                    <pub-id pub-id-type="doi">10.1101/132274</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-16">
                <label>16</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Connor</surname>
                            <given-names>LM</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Tang</surname>
                            <given-names>SC</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Cognard</surname>
                            <given-names>E</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Th2 responses are primed by skin dendritic cells with distinct transcriptional profiles.</article-title>
                    <source>

                        <italic toggle="yes">J Exp Med.</italic>
</source>
                    <year>2017</year>;<volume>214</volume>(<issue>1</issue>):<fpage>125</fpage>&#x2013;<lpage>142</lpage>.
                    <pub-id pub-id-type="pmid">27913566</pub-id>
                    <pub-id pub-id-type="doi">10.1084/jem.20160470</pub-id>
                    <pub-id pub-id-type="pmcid">5206495</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-17">
                <label>17</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>van Pesch</surname>
                            <given-names>V</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Lanaya</surname>
                            <given-names>H</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Renauld</surname>
                            <given-names>JC</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Characterization of the murine alpha interferon gene family.</article-title>
                    <source>

                        <italic toggle="yes">J Virol.</italic>
</source>
                    <year>2004</year>;<volume>78</volume>(<issue>15</issue>):<fpage>8219</fpage>&#x2013;<lpage>8228</lpage>.
                    <pub-id pub-id-type="pmid">15254193</pub-id>
                    <pub-id pub-id-type="doi">10.1128/jvi.78.15.8219-8228.2004</pub-id>
                    <pub-id pub-id-type="pmcid">446145</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-18">
                <label>18</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Brinkmann</surname>
                            <given-names>V</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Geiger</surname>
                            <given-names>T</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Alkan</surname>
                            <given-names>S</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Interferon alpha increases the frequency of interferon gamma-producing human CD4+ t cells.</article-title>
                    <source>

                        <italic toggle="yes">J Exp Med.</italic>
</source>
                    <year>1993</year>;<volume>178</volume>(<issue>5</issue>):<fpage>1655</fpage>&#x2013;<lpage>1663</lpage>.
                    <pub-id pub-id-type="pmid">8228812</pub-id>
                    <pub-id pub-id-type="doi">10.1084/jem.178.5.1655</pub-id>
                    <pub-id pub-id-type="pmcid">2191249</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-19">
                <label>19</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Camberis</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Le Gros</surname>
                            <given-names>G</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Urban</surname>
                            <given-names>J</given-names>
                            <suffix>Jr</suffix>
                        </name>
</person-group>:
                    <article-title>Animal model of 
                        <italic toggle="yes">Nippostrongylus brasiliensis</italic> and 
                        <italic toggle="yes">Heligmosomoides polygyrus.</italic>
</article-title>
                    <source>

                        <italic toggle="yes">Curr Protoc Immunol.</italic>
</source>
                    <year>2003</year>;<volume>Chapter 19</volume>:<fpage>Unit 19.12</fpage>.
                    <pub-id pub-id-type="pmid">18432905</pub-id>
                    <pub-id pub-id-type="doi">10.1002/0471142735.im1912s55</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-20">
                <label>20</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>D&#x00e9;moulins</surname>
                            <given-names>T</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Baron</surname>
                            <given-names>ML</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Kettaf</surname>
                            <given-names>N</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Poly (I:C) induced immune response in lymphoid tissues involves three sequential waves of type I IFN expression.</article-title>
                    <source>

                        <italic toggle="yes">Virology.</italic>
</source>
                    <year>2009</year>;<volume>386</volume>(<issue>2</issue>):<fpage>225</fpage>&#x2013;<lpage>236</lpage>.
                    <pub-id pub-id-type="pmid">19232663</pub-id>
                    <pub-id pub-id-type="doi">10.1016/j.virol.2009.01.024</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-21">
                <label>21</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Frith</surname>
                            <given-names>MC</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Hamada</surname>
                            <given-names>M</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Horton</surname>
                            <given-names>P</given-names>
                        </name>
</person-group>:
                    <article-title>Parameters for accurate genome alignment.</article-title>
                    <source>

                        <italic toggle="yes">BMC Bioinformatics.</italic>
</source>
                    <year>2010</year>;<volume>11</volume>(<issue>1</issue>):<fpage>80</fpage>.
                    <pub-id pub-id-type="pmid">20144198</pub-id>
                    <pub-id pub-id-type="doi">10.1186/1471-2105-11-80</pub-id>
                    <pub-id pub-id-type="pmcid">2829014</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-22">
                <label>22</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Sinha</surname>
                            <given-names>R</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Stanley</surname>
                            <given-names>G</given-names>
                        </name>

                        <name name-style="western">
                            <surname>Gulati</surname>
                            <given-names>GS</given-names>
                        </name>

                        <etal/>
</person-group>:
                    <article-title>Index switching causes &#x201c;spreading-of-signal&#x201d; among multiplexed samples in illumina hiseq 4000 dna sequencing.</article-title>
                    <source>

                        <italic toggle="yes">bioRxiv.</italic>
</source>
                    <year>2017</year>.
                    <pub-id pub-id-type="doi">10.1101/125724</pub-id>
                </mixed-citation>
            </ref>
            <ref id="ref-23">
                <label>23</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Hadfield</surname>
                            <given-names>J</given-names>
                        </name>
</person-group>:
                    <article-title>Index mis-assignment between samples on hiseq 4000 and x-ten.</article-title>Core-Genomics Blog,<year>2016</year>.
                    <ext-link ext-link-type="uri" xlink:href="http://enseqlopedia.com/2016/12/index-mis-assignment-between-samples-on-hiseq-4000-and-x-ten/">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-24">
                <label>24</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Bushnell</surname>
                            <given-names>B</given-names>
                        </name>
</person-group>:
                    <article-title>Introducing crossblock, a bbtool for removing cross-contamination.</article-title>Seqanswers discussion thread,<year>2017</year>.
                    <ext-link ext-link-type="uri" xlink:href="http://seqanswers.com/forums/showthread.php?t=73736">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-25">
                <label>25</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Brown</surname>
                            <given-names>C</given-names>
                        </name>
</person-group>:
                    <article-title>Gridion x5 - the sequel.</article-title>Technology presentation,<year>2017</year>.
                    <ext-link ext-link-type="uri" xlink:href="https://nanoporetech.com/videos/clive-brown-gridion-x5-sequel">Reference Source</ext-link>
                </mixed-citation>
            </ref>
            <ref id="ref-26">
                <label>26</label>
                <mixed-citation publication-type="journal">
                    <person-group person-group-type="author">

                        <name name-style="western">
                            <surname>Eccles</surname>
                            <given-names>D (gringer)</given-names>
                        </name>
</person-group>:
                    <article-title>gringer/bioinfscripts: Chimeric read update</article-title>.<year>2017</year>.
                    <ext-link ext-link-type="uri" xlink:href="http://dx.doi.org/10.5281/zenodo.556966">Data Source</ext-link>
                </mixed-citation>
            </ref>
        </ref-list>
    </back>
    <sub-article article-type="reviewer-report" id="report23252">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.12473.r23252</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Timp</surname>
                        <given-names>Winston</given-names>
                    </name>
                    <xref ref-type="aff" rid="r23252a1">1</xref>
                    <role>Referee</role>
                    <uri content-type="orcid">https://orcid.org/0000-0003-2083-6027</uri>
                </contrib>
                <aff id="r23252a1">
                    <label>1</label>Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>I have two patents licensed to Oxford Nanopore.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>19</day>
                <month>6</month>
                <year>2017</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2017 Timp W</copyright-statement>
                <copyright-year>2017</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport23252" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.11547.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve-with-reservations</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>In this manuscript, the authors discuss the detection and potential sources of chimeric reads from minION (nanopore) sequencing. Though the manuscript has some interesting analyses and ideas, I have some problems with the results that should be addressed, specifically points 1, 3, 4, 5 and 7: 
                <list list-type="order">
                    <list-item>
                        <p>Though 2D libraries were prepared - only *0.3%* of the reads were actually 2D, the vast majority were 1D. This is in my view quite surprising - though with 2D libraries I have seen plenty of 1D reads mixed in, this level of 2D/1D for a 2D prep suggests something strange upstream of sequencing is occurring.&#x00a0;However, the authors decline to address it, &#x201c;The reasons behind this basecall failure were not investigated&#x201d;. &#x00a0;I think this must be addressed more carefully to understand the results.</p>
                    </list-item>
                    <list-item>
                        <p>I feel the low % of 2D reads is important because it may play into the source of the chimeras - if the 2D calling is failing due to heterogenous DNA strands - i.e. hybridization of an IFN to an Actb for example, then end polishing and adapting would lead to a 2D read where the two strands don&#x2019;t match, hence called as 1D.</p>
                    </list-item>
                    <list-item>
                        <p>The authors suggest that amplicon sequencing is more susceptible to chimeras because &#x201c;the same sequencing will appear in increased abundance&#x201d; - I&#x2019;m not clear on why that makes chimeras more frequent, just that it makes them more likely to be easily detectable. &#x00a0;</p>
                    </list-item>
                    <list-item>
                        <p>The authors discuss &#x201c;multiple hairpin adapters being ligated in the same sequence&#x201d;.&#x00a0;I don&#x2019;t understand how the authors think this is possible? There are only two free ends of DNA, and if there are hairpins on both, the DNA will not be able to enter the pore.&#x00a0;Instead I suggest it could be the proposed &#x201c;in silico&#x201d; chimerism the authors later discuss.</p>
                    </list-item>
                    <list-item>
                        <p>PCR Chimeras are not unknown in the literature - having been described, for example, in Sanger and 454 here (PMID: 21212162, 20833233). The authors&#x2019; assumption that the chimerism is occurring downstream of PCR needs to be demonstrated - Figure 3 suggests that the length of the chimeric is not outside the range of either Illumina or Sanger sequencing, so could be easily validated with these technologies.</p>
                    </list-item>
                    <list-item>
                        <p>But - given the multiple ligation steps of this protocol, it seems likely that the dA-tailing failing some fraction of the time could results in blunt-end ligation and chimeric reads.</p>
                    </list-item>
                    <list-item>
                        <p>How does enrichment look comparing the overnight to quick ligation for the different categories of chimerism detailed? The only results given are overall chimerism.</p>
                    </list-item>
                    <list-item>
                        <p>The authors only tried overnight ligation/quick ligation for the last ligation step, but not for the barcode ligation step.&#x00a0;I also wonder if a PCR-barcode may have given better results - the multiple ligations may have led to a higher rate of chimeras, as the end-polishing likely had some fraction of blunt-ended amplicons.</p>
                    </list-item>
                    <list-item>
                        <p>Another possible point was the the authors may have not added enough (relative) adaptors - the relative high concentration of template allowed self-ligations to be more frequent.&#x00a0;Adaptor dimers are probably easier to eliminate in this case than chimeras.</p>
                    </list-item>
                </list>
            </p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>Yes</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>Not applicable</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Yes</p>
            <p>Is the study design appropriate and is the work technically sound?</p>
            <p>Partly</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>Partly</p>
            <p>Are sufficient details of methods and analysis provided to allow replication by others?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>Biophysics, Sequencing, Epigenetics</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.</p>
        </body>
        <sub-article article-type="response" id="comment2810-23252">
            <front-stub>
                <contrib-group>
                    <contrib contrib-type="author">
                        <name>
                            <surname>Eccles</surname>
                            <given-names>David</given-names>
                        </name>
                        <aff>Malaghan Institute of Medical Research, New Zealand</aff>
                    </contrib>
                </contrib-group>
                <author-notes>
                    <fn fn-type="conflict">
                        <p>
                            <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                    </fn>
                </author-notes>
                <pub-date pub-type="epub">
                    <day>20</day>
                    <month>6</month>
                    <year>2017</year>
                </pub-date>
            </front-stub>
            <body>
                <p>Thanks for you comments. We will be incorporating your suggested changes into a revised manuscript. To make sure we've got the right idea about your suggestions, here are my initial impressions: 
                    <list list-type="order">
                        <list-item>
                            <p>I understand what happened now after talking to Forrest Brennen at the London Calling conference this year. 2D reads were not called because the older hairpin was used. Even though we used a newer 2D kit, the barcode kit had the old hairpin sequence in it (which wasn't detected by the Albacore caller used at the time). This mis-call has either been fixed now, or will be fixed soon -- I'll re-call the reads with the most recent Albacore and if no improvement will talk to ONT about the correct software tweaks to fix it. In any case, plenty of hairpin sequences were detected in the linear/1D base calls.</p>
                        </list-item>
                        <list-item>
                            <p>We initially thought that the failure of 2D was due to chimeric reads, because it seemed to be occurring at the 2D alignment step. However, reads that were very obviously not chimeric were still failing the 2D calling. There is a chance that a properly called chimeric read will fail the 2D alignment step, but I expect it will still have a called template + complement sequence. I don't recall seeing many situations where the chimerism had happened on a single strand; it was mostly double-stranded fragments that had joined together.</p>
                        </list-item>
                        <list-item>
                            <p>My theory on why amplicon sequencing is more susceptible to chimeras is that it encourages the formation of base pairing structures (e.g. quadruplexes) that bring the ends of similar sequences closer to each other. I don't know how deep we should go into this; it's a hypothesis about why they could be in higher abundance for our specific experiment, but we haven't tested whether or not amplicon sequencing runs have a higher rate of chimeric reads.</p>
                        </list-item>
                        <list-item>
                            <p>Multiple hairpin adapters do make sense; David Stoddart (ONT sample prep guru) was with me when I was doing some "napkin drawings" of the structure that was formed by a 3-hairpin sequence, and helped correct a bit of the structure. We should add that into the paper (I think he kept the drawing, but I can make another one).</p>
                        </list-item>
                        <list-item>
                            <p>PCR chimeras may exist in our results, but appear to be of low abundance according to the electrophoresis plot, and I've tried to analyse the results in such a way that PCR chimeras would be excluded. Our experimental design was such that barcoding (and mixing of separate amplicons) happened after the PCR was done, so the results should be at worst an underestimate of chimerism, and PCR chimeras should be observable in the data.</p>
                        </list-item>
                        <list-item>
                            <p>Yes; dA-tailing failure seems to be a likely explanation. Due to the physics of chemical reactions, there are going to be some that don't work. However, it is a little bit curious that the overnight ligation produced more chimeric reads. If the chimerism were due solely to dA-tailing failure, then increased abundance for overnight ligation doesn't make sense.</p>
                        </list-item>
                        <list-item>
                            <p>Different categories of chimerism are outlined in Table 1, and the four categories are broken down into overnight/quick in figure 5. I've included a full text description of each 
                                <bold>*read*</bold> in the supplementary information, but we felt that it was overly complex to include all those details in a graph.</p>
                        </list-item>
                        <list-item>
                            <p>The overnight/quick difference was unexpected (particularly in the direction that it happened). While ONT have discontinued their 2D hairpin kit, we would still be able to carry out a subsequent investigation in the future to look at overnight vs quick ligation during barcoding.</p>
                        </list-item>
                        <list-item>
                            <p>We tried to add the recommended amount of adapters to the samples, and the called results suggest that adapter dimers were minimal. Those situations are very easy to pull out at the analysis stage, because there are no amplicon sequences between adapters.</p>
                        </list-item>
                    </list>
                </p>
            </body>
        </sub-article>
    </sub-article>
    <sub-article article-type="reviewer-report" id="report22553">
        <front-stub>
            <article-id pub-id-type="doi">10.5256/f1000research.12473.r22553</article-id>
            <title-group>
                <article-title>Reviewer response for version 1</article-title>
            </title-group>
            <contrib-group>
                <contrib contrib-type="author">
                    <name>
                        <surname>Robison</surname>
                        <given-names>Keith E.</given-names>
                    </name>
                    <xref ref-type="aff" rid="r22553a1">1</xref>
                    <role>Referee</role>
                </contrib>
                <aff id="r22553a1">
                    <label>1</label>Warp Drive Bio, Cambridge, MA, USA</aff>
            </contrib-group>
            <author-notes>
                <fn fn-type="conflict">
                    <p>
                        <bold>Competing interests: </bold>No competing interests were disclosed.</p>
                </fn>
            </author-notes>
            <pub-date pub-type="epub">
                <day>17</day>
                <month>5</month>
                <year>2017</year>
            </pub-date>
            <permissions>
                <copyright-statement>Copyright: &#x00a9; 2017 Robison KE</copyright-statement>
                <copyright-year>2017</copyright-year>
                <license xlink:href="https://creativecommons.org/licenses/by/4.0/">
                    <license-p>This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</license-p>
                </license>
            </permissions>
            <related-article ext-link-type="doi" id="relatedArticleReport22553" related-article-type="peer-reviewed-article" xlink:href="10.12688/f1000research.11547.1"/>
            <custom-meta-group>
                <custom-meta>
                    <meta-name>recommendation</meta-name>
                    <meta-value>approve</meta-value>
                </custom-meta>
            </custom-meta-group>
        </front-stub>
        <body>
            <p>The authors have provided a useful report on technical aspects of the emerging Oxford Nanopore MinION DNA sequencing technology. &#x00a0;As noted in the paper, the library preparation methods scrutinized here are commonly found in multiple advanced DNA sequencing technologies, and so lessons learned in this work are likely applicable elsewhere.</p>
            <p> </p>
            <p> In the methods section, the authors report injected "dead infectious" worms, but not how the worms were killed.</p>
            <p> </p>
            <p> It would be greatly preferable for all of the input DNA amounts to library preparations to be given in both mass and fmol. &#x00a0;While it is common to report masses of DNA, the ligations really are dependent on the availability of DNA ends.&#x00a0;</p>
            <p> </p>
            <p> The point Figure 8 is trying to convey would be greatly enhanced by adding a zoom of the region around 8s in the plot in which the temporal sequence barcodeNB11-open pore--stall-barcode NB06 is seen. &#x00a0;Zooms of other transitions should be considered as well.</p>
            <p>Is the work clearly and accurately presented and does it cite the current literature?</p>
            <p>Yes</p>
            <p>If applicable, is the statistical analysis and its interpretation appropriate?</p>
            <p>Yes</p>
            <p>Are all the source data underlying the results available to ensure full reproducibility?</p>
            <p>Yes</p>
            <p>Is the study design appropriate and is the work technically sound?</p>
            <p>Yes</p>
            <p>Are the conclusions drawn adequately supported by the results?</p>
            <p>Yes</p>
            <p>Are sufficient details of methods and analysis provided to allow replication by others?</p>
            <p>Yes</p>
            <p>Reviewer Expertise:</p>
            <p>NA</p>
            <p>I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.</p>
        </body>
    </sub-article>
</article>
