Keywords
Gene-editing , CRISPR-Cas, AAV, plasmid integration
Gene-editing , CRISPR-Cas, AAV, plasmid integration
Nuclease based gene-editing techniques (Cas9/ZFN/TALEN - CaZiTa) introduce a double stranded break at a specified location with the guide of DNA-binding proteins (ZFN/TALEN) or RNA (CRISPR)1. Delivery within cells is a critical step for such gene-therapies2. The ability of recombinant adeno-associated virus (rAAV) to enter cells makes it a perfect choice as a vector for gene therapy3. A plasmid comprising the rAAV, CaZiTa, guide RNAs (for CRISPR), etc. is expected to enter the cell, edit, and remain episomal4. However, the rather obvious danger of integration of the plasmid into the genome is ignored. A recent pre-print5 has highlighted that bacterial genes from the template plasmid (pCR2.1-TOPO) has integrated into bovine genomes6.
Recently, Massachusetts General Hospital published data on CRISPR edits (Accid:PRJNA563918), and concluded that ‘AAV integration should be recognised as a common outcome for applications that utilize AAV for genome editing’7. However, there was no report of Cas9 integration. Here, data showing Cas9 integrating in the exact edit sites is shown. The same caveat applies to all three CaZiTa gene-therapies1.
Sequencing data was obtained from BioProject accession number, PRJNA563918, using SRA download tools (https://github.com/ncbi/sra-tools/wiki). This project aims at finding “the genomic consequences of transduction with AAV vectors encoding CRISPR-Cas nucleases”7.
The reads were uniquified, split into open-reading frames using the getorf (version:6.6.0.0) program from the EMBOSS suite (http://emboss.sourceforge.net/download/)8.
Command: “getorf -find 1 -sequence <infile> -outseq <output> -minsize 10”.
Exact 10-kmer were aligned to the Cas9 protein (Accid:AGZ01981.1, 1417aa). Subsequently, these reads were then aligned to the gene of interest - TMC1(Accid:NG_008213.1) or DMD(Accid:NM_001314034.1). The online BLAST interface using the default settings (https://blast.ncbi.nlm.nih.gov/Blast.cgi) was used to generate images of integration with the genes of interest9. The number of reads with integration is conservative, since reads with less than a 10-kmer match were ignored. The AAV genome used was Accid:MK163936.1. Sequences that integrate Cas9/AAV with TMC1 and DMD are provided in Extended data: TMC1.cas9.fa/TMC1.aav.fa and DMD.cas9.fa/DMD.aav.fa, respectively.
TMC1, a transmembrane protein, is required for proper functioning of cochlear hair cells, and has been implicated in hearing loss and prelingual deafness10. In mice, this gene is located in chr19 (Accid:NG_008213.1)). Table 1 lists the samples sequenced for targeting this gene. As expected, the non-injected controls have no Cas9 reads. The sample SRR10068671 (marked with triple asterisks) has probably been mis-annotated as one with “no gRNA”, since Cas9 integrations are detected in that sample. From the Cas9 integration site, the guide RNA (gRNA) for the TMC1 gene can be deduced to be “CATGGTAATGTCCCTCCTGGGGA”, although this information is not yet available. The sequences for these reads are provided as extended data. As a specific example, the sequence (SRR10068639.63180.1) encodes the 26aa peptide=SPEKLLMYHHDPQTYQKLKLIMEQYG from Cas9 and is merged with the TMC1 gene (Figure 1).
The paired reads are clubbed into a single file, and then uniquified. Open reading frames from these are then matched to Cas9 proteins for 10 aa, and subsequently these reads are matched to the gene of interest (TMC1 or DMD), helping identify the exact edit site. The sample SRR10068671 (marked with triple asterisks) has probably been mis-annotated as one with “no gRNA”, since Cas9 integrations are detected in that sample. These integrations happen both for SpCas9 and SaCas9. From the Cas9 integration site, the gRNA for the TMC1 gene can be deduced to be “CATGGTAATGTCCCTCCTGGGGA”, although this information is not yet available. CN, cortical neuron; CA: cochlea, adult.
Duchenne muscular dystrophy (DMD), a genetic disorder associated with progressive muscle degeneration (dystrophy), is caused by aberrations in the dystrophin protein, located on chromosome X in humans and mice11. According to the data (Accid:PRJNA563918), different exons and introns have been targeted using two different variations of the Cas9 - SpCas9 and SaCas9 - which differ in certain characteristics12. SaCas9 does seem empirically to have more samples without any integration, but that might just be random. The sequences for these reads are provided as extended data. As a specific example, the sequence (SRR10068622.33932.1) encodes the 12aa peptide=LDATLIHQSITG from Cas9 and is merged with the DMD gene (Figure 2).
The integration of AAV into the genome has already been noted7. This has been replicated in this study as well. The AAV genome used was Accid:MK163936.1. The reads can be found in Extended data: TMC1.aav.fa (N=5000) and DMD.aav.fa (N=14000).
Gene-editing based therapies provide revolutionary hope for curing many diseases. However, ensuring safety of any such endeavours must be paramount to avoid doing more harm13,14. Plasmid integration is one such potential hazard5. Recently, Massachusetts General Hospital reported high-levels of in vivo AAV into the genome while providing sequencing data (Accid:PRJNA563918), and concluded that ‘AAV integration should be recognized as a common outcome for applications that utilize AAV for genome editing’7. In this study, in addition to AAV, Cas9 integration is shown in the same samples. The same caveat applies to gene-therapies using CaZiTa1. Off-target edits (OTE) are an important aspect of CRISPR-cas gene-editing15–19. Such integrations, found by targeting amplicon sequencing, will only get worse due to OTEs, which are hard to detect20–22. Another problem is pre-existing immunity to Cas9 proteins23,24. This problem can be mitigated by sending the CaZiTa as protein25, but that would seriously restrict the use-cases, and also suffer from OTEs, large deletions, complex rearrangements and translocations26, or even including fragments from exosomes27. This is an important distinction between AAV and CaZiTa integration, since AAV integration can be tolerated (integration at chromosome breakage points28, though there is debate on its role in hepatocellular carcinoma29,30), Cas9 integration is a huge, and unacceptable danger. After integration, individuals may be expressing Cas9, and that is fraught with genotoxic danger.
Raw sequence reads from Massachusetts General Hospital, Accession number PRJNA563918, https://www.ncbi.nlm.nih.gov/bioproject/PRJNA563918/
Zenodo: Sequencing data from Massachusetts General Hospital shows Cas9 integration into the genome, highlighting a serious hazard in gene-editing therapeutics, https://doi.org/10.5281/zenodo.346030531
This project contains the following extended data:
Data are available under the terms of the Creative Commons Attribution 4.0 International license (CC-BY 4.0).
Views | Downloads | |
---|---|---|
F1000Research | - | - |
PubMed Central
Data from PMC are received and updated monthly.
|
- | - |
Is the work clearly and accurately presented and does it cite the current literature?
Partly
Is the study design appropriate and is the work technically sound?
Yes
Are sufficient details of methods and analysis provided to allow replication by others?
Yes
If applicable, is the statistical analysis and its interpretation appropriate?
Yes
Are all the source data underlying the results available to ensure full reproducibility?
Yes
Are the conclusions drawn adequately supported by the results?
Yes
Competing Interests: No competing interests were disclosed.
Reviewer Expertise: Genome editing, Hematopoietic stem cell gene therapy
Is the work clearly and accurately presented and does it cite the current literature?
Partly
Is the study design appropriate and is the work technically sound?
Yes
Are sufficient details of methods and analysis provided to allow replication by others?
Yes
If applicable, is the statistical analysis and its interpretation appropriate?
Partly
Are all the source data underlying the results available to ensure full reproducibility?
Yes
Are the conclusions drawn adequately supported by the results?
Partly
References
1. Liang X, Potter J, Kumar S, Zou Y, et al.: Rapid and highly efficient mammalian cell engineering via Cas9 protein transfection.J Biotechnol. 2015; 208: 44-53 PubMed Abstract | Publisher Full TextCompeting Interests: No competing interests were disclosed.
Reviewer Expertise: T cell Biology; RNA Biology; Cancer Biology; Molecular Biology; Regulation of gene expression; Protein-RNA interactions; Biochemistry and Molecular Biophysics
Alongside their report, reviewers assign a status to the article:
Invited Reviewers | ||
---|---|---|
1 | 2 | |
Version 1 04 Nov 19 |
read | read |
Provide sufficient details of any financial or non-financial competing interests to enable users to assess whether your comments might lead a reasonable person to question your impartiality. Consider the following examples, but note that this is not an exhaustive list:
Sign up for content alerts and receive a weekly or monthly email with all newly published articles
Already registered? Sign in
The email address should be the one you originally registered with F1000.
You registered with F1000 via Google, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Google account password, please click here.
You registered with F1000 via Facebook, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Facebook account password, please click here.
If your email address is registered with us, we will email you instructions to reset your password.
If you think you should have received this email but it has not arrived, please check your spam filters and/or contact for further assistance.
Comments on this article Comments (0)