Keywords
Black Bengal goat, whole genome sequence, short reads, SNP
This article is included in the Genomics and Genetics gateway.
Black Bengal goat, whole genome sequence, short reads, SNP
The Black Bengal goat (BBG) is a small-sized breed of goat (Capra hircus) distributed throughout Bangladesh, West Bengal, Bihar, and Orissa regions of northeastern India. The predominant coat color of this breed is black but it is also found in brown, white and gray (Jalil et al., 2018). It is a heritage goat breed of Bangladesh, and well known for its high fertility, excellent meat and skin quality. This animal is a source of high quality meat, milk, and leather, and contributes substantially to the economy of Bangladesh (Amin et al., 2000; Faruque et al., 2017). The BBG is reported to have originated from wild goat, also known as the bezoar or Pasang (Capra aegagrus) (Herre & Röhrs, 1990), having introgressed genes from the markhor (Capra falconeri). Inheritance of genetic materials from the goats from the Southern region of China to the BBG has been hypothesized, given the historical cultural and geographical connection between South China and the Bengal across the South-Eastern offshoot of the Tibetan plateau (Nozawa, 1991). Despite its economic importance, no large-scale genomic resource is available to date for this goat breed. Here we used the Illumina HiSeq sequencing platform to sequence the whole genome of the BBG, generated short reads and identified high quality genome wide distributed single nucleotide polymorphisms (SNPs). These data might provide useful insight for conducting genome wide association studies, the identification of quantitative trait loci (QTLs) and functional genomic analysis of the Black Bengal goat.
The experimental goat was reared at Bangladesh Livestock Research Institute (BLRI) goat farm under semi-intensive management system including slatted floor, well ventilated open sided house attached to pasture. All efforts were made to ameliorate harm to the animal. A small piece of ear tissue from an adult (30 months old) pedigreed goat (BioSample SAMN10391846) was collected by ear punching using a sterilized tissue puncher following local anesthesia (Lidocaine hydrochloride, 2%) on the right ear and immediately frozen into liquid nitrogen. Prior to ear punching, the goat was handled calmly with great care by a trained animal operator to prevent distress and injury to the animal and the handler. The tissue punching site was finally treated with antiseptic cream (Cetrimide, 0.5% and chlorhexidine digluconate 0.1%). All the animal procedure conformed the guidelines of the AWEC (Animal Welfare and Ethical Committee) of Bangladesh Agricultural University.
The tissue was finely ground by Micro Pestle (Sigma-Aldrich cat # SIAL501ZZ0), and high molecular weight DNA was extracted from the fresh frozen tissue using the Phenol:chloroform:isoamyl alcohol method (Sambrook & Russell, 2001). DNA purity was evaluated by NanoDrop 1000 Spectrophotometer (Life Technologies, CA, USA) and 0.8% agarose gel electrophoresis. DNA quantity was quantified using Qubit 2.0 Fluorometer and Qubit dsDNA HS Assay Kit (Life Technologies, CA, USA cat # Q32851). DNA was fragmented by acoustic disruption using Covaris S220 ultrasonicator and then underwent end repair, detailing, adapter ligation and purification (NEBNext UltraII DNA library Prep Kit cat # E7645S) following manufacturer instructions. The purified DNA was further selected for the right size before PCR amplification for library construction. The preliminary quantification and dilution of the library was performed using Qubit 2.0 Fluorometer, and, then Agilent 2100 Bioanalyzer was used to determine the insert size and nucleic acid concentration of the resulting library. The effective concentration of each sample in the library mixture was determined by qPCR (ABI 7500, Applied Biosystems, CA, USA) using the KAPA Library Quantification Kit (Cat. # KK4824) following the manufacturer’s standard protocol with the primer pair Primer 1: 5'-AAT GAT ACG GCG ACC ACC GA-3' Primer 2: 5'-CAA GCA GAA GAC GGC ATA CGA-3'. The PCR conditions were as follow: an initial denaturation at 95°C for 5 min followed by 35 cycles (denaturation at 95°C for 30 sec, annealing/extension/data acquisition at 60°C for 45 sec) and melt curve analysis at 65 – 95°C before sequencing to ensure the accuracy of the sample concentration.
Sequencing was performed on the Illumina system (HiseqX) according to manufacturer’s instructions. The samples were sequenced using a 2 × 150 paired-end (PE) configuration (GENEWIZ, Suzhou, China) using Illumina Truseq SBS Kit v4 (cat # FC-401-4003) in high output mode. Base calling was achieved with the sequencer built-in software Real-Time Analysis (RTA) (v1.5.15.1), which performs real-time conversion of the four fluorescent signals obtained from CCD (charge-coupled device) to binary base call (BCL) data. BCL data were then converted to fastq files using bcl2fastq (v2.17, Illumina). Data demultiplexing was then performed simultaneously based on index information.
Primary analysis was performed using the sequencer’s built-in software HCS (v3.4.0) to determine whether the read can pass the chastity filter based on the signal quality of the first 25 cycles. If the read had no more than 2 out of the 25 cycles with chastity values below 0.6, the read was called PF (Pass Filter). PF clusters converted by bcl2fastq were called PF data and stored in FASTQ format. The raw data were filtered to remove adapter sequences, PE reads having Q scores of < 20 and N composition of >10%. After primary cleaning of reads, mitochondrial genomes were removed. Then the remaining high quality, contamination-free reads were aligned to both the San Clemente (GCA_001704415.1) and the Yunnan black goat genome (GCA_000317765.2) separately using Bowtie2 (v2.3.4.3) (Langmead & Salzberg, 2012). Samtools (v1.9) (Li et al., 2009) was used to convert the resulting SAM sequence alignment files to BAM format, followed by sorting, indexing and quality filtering. BCFtools (v1.9) (Narasimhan et al., 2016) was used to call and filter the variants.
A total of 833,469,900 raw reads consisting of 125,020,485,000 bases were obtained by sequencing of one male BBG sample (BioSample SAMN10391846). After the QC, a total of 812,209,030 reads containing 118,911,538,136 bases were kept which was 97.45% of the total raw reads. After quality filtration and removal of the mitochondrial genome, the reads were aligned to the San Clemente and the Yunnan black goat genome which resulted in 98.65% (properly paired, 94.81%) and 98.50% (properly paired, 97.10%) of the reads aligning, respectively. Additionally, a total of 9,497,875 high quality SNPs (Q ≥ 20) along with 1,023,359 indels were identified in BBG versus the San Clemente genome (See underlying data (Mollah et al., 2019a)). Similarly 8,746,849 high quality (Q ≥ 20) SNPs along with 842,706 indels were identified BBG versus the Yunnan black goat genome genome (See underlying data (Mollah et al., 2019b)). The transition and transversion ratio was 2.27 and 2.29 in BBG against the San Clemente and the Yunnan black goat respectively.
Capra hircus isolate:Black Bengal Goat breed:Black Bengal Goat (goat). BioProject PRJNA504436:
https://www.ncbi.nlm.nih.gov/bioproject/PRJNA504436
This project contains the following underlying data:
SRR8549904 (Alignment of BBG whole genome sequences with Yunnan Black goat)
SRR8549413 (Alignment of BBG whole genome sequence with San Clemente goat)
SRR8182317 (WGS of Black Bengal Goat: Adult male ear tissue)
Figshare: Genome-wide distributed SNPs identified in the Black Bengal Goat versus the San Clemente genome. https://doi.org/10.6084/m9.figshare.7708010.v1 (Mollah et al., 2019a)
This project contains the following underlying data:
BBG_aln_SanClemente.bam.calls.q20.bcf (SNP data identified in the Black Bengal Goat versus the San Clemente genome)
BBG_aln_SanClemente.bam.calls.q20.bcf.stats (output file from analysis of .bcf file with bcftools)
Figshare: Genome-wide distributed SNPs identified in the Black Bengal goat versus the Yunnan Black goat genome. https://doi.org/10.6084/m9.figshare.7707929.v1 (Mollah et al., 2019b)
This project contains the following underlying data:
BBG_aln_Yunnan.bam.calls.q20.bcf (SNP data identified in the Black Bengal Goat versus the Yunnan Black goat genome)
BBG_aln_Yunnan.bam.calls.q20.bcf.stats (output file from analysis of .bcf file with bcftools)
Data are available under the terms of the Creative Commons Attribution 4.0 International license (CC-BY 4.0).
This work was supported partly by Bangladesh Agricultural University Research System (BAURES) [2018/671].
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
We thank Bangladesh Livestock Research Institute (BLRI) for providing pedigreed Black Bengal goat for sample collection. We also acknowledge the BdREN (Bangladesh Research and Education Network, UGC) for sharing high performance computing infrastructure.
Views | Downloads | |
---|---|---|
F1000Research | - | - |
PubMed Central
Data from PMC are received and updated monthly.
|
- | - |
Is the rationale for creating the dataset(s) clearly described?
Yes
Are the protocols appropriate and is the work technically sound?
Yes
Are sufficient details of methods and materials provided to allow replication by others?
Partly
Are the datasets clearly presented in a useable and accessible format?
Yes
Competing Interests: No competing interests were disclosed.
Reviewer Expertise: Genomics, Bioinformatic analysis
Is the rationale for creating the dataset(s) clearly described?
Partly
Are the protocols appropriate and is the work technically sound?
Yes
Are sufficient details of methods and materials provided to allow replication by others?
Yes
Are the datasets clearly presented in a useable and accessible format?
Yes
References
1. Dong Y, Xie M, Jiang Y, Xiao N, et al.: Sequencing and automated whole-genome optical mapping of the genome of a domestic goat (Capra hircus).Nat Biotechnol. 2013; 31 (2): 135-41 PubMed Abstract | Publisher Full TextCompeting Interests: No competing interests were disclosed.
Reviewer Expertise: Genome assembly
Alongside their report, reviewers assign a status to the article:
Invited Reviewers | ||
---|---|---|
1 | 2 | |
Version 1 21 Mar 19 |
read | read |
Provide sufficient details of any financial or non-financial competing interests to enable users to assess whether your comments might lead a reasonable person to question your impartiality. Consider the following examples, but note that this is not an exhaustive list:
Sign up for content alerts and receive a weekly or monthly email with all newly published articles
Already registered? Sign in
The email address should be the one you originally registered with F1000.
You registered with F1000 via Google, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Google account password, please click here.
You registered with F1000 via Facebook, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Facebook account password, please click here.
If your email address is registered with us, we will email you instructions to reset your password.
If you think you should have received this email but it has not arrived, please check your spam filters and/or contact for further assistance.
Comments on this article Comments (0)