ALL Metrics
-
Views
-
Downloads
Get PDF
Get XML
Cite
Export
Track
Data Note

Whole genome sequence and genome-wide distributed single nucleotide polymorphisms (SNPs) of the Black Bengal goat

[version 1; peer review: 2 approved with reservations]
PUBLISHED 21 Mar 2019
Author details Author details
OPEN PEER REVIEW
REVIEWER STATUS

This article is included in the Genomics and Genetics gateway.

Abstract

The Black Bengal goat (BBG) is a dwarf sized heritage goat (Capra hircus) breed from Bangladesh, and is well known for its high fertility, excellent meat and skin quality. Here we present the first whole genome sequence and genome-wide distributed single nucleotide polymorphisms (SNPs) of the BBG. A total of 833,469,900 raw reads consisting of 125,020,485,000 bases were obtained by sequencing one male BBG sample. The reads were aligned to the San Clemente and the Yunnan black goat genome which resulted in 98.65% (properly paired, 94.81%) and 98.50% (properly paired, 97.10%) of the reads aligning, respectively. Notably, the estimated sequencing coverages were 48.22X and 44.28X compared to published San Clemente and the Yunnan black goat genomes respectively. On the other hand, a total of 9,497,875 high quality SNPs (Q ≥ 20) along with 1,023,359 indels, and 8,746,849 high quality SNPs along with 842,706 indels were identified in BBG against the San Clemente and Yunnan black goat genomes respectively. The dataset is publicly available from NCBI BioSample (SAMN10391846), Sequence Read Archive (SRR8182317, SRR8549413 and SRR8549904), with BioProject ID PRJNA504436. These data might be useful genomic resources in conducting genome wide association studies, identification of quantitative trait loci (QTLs) and functional genomic analysis of the Black Bengal goat.

Keywords

Black Bengal goat, whole genome sequence, short reads, SNP

Introduction

The Black Bengal goat (BBG) is a small-sized breed of goat (Capra hircus) distributed throughout Bangladesh, West Bengal, Bihar, and Orissa regions of northeastern India. The predominant coat color of this breed is black but it is also found in brown, white and gray (Jalil et al., 2018). It is a heritage goat breed of Bangladesh, and well known for its high fertility, excellent meat and skin quality. This animal is a source of high quality meat, milk, and leather, and contributes substantially to the economy of Bangladesh (Amin et al., 2000; Faruque et al., 2017). The BBG is reported to have originated from wild goat, also known as the bezoar or Pasang (Capra aegagrus) (Herre & Röhrs, 1990), having introgressed genes from the markhor (Capra falconeri). Inheritance of genetic materials from the goats from the Southern region of China to the BBG has been hypothesized, given the historical cultural and geographical connection between South China and the Bengal across the South-Eastern offshoot of the Tibetan plateau (Nozawa, 1991). Despite its economic importance, no large-scale genomic resource is available to date for this goat breed. Here we used the Illumina HiSeq sequencing platform to sequence the whole genome of the BBG, generated short reads and identified high quality genome wide distributed single nucleotide polymorphisms (SNPs). These data might provide useful insight for conducting genome wide association studies, the identification of quantitative trait loci (QTLs) and functional genomic analysis of the Black Bengal goat.

Methods

Experimental animal

The experimental goat was reared at Bangladesh Livestock Research Institute (BLRI) goat farm under semi-intensive management system including slatted floor, well ventilated open sided house attached to pasture. All efforts were made to ameliorate harm to the animal. A small piece of ear tissue from an adult (30 months old) pedigreed goat (BioSample SAMN10391846) was collected by ear punching using a sterilized tissue puncher following local anesthesia (Lidocaine hydrochloride, 2%) on the right ear and immediately frozen into liquid nitrogen. Prior to ear punching, the goat was handled calmly with great care by a trained animal operator to prevent distress and injury to the animal and the handler. The tissue punching site was finally treated with antiseptic cream (Cetrimide, 0.5% and chlorhexidine digluconate 0.1%). All the animal procedure conformed the guidelines of the AWEC (Animal Welfare and Ethical Committee) of Bangladesh Agricultural University.

Sample processing

The tissue was finely ground by Micro Pestle (Sigma-Aldrich cat # SIAL501ZZ0), and high molecular weight DNA was extracted from the fresh frozen tissue using the Phenol:chloroform:isoamyl alcohol method (Sambrook & Russell, 2001). DNA purity was evaluated by NanoDrop 1000 Spectrophotometer (Life Technologies, CA, USA) and 0.8% agarose gel electrophoresis. DNA quantity was quantified using Qubit 2.0 Fluorometer and Qubit dsDNA HS Assay Kit (Life Technologies, CA, USA cat # Q32851). DNA was fragmented by acoustic disruption using Covaris S220 ultrasonicator and then underwent end repair, detailing, adapter ligation and purification (NEBNext UltraII DNA library Prep Kit cat # E7645S) following manufacturer instructions. The purified DNA was further selected for the right size before PCR amplification for library construction. The preliminary quantification and dilution of the library was performed using Qubit 2.0 Fluorometer, and, then Agilent 2100 Bioanalyzer was used to determine the insert size and nucleic acid concentration of the resulting library. The effective concentration of each sample in the library mixture was determined by qPCR (ABI 7500, Applied Biosystems, CA, USA) using the KAPA Library Quantification Kit (Cat. # KK4824) following the manufacturer’s standard protocol with the primer pair Primer 1: 5'-AAT GAT ACG GCG ACC ACC GA-3' Primer 2: 5'-CAA GCA GAA GAC GGC ATA CGA-3'. The PCR conditions were as follow: an initial denaturation at 95°C for 5 min followed by 35 cycles (denaturation at 95°C for 30 sec, annealing/extension/data acquisition at 60°C for 45 sec) and melt curve analysis at 65 – 95°C before sequencing to ensure the accuracy of the sample concentration.

Sequencing

Sequencing was performed on the Illumina system (HiseqX) according to manufacturer’s instructions. The samples were sequenced using a 2 × 150 paired-end (PE) configuration (GENEWIZ, Suzhou, China) using Illumina Truseq SBS Kit v4 (cat # FC-401-4003) in high output mode. Base calling was achieved with the sequencer built-in software Real-Time Analysis (RTA) (v1.5.15.1), which performs real-time conversion of the four fluorescent signals obtained from CCD (charge-coupled device) to binary base call (BCL) data. BCL data were then converted to fastq files using bcl2fastq (v2.17, Illumina). Data demultiplexing was then performed simultaneously based on index information.

Primary analysis was performed using the sequencer’s built-in software HCS (v3.4.0) to determine whether the read can pass the chastity filter based on the signal quality of the first 25 cycles. If the read had no more than 2 out of the 25 cycles with chastity values below 0.6, the read was called PF (Pass Filter). PF clusters converted by bcl2fastq were called PF data and stored in FASTQ format. The raw data were filtered to remove adapter sequences, PE reads having Q scores of < 20 and N composition of >10%. After primary cleaning of reads, mitochondrial genomes were removed. Then the remaining high quality, contamination-free reads were aligned to both the San Clemente (GCA_001704415.1) and the Yunnan black goat genome (GCA_000317765.2) separately using Bowtie2 (v2.3.4.3) (Langmead & Salzberg, 2012). Samtools (v1.9) (Li et al., 2009) was used to convert the resulting SAM sequence alignment files to BAM format, followed by sorting, indexing and quality filtering. BCFtools (v1.9) (Narasimhan et al., 2016) was used to call and filter the variants.

Validation

A total of 833,469,900 raw reads consisting of 125,020,485,000 bases were obtained by sequencing of one male BBG sample (BioSample SAMN10391846). After the QC, a total of 812,209,030 reads containing 118,911,538,136 bases were kept which was 97.45% of the total raw reads. After quality filtration and removal of the mitochondrial genome, the reads were aligned to the San Clemente and the Yunnan black goat genome which resulted in 98.65% (properly paired, 94.81%) and 98.50% (properly paired, 97.10%) of the reads aligning, respectively. Additionally, a total of 9,497,875 high quality SNPs (Q ≥ 20) along with 1,023,359 indels were identified in BBG versus the San Clemente genome (See underlying data (Mollah et al., 2019a)). Similarly 8,746,849 high quality (Q ≥ 20) SNPs along with 842,706 indels were identified BBG versus the Yunnan black goat genome genome (See underlying data (Mollah et al., 2019b)). The transition and transversion ratio was 2.27 and 2.29 in BBG against the San Clemente and the Yunnan black goat respectively.

Data availability

Underlying data

Capra hircus isolate:Black Bengal Goat breed:Black Bengal Goat (goat). BioProject PRJNA504436:

https://www.ncbi.nlm.nih.gov/bioproject/PRJNA504436

This project contains the following underlying data:

  • SRR8549904 (Alignment of BBG whole genome sequences with Yunnan Black goat)

  • SRR8549413 (Alignment of BBG whole genome sequence with San Clemente goat)

  • SRR8182317 (WGS of Black Bengal Goat: Adult male ear tissue)

Figshare: Genome-wide distributed SNPs identified in the Black Bengal Goat versus the San Clemente genome. https://doi.org/10.6084/m9.figshare.7708010.v1 (Mollah et al., 2019a)

This project contains the following underlying data:

  • BBG_aln_SanClemente.bam.calls.q20.bcf (SNP data identified in the Black Bengal Goat versus the San Clemente genome)

  • BBG_aln_SanClemente.bam.calls.q20.bcf.stats (output file from analysis of .bcf file with bcftools)

Figshare: Genome-wide distributed SNPs identified in the Black Bengal goat versus the Yunnan Black goat genome. https://doi.org/10.6084/m9.figshare.7707929.v1 (Mollah et al., 2019b)

This project contains the following underlying data:

  • BBG_aln_Yunnan.bam.calls.q20.bcf (SNP data identified in the Black Bengal Goat versus the Yunnan Black goat genome)

  • BBG_aln_Yunnan.bam.calls.q20.bcf.stats (output file from analysis of .bcf file with bcftools)

Data are available under the terms of the Creative Commons Attribution 4.0 International license (CC-BY 4.0).

Comments on this article Comments (0)

Version 1
VERSION 1 PUBLISHED 21 Mar 2019
Comment
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
F1000Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
Mollah MBR, Bhuiyan MSA, Khandoker MAMY et al. Whole genome sequence and genome-wide distributed single nucleotide polymorphisms (SNPs) of the Black Bengal goat [version 1; peer review: 2 approved with reservations]. F1000Research 2019, 8:318 (https://doi.org/10.12688/f1000research.18316.1)
NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?
Key to Reviewer Statuses VIEW
ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions
Version 1
VERSION 1
PUBLISHED 21 Mar 2019
Views
10
Cite
Reviewer Report 24 Apr 2019
Almas A. Gheyas, The Roslin Institute, University of Edinburgh, Midlothian, UK 
Approved with Reservations
VIEWS 10
The article is mostly well written and provides sufficient background about the importance of the Black Bengal goat as a rationale for generating the genomic data. There are, however, several places in the Methods section, which would require further information ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Gheyas AA. Reviewer Report For: Whole genome sequence and genome-wide distributed single nucleotide polymorphisms (SNPs) of the Black Bengal goat [version 1; peer review: 2 approved with reservations]. F1000Research 2019, 8:318 (https://doi.org/10.5256/f1000research.20035.r46138)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
Views
14
Cite
Reviewer Report 10 Apr 2019
Jason Miller, College of Natural Sciences and Mathematics, Shepherd University, Shepherdstown, WV, USA 
Approved with Reservations
VIEWS 14
With reference genome assemblies already available for the goat species Capra hircus, this manuscript announces a set of whole-genome shotgun reads from one individual of a breed that had yet to be sequenced. The data size is small, constituting less ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Miller J. Reviewer Report For: Whole genome sequence and genome-wide distributed single nucleotide polymorphisms (SNPs) of the Black Bengal goat [version 1; peer review: 2 approved with reservations]. F1000Research 2019, 8:318 (https://doi.org/10.5256/f1000research.20035.r46140)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.

Comments on this article Comments (0)

Version 1
VERSION 1 PUBLISHED 21 Mar 2019
Comment
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions
Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.