ALL Metrics
-
Views
-
Downloads
Get PDF
Get XML
Cite
Export
Track
Data Article

Molecular data analysis of selected housekeeping and informational genes from nineteen Campylobacter jejuni genomes

[version 1; peer review: 2 not approved]
PUBLISHED 14 Mar 2013
Author details Author details
OPEN PEER REVIEW
REVIEWER STATUS

Abstract

Campylobacter jejuni (C. jejuni) is a rapidly evolving bacterial species with massive genetic recombination potential to generate niche specific genotypes. Generally the housekeeping gene lineage has been evidenced to undergo lateral gene transfer and recombination fairly frequently compared to the information processing gene lineage. During such exchanges, gene amelioration takes place over time acquiring the host genomes’ molecular characteristics. In this study fifty genes that comprised twenty five housekeeping lineage genes and twenty five information processing lineage genes from nineteen C. jejuni genomes were studied. These nineteen genomes included seven C. jejuni isolates that belonged to the same genotype or multilocus sequence type ST-474. The genes from both lineages were tested for recombination and the guanine-cytosine (GC) variation. This paper details about the data collected and the analyses performed in the corresponding research article entitled ”Campylobacter jejuni genomes exhibit notable GC variation within housekeeping genes”. Further, this paper provides details on the results that are not included in the research paper to provide completeness to the study conducted. The gene sequences from the seven C. jejuni ST-474 isolates were submitted to the GenBank and the corresponding gene IDs are provided for referencing purposes.

Background

Mulitlocus sequence typing (MLST) is a technique devised to sub-divide bacterial populations1. MLST exploits the internal fragments that are approximately 470 to 500 base pairs of the housekeeping genes and constructs allelic profiles based on variations in the nucleotide sequences of (usually) seven housekeeping genes. Bacterial isolates are grouped into different clusters2. C. jejuni is a zoonotic pathogen that causes gastroenteritis in humans worldwide and the natural competency and plasticity of C. jejuni have not been investigated outside the MLST housekeeping genes38. This technique has subsequently been exploited to structure and investigate the association of C. jejuni populations where, previous researches inferred that a mutation at any site can occur somewhere in the population within the space of a week79. Hybrid alleles were shown to be more common in some of the MLST loci (tkt, aspA, gltA and glyA) and in the MOMP gene of C. jejuni (interchangeably designated as porA) as a result of recombination and, has been interpreted as a consequential measure of convergence of C. jejuni and C. coli7,8,10.

Housekeeping alleles of C. jejuni strains (genotypes) share 86% nucleotide sequence identity with each other however, the housekeeping alleles cover lesser than < 0.2% of the entire genome11. The molecular differences at the genome level and/or at the full gene level may not be detected even when MLST is combined with other antigenic genes such as porA. Moreover, the housekeeping alleles or genes are involved in specific metabolic function which may not reflect the evolutionary events in other genes involved in a different function in a genome. Further, there is no evidence to suggest that MLST alleles are identical at their full length level and it is not known if a detailed knowledge of MLST alleles can capture the true evolutionary history of the other genes in a genome.

Objectives

Based on these background information and knowledge gap, the nucleotide sequence data from two different lineages of genes, the metabolic housekeeping and the informational genes were collected from 19 C. jejuni genomes. The main objectives were

(1) To select genes those are categorised as housekeeping genes.

(2) To ensure the presence of these genes in all of the C. jejuni genomes compared without segmentation.

(3) To categorise them according to the functions they are involved in.

(4) To create two different nucleotide datasets for comparison and,

(5) To test different genetic hypotheses.

Materials and methods

Selection of metabolic and informational housekeeping genes

Although genes have been classified as housekeeping and informational genes, the list of genes that can be categorised as housekeeping genes have not be defined very clearly. Hence in this study we collected the set of genes that have been used in routine multilocus sequence typing (MLST) schemes for different bacterial species from the PubMLST database and from previous phylogenetic studies that used housekeeping genes for analyses1214. The gene names were matched with the C. jejuni NCTC 11168 genome (GenBank accession number NC002163) that is fully annotated and has been used as reference in many other genome comparative studies1520. Following the identification of genes, a total of 50 genes were selected and these were further examined for their presence in other C. jejuni reference genomes to be analysed in this study. Full length nucleotide sequences of the selected set of housekeeping genes were retrieved in FASTA format from the GenBank from all of the selected C. jejuni genomes. Further the genes were classified into operational genes (metabolic housekeeping genes) and informational genes based on their function by referring to the KEGG pathway and the gene function websites. The positions of the categorised selected subset of genes (metabolic housekeeping and informational genes) are marked on the reference C. jejuni NCTC 11168 (GenBank accession number NC002163) circular genome and are shown in Figure 1A and Figure 1B, respectively. Here after in this paper the genes are referred to as housekeeping genes (metabolic housekeeping genes) and informational genes for the purpose of plain comparison and interpretation.

4a14f97d-f72f-4fd9-971f-7f7c02bd4a99_figure1a.gif

Figure 1A. Represents the housekeeping genes (n = 25) on the circular NCTC 11168 C. jejuni genome and the MLST housekeeping genes used in typing of C. jejuni are in red fonts.

4a14f97d-f72f-4fd9-971f-7f7c02bd4a99_figure1b.gif

Figure 1B. Represents the informational genes (n = 25) located on the circular NCTC 11168 C. jejuni genome.

Reference C. jejuni genomes and nucleotide sequences from MLST ST-474 isolates

Details of the reference C. jejuni genomes, housekeeping and informational genes, their names and their functions are provided in Table 1, Table 2 and Table 3, respectively. Twelve fully sequenced C. jejuni genomes were used to compare 50 selected genes. The gene sequences were downloaded from the GenBank database. Seven C. jejuni MLST ST-474 isolates were sequenced at the Institute of Veterinary, Animal and Biomedical Sciences, Massey University, Palmerston North, New Zealand. The gene predictions were kindly provided by Dr. Biggs Patrick from Massey Genome Service, Massey University, Palmerston North for the seven C. jejuni MLST ST-474 isolates. These genomes were studied in partial fulfilment of the PhD project by the first author of this paper. The nucleotide sequences for the selected genes (n = 50) were retrieved from the gene predictions from the seven C. jejuni ST-474 genomes in FASTA format and gene sequences for five of the four ST-474 isolates P179a, H704a, H73020, P569a and P694a were submitted to the GenBank and their sequence ids are given in Table 4 and Table 5 for metabolic and informational or repair genes, respectively. In addition this data is accessible below:

Table 1. Campylobacter jejuni subsp. jejuni reference genomes.

SpeciesStrainSize (Mb)%GCORFsDisease/sourceYear and place of isolationGenBank
Campylobacter jejuni subsp. jejuni NCTC 111681.6430.51643Clinical Food poisoning1977, UK AL111168
Campylobacter jejuni subsp. jejuni RM12211.7830.31838Chicken2005*, USA CP000025
Campylobacter jejuni subsp. jejuni 81–1761.630.61653Clinical Food poisoning1981, USA CP000538
Campylobacter jejuni subsp. jejuni 811161.6330.51626Clinical Food poisoning2007, UK CP000814
Campylobacter jejuni subsp. jejuni CG84211.630.41512Clinical Food poisoningns ABGQ00000000
Campylobacter jejuni subsp. jejuni HB93-131.730.61710Clinical GBS2006*, USA AANQ00000000
Campylobacter jejuni subsp. jejuni CG84861.6530.41425Clinical Food poisoningns AASY00000000
Campylobacter jejuni subsp. jejuni CF93-61.6730.51757Clinical MFS2006*, Japan AANJ00000000
Campylobacter jejuni subsp. jejuni 84–251.6730.41748Clinical Meningitis2006*, USA AANT00000000
Campylobacter jejuni subsp. jejuni 260.941.6530.51716Clinical GBSns AANK00000000
Campylobacter jejuni subsp. jejuni IA39021.6430.51718Sheep abortion2010, USA CP001876
Campylobacter jejuni subsp.doylei 269.971.8530.62094Human blood2007, UK CP000768
Campylobacter jejuni subsp. jejuni-P179a ST-474---PoultryNew Zealand-
Campylobacter jejuni subsp. jejuni-P110b ST-474---PoultryNew Zealand AEIO00000000.1
Campylobacter jejuni subsp. jejuni-P694a ST-474---PoultryNew Zealand-
Campylobacter jejuni subsp. jejuni-P569a ST-474---PoultryNew Zealand-
Campylobacter jejuni subsp. jejuni-H22082 ST-474---HumanNew Zealand AEIP01000008.1
Campylobacter jejuni subsp. jejuni-H704a ST-474---HumanNew Zealand-
Campylobacter jejuni subsp. jejuni-H73020 ST-474---HumanNew Zealand-

ST: Multilocus sequence sequence type; GBS: Guillain-Barré syndrome; ORFs: Open reading frames; Mb: Mega bases; GC: Guanine: Cytosine; ns: not stated; *:Date of start of project.

Table 2. Metabolic genes selected from the MLST schemes of different bacterial species for comparison.

GeneGene IDName of the geneFunction
argFCj0994cProbable ornithine carbamoyltransferaseAmino acid biosynthesis - Glutamate family
aroECj0405Probable shikimate 5-dehydrogenaseAmino acid biosynthesis - Aromatic amino acid family
atpDCj0107Probable ATP synthase F1 sector beta subunitEnergy metabolism
dapECj1048cProbable succinyl-diaminopimelate desuccinylaseAmino acid biosynthesis - Aspartate family
ftsZCj0696Probable cell division proteinCell division
fumCCj1364cProbable fumarate hydrataseProbable fumarate hydratase
gapACj1403cGlyceraldehyde 3-phosphate dehydrogenaseEnergy metabolism
gltBCj0007Probable glutamate synthase (NADPH) large subunitEnergy metabolism TCA
groELCj122160 kD chaperoninHeat Shock
hemNCj0580cProbable oxidoreductaseOxygen-independent coproporphyrinogen-III oxidases
ilvDCj0013Probable dihydroxy-acid dehydrataseAmino acid biosynthesis - Glutamate family
infBCj0136Probable translation initiation factor IF-2Protein translation
lysACj0314Probable diaminopimelate decarboxylaseAmino acid biosynthesis - Aspartate family
nuoDCj1576cProbable NADH dehydrogenase I chain DEnergy metabolism - Respiration - Aerobic
pycACj1037cPossible pyruvate carboxylase A subunitCentral intermediary metabolism - Gluconeogenesis
sdhACj0437Probable succinate dehydrogenase flavoprotein subunitEnergy metabolism - Tricarboxylic acid cycle
trpBCj0348Probable tryptophan synthase beta chainAmino acid biosynthesis
trpCCj0498Probable indole-3-glycerol phosphate synthaseAmino acid biosynthesis - Aromatic amino acid family
aspACj0087AspartaseCentral intermediary metabolism
glnACj0699cGlutamine synthetaseCentral intermediary metabolism
gltACj1682cCentral intermediary metabolismCitrate synthase
glyACj0402Serine hydroxy methyl transferaseCentral intermediary metabolism
glmMCj0360Phospho glucosamine mutaseCentral intermediary metabolism
tkt Cj1645TransketolaseCentral intermediary metabolism
uncA/atpACj0105ATP synthase a subunitCentral intermediary metabolism

Table 3. Genes of repair mechanism, their gene ID, name, function and the pathways they are involved in.

GenesGene IDName of the geneFunctionPathway
dnaE_Cj0718DNA polymerase III subunit alphaIt is a main replicative polymerase(Catalyses DNA-template-directed extension of the 3’- end of a DNA strand by one nucleotide at a time)Purine metabolism, pyrimidine metabolism, metabolic pathways, DNA replication, mismatch repair and homologous recombination
gidA_Cj1188ctRNA uridine 5-carboxymethylaminomethyl modification enzymeInvolved in the modification of the wobble third base in tRNAs (5- carboxymethylaminomethyl modification (mnm(5)s(2)U) of the wobble uridine base in some tRNAs; also known as a glucose-inhibited cell division protein A)tRNA modification and regulation
guaA_Cj1248Probable GMP synthase
(glutamine-hydrolysing)
Purine ribonucleotide biosynthesisPurine metabolism
gyrA_Cj1027cDNA gyrase subunit ANegatively supercoils closed circular double-stranded DNA or prevents from super-coiling that is deleterious to bacterial survivalDirect DNA repair mechanism
gyrB_Cj0003DNA gyrase subunit BNegatively super-coils closed circular double-stranded DNADirect DNA repair mechanism
ligACj0586NAD-dependent DNA ligaseCatalyses the formation of phosphodiester linkages and is essential for DNA replication and repair of damaged DNA (between 5’-phosphoryl and 3’-hydroxyl groups in double-stranded DNA using NAD as a coenzyme and as the energy source for the reaction)Direct DNA repair mechanism, DNA replication, base excision repair, nucleotide excision repair and mismatch repair
mfdrepairCj1085cTranscription-repair coupling factorPrevents or corrects mutagenic nucleotides by removing them from DNA strands during replicationNucleotide excision repair – DNA
mutSCj1052cRecombination and DNA strand exchange inhibitor proteinInvolved in blocking homologous recombination and inhibits DNA strand exchange (has ATPase activity stimulated by recombination intermediates)Mismatch excision repair – DNA repair
mutY_Cj1620cProbable A/G-specific adenine glycosylaseCorrects incorrectly paired bases during DNA replication and involved in recombinational repairBase excision repair – DNA repair
ogt Cj0836Methylated-DNA-protein-cysteine methyltransferaseDirect DNA repair by alkylation reversalDirect DNA repair
polACj0338cDNA polymerase IHas 3’-5’ exonuclease, 5’-3’ exonuclease and 5’-3’polymerase activities, primarily functions to fill gaps during DNA replication and repairReplication and DNA repair
pyrC_Cj0259Dihydroorotase Pyrimidineribonucleotide biosynthesis: catalyses the formation of N-carbamoyl-L-aspartate from (S)-dihydroorotate in pyrimidine biosynthesisPyrimidine nucleotide biosynthesis
pyrG_Cj0027cCTP synthetaseCatalyses the ATP-dependent amination of UTP to CTP with either Lglutamine or ammonia as the source of nitrogenPyrimidine nucleotide biosynthesis
recA_Cj1673cRecombinase AInvolved in recombinational repair of DNA damage. Catalyses the hydrolysis of ATP in the presence of single-stranded DNA, the ATP-dependent uptake of single stranded DNA by duplex DNA, and the ATP-dependent hybridisation of homologous single-stranded DNAsRecombinational repair
recJ_Cj0028Putative single-stranded-DNAspecific exonucleaseSynthesis and modification of macromolecules - DNA replication, restriction/modification, recombination and repairRecombinational repair
recNCj0642Putative DNA repair proteinDNA repair proteinRecombinational repair
recRCj1263Recombination proteinInvolved in a recombinational process of DNA repairRecombinational repair
rplB_Cj1704c50S ribosomal protein L2One of the primary rRNA-binding proteins; required for association of the 30S and 50S subunits to form the 70S ribosome, for tRNA binding and peptide bond formationRibosomal gene
rpoB_Cj0479DNA-directed RNA polymerase subunit beta’RNA synthesis, RNA modification and DNA transcriptionRNA polymerase
rpoD_Cj1001RNA polymerase sigma factorSigma factors are initiation factors that promote the attachment of RNA polymerase to specific initiation sites and are then released; this is the primary sigma factor of bacteriaRNA polymerase
ruvACj0799cHolliday junction DNA helicasePlays an essential role in ATPdependent branch migration of the Holliday junctionBranch migration repair mechanism
ssb Cj1071Single-stranded DNA-binding proteinBinds to single stranded DNA and may facilitate the binding and interaction of other proteins to DNAOther repair mechanisms
uvrA_Cj0342cExcinuclease ABC subunit A uvrA is an ATPase and a DNAbinding protein.A damage recognition complex composed of two uvrA and two uvrB subunits scans DNA for abnormalitiesNucleotide excision repair
uvrB Nucleotide excision repairCj0680cExcinuclease ABC subunit BThe UvrABC repair system catalyzes the recognition and processing of DNA lesionsThe beta hairpin of the Uvr-B subunit is inserted between the strands, where it probes for the presence of a lesion
xseACj0325Exo-deoxyribonuclease VII large subunitBi-directionally degrades single stranded DNA into large acid insoluble oligonucleotidesMismatch excision repair :Repair, ribosomal and nucleotide metabolic genes employed for MLST typing of other bacterial species

Table 4. GenBank sequence ids for metabolic genes for the five ST-474 isolates.

IsolatesSequence_idsGenes
H704a_1KC405082 argF
H704a_2KC405083 aroE
H704a_3KC405084 aspA
H704a_4KC405085 atpD
H704a_5KC405086 dapE
H704a_6KC405087 ftsZ
H704a_7KC405088 fumC
H704a_8KC405089 gapA
H704a_9KC405090 glmM
H704a_10KC405091 glnA
H704a_11KC405092 gltA
H704a_12KC405093 gltB
H704a_13KC405094 glyA
H704a_14KC405095 groEL
H704a_15KC405096 hemN
H704a_16KC405097 ilvD
H704a_17KC405098 infB
H704a_18KC405099 lysA
H704a_19KC405100 nuoD
H704a_20KC405101 pycA
H704a_21KC405102 sdhA
H704a_22KC405103 tkt
H704a_23KC405104 trpB
H704a_24KC405105 trpC
H704a_25KC405106 atpA
H73020_1KC405107 argF
H73020_2KC405108 aroE
H73020_3KC405109 aspA
H73020_4KC405110 atpD
H73020_5KC405111 dapE
H73020_6KC405112 ftsZ
H73020_7KC405113 fumC
H73020_8KC405114 gapA
H73020_9KC405115 glmM
H73020_10KC405116 glnA
H73020_11KC405117 gltA
H73020_12KC405118 gltB
H73020_13KC405119 glyA
H73020_14KC405120 groEL
H73020_15KC405121 hemN
H73020_16KC405122 ilvD
H73020_17KC405123 infB
H73020_18KC405124 lysA
H73020_19KC405125 nuoD
H73020_20KC405126 pycA
H73020_22KC405127 sdhA
H73020_21KC405128 tkt
H73020_23KC405129 trpB
H73020_24KC405130 trpC
H73020_25KC405131 atpA
P179a_1KC405132 argF
P179a_2KC405133 aroE
P179a_3KC405134 aspA
P179a_4KC405135 atpD
P179a_5KC405136 dapE
P179a_6KC405137 ftsZ
P179a_7KC405138 fumC
P179a_8KC405139 gapA
P179a_9KC405140 glmM
P179a_10KC405141 glnA
P179a_11KC405142 gltA
P179a_12KC405143 gltB
P179a_13KC405144 glyA
P179a_14KC405145 groEL
P179a_15KC405146 hemN
P179a_16KC405147 ilvD
P179a_17KC405148 infB
P179a_18KC405149 lysA
P179a_19KC405150 nuoD
P179a_20KC405151 pycA
P179a_21KC405152 sdhA
P179a_22KC405153 tkt
P179a_23KC405154 trpB
P179a_24KC405155 trpC
P179a_25KC405156 atpA
P569a_1KC405157 argF
P569a_2KC405158 aroE
P569a_3KC405159 aspA
P569a_4KC405160 atpD
P569a_5KC405161 dapE
P569a_6KC405162 ftsZ
P569a_7KC405163 fumC
P569a_8KC405164 gapA
P569a_9KC405165 glmM
P569a_10KC405166 glnA
P569a_11KC405167 gltA
P569a_12KC405168 gltB
P569a_13KC405169 glyA
P569a_14KC405170 groEL
P569a_15KC405171 hemN
P569a_16KC405172 ilvD
P569a_17KC405173 infB
P569a_18KC405174 lysA
P569a_19KC405175 nuoD
P569a_20KC405176 pycA
P569a_21KC405177 sdhA
P569a_22KC405178 tkt
P569a_23KC405179 trpB
P569a_24KC405180 trpC
P569a_25KC405181 atpA
P694a_1KC405182 argF
P694a_2KC405183 aroE
P694a_3KC405184 aspA
P694a_4KC405185 atpD
P694a_5KC405186 dapE
P694a_6KC405187 ftsZ
P694a_7KC405188 fumC
P694a_8KC405189 gapA
P694a_9KC405190 glmM
P694a_10KC405191 glnA
P694a_11KC405192 gltA
P694a_12KC405193 gltB
P694a_13KC405194 glyA
P694a_14KC405195 groEL
P694a_15KC405196 hemN
P694a_16KC405197 ilvD
P694a_17KC405198 infB
P694a_18KC405199 lysA
P694a_19KC405200 nuoD
P694a_20KC405201 pycA
P694a_21KC405202 sdhA
P694a_22KC405203 tkt
P694a_23KC405204 trpB
P694a_24KC405205 trpC
P694a_25KC405206 atpA

Table 5. GenBank sequence Ids of the genes involved in the repair mechanism for the five ST-474 isolates.

Isolate_numberSequence IdsGenes
H704axseAKC408804 xseA
H73020xseAKC408805 xseA
P179a_xseAKC408806 xseA
P569axseAKC408807 xseA
P694axseAKC408808 xseA
H704auvrBKC408809 uvrB
H73020uvrBKC408810 uvrB
P179auvrBKC408811 uvrB
P569auvrBKC408812 uvrB
P694auvrBKC408813 uvrB
H704auvrAKC408814 uvrA
H73020uvrAKC408815 uvrA
P179auvrAKC408816 uvrA
P569auvrAKC408817 uvrA
P694auvrAKC408818 uvrA
H704assbKC408819 ssb
H73020ssbKC408820 ssb
P179assbKC408821 ssb
P569assbKC408822 ssb
P694assbKC408823 ssb
H704aruvAKC408824 ruvA
H73020ruvAKC408825 ruvA
P179aruvAKC408826 ruvA
P569aruvAKC408827 ruvA
P694aruvAKC408828 ruvA
H704arpoDKC408829 rpoD
H73020rpoDKC408830 rpoD
P179arpoDKC408831 rpoD
P569arpoDKC408832 rpoD
P694arpoDKC408833 rpoD
H704arpoBKC408834 rpoB
H73020rpoBKC408835 rpoB
P179arpoBKC408836 rpoB
P569arpoBKC408837 rpoB
P694arpoBKC408838 rpoB
H704arplBKC408839 rplB
H73020_rplBKC408840 rplB
P179arplBKC408841 rplB
P569arplBKC408842 rplB
P694arplBKC408843 rplB
H704arecRKC408844 recR
H73020recRKC408845 recR
P179arecRKC408846 recR
P569arecRKC408847 recR
P694arecRKC408848 recR
H704arecNKC408849 recN
H73020recNKC408850 recN
P179arecNKC408851 recN
P569arecNKC408852 recN
P694arecNKC408853 recN
H704arecJKC408854 recJ
H73020recJKC408855 recJ
P179arecJKC408856 recJ
P569arecJKC408857 recJ
P694arecJKC408858 recJ
H704arecAKC408859 recA
H73020recAKC408860 recA
P179arecAKC408861 recA
P569arecAKC408862 recA
P694arecAKC408863 recA
H704apyrGKC408864 pyrG
H73020pyrGKC408865 pyrG
P179apyrGKC408866 pyrG
P569apyrGKC408867 pyrG
P694apyrGKC408868 pyrG
H704apyrCKC408869 pyrC
H73020pyrCKC408870 pyrC
P179apyrCKC408871 pyrC
P569apyrCKC408872 pyrC
P694apyrCKC408873 pyrC
H704apolAKC408874 polA
H73020polAKC408875 polA
P179apolAKC408876 polA
P569apolAKC408877 polA
P694apolAKC408878 polA
H704aogtKC408879 ogt
H73020ogtKC408880 ogt
P179aogtKC408881 ogt
P569aogtKC408882 ogt
P694aogtKC408883 ogt
H704amutYKC408884 mutY
H73020mutYKC408885 mutY
P179amutYKC408886 mutY
P569amutYKC408887 mutY
P694amutYKC408888 mutY
H704amutSKC408889 mutS
H73020mutSKC408890 mutS
P179amutSKC408891 mutS
P569amutSKC408892 mutS
P694amutSKC408893 mutS
H704amfdKC408894 mfd
H73020mfdKC408895 mfd
P179amfdKC408896 mfd
P569amfdKC408897 mfd
P694amfdKC408898 mfd
H704aligAKC408899 ligA
H73020ligAKC408900 ligA
P179aligAKC408901 ligA
P569aligAKC408902 ligA
P694aligAKC408903 ligA
H704agyrBKC408904 gyrB
H73020gyrBKC408905 gyrB
P179agyrBKC408906 gyrB
P569agyrBKC408907 gyrB
P694agyrBKC408908 gyrB
H704agyrAKC408909 gyrA
H73020gyrAKC408910 gyrA
P179agyrAKC408911 gyrA
P569agyrAKC408912 gyrA
P694agyrAKC408913 gyrA
H704aguaAKC408914 guaA
H73020_guaAKC408915 guaA
P179aguaAKC408916 guaA
P569aguaAKC408917 guaA
P694aguaAKC408918 guaA
H704agidAKC408919 gidA
H73020gidAKC408920 gidA
P179agidAKC408921 gidA
P569agidAKC408922 gidA
P694agidAKC408923 gidA
H704adnaEKC408924 dnaE
H73020dnaEKC408925 dnaE
P179adnaEKC408926 dnaE
P569adnaEKC408927 dnaE
P694adnaEKC408928 dnaE

Analysis of Guanine-Cytosine content and recombination

The length of the house keeping and the informational genes and sites of recombination can be found below:

The results of the guanine-cytosine (GC) analysis with the GC range for all of the genes from 19 C. jejuni genomes can be found below:

The overall GC and GC3 contents of individual genes were compared using DnaSP v521. The frequency distribution graphs of GC contents from all the 19 genomes compared were generated in R programming language22. Inferences on recombination within each gene under investigation were drawn using Dual-Brothers within Geneious v.5.3.423. This function uses a double change point model that detects spatial variation in the phylogenetic tree topology and spatial variation of the nucleotide substitution process23. The gene sequences were aligned using Geneious v5.3.4 and the recombination detection functionality was applied on each gene alignment to infer changes in the topologies and substitution processes. In addition, the aligned sequences were examined using DnaSP v5 to identify the sites involved in recombination. The number of recombination events, referred to as Rm was estimated using DnaSP v5. The GC variance for the genes were estimated using the variance function in Microsoft Office Excel and the relationship between the GC variance and the recombination events was analysed using linear models by having Rm as a dependent variable, and the log GC variance and the length as independent variables.

Figure 2Figure 9 describe the guanine-cytosine (GC) variation in the housekeeping and the informational genes. The plots describe the frequency distribution of GC content across 19 C. jejuni genomes and the number inside the parenthesis denotes the number of recombination sites for the respective genes.

4a14f97d-f72f-4fd9-971f-7f7c02bd4a99_figure2.gif

Figure 2. Guanine-cytosine (GC) variation in the metabolic housekeeping genes – A.

The plots describe the distribution of GC across 19 C. jejuni genomes and the number inside the parenthesis denotes the number of recombination sites.

4a14f97d-f72f-4fd9-971f-7f7c02bd4a99_figure3.gif

Figure 3. Guanine-cytosine (GC) variation in the metabolic housekeeping genes – A.

The plots describe the distribution of GC across 19 C. jejuni genomes and the number inside the parenthesis denotes the number of recombination sites.

4a14f97d-f72f-4fd9-971f-7f7c02bd4a99_figure4.gif

Figure 4. Guanine-cytosine (GC) variation in the metabolic housekeeping genes – A.

The plots describe the distribution of GC across 19 C. jejuni genomes and the number inside the parenthesis denotes the number of recombination sites.

4a14f97d-f72f-4fd9-971f-7f7c02bd4a99_figure5.gif

Figure 5. Guanine-cytosine (GC) variation in the MLST housekeeping genes – A.

The plots describe the distribution of GC across 19 C. jejuni genomes and the number inside the parenthesis denotes the number of recombination sites.

4a14f97d-f72f-4fd9-971f-7f7c02bd4a99_figure6.gif

Figure 6. Guanine-cytosine (GC) distribution in the ribosomal and repair genes – A.

This plot describes the GC distribution across all 19 C. jejuni genomes. The number of recombination sites is provided in parentheses.

4a14f97d-f72f-4fd9-971f-7f7c02bd4a99_figure7.gif

Figure 7. Guanine-cytosine (GC) distribution in the ribosomal and repair genes – A.

This plot describes the GC distribution across all 19 C. jejuni genomes. The number of recombination sites is provided in parentheses.

4a14f97d-f72f-4fd9-971f-7f7c02bd4a99_figure8.gif

Figure 8. Guanine-cytosine (GC) distribution in the ribosomal and repair genes – A.

This plot describes the GC distribution across all 19 C. jejuni genomes. The number of recombination sites is provided in parentheses.

4a14f97d-f72f-4fd9-971f-7f7c02bd4a99_figure9.gif

Figure 9. Guanine-cytosine (GC) distribution in the ribosomal and repair genes – A.

This plot describes the GC distribution across all 19 C. jejuni genomes. The number of recombination sites is provided in parentheses.

Comments on this article Comments (0)

Version 1
VERSION 1 PUBLISHED 14 Mar 2013
Comment
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
F1000Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
Mohan V and Stevenson M. Molecular data analysis of selected housekeeping and informational genes from nineteen Campylobacter jejuni genomes [version 1; peer review: 2 not approved]. F1000Research 2013, 2:87 (https://doi.org/10.12688/f1000research.2-87.v1)
NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?
Key to Reviewer Statuses VIEW
ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions
Version 1
VERSION 1
PUBLISHED 14 Mar 2013
Views
48
Cite
Reviewer Report 07 Feb 2014
Daniel Falush, Department of Evolutionary Genetics, Max Planck Institute, Leipzig, Germany 
Not Approved
VIEWS 48
I do not understand either from the abstract or reading the earlier paper why this data was collected. Neither the strain collection nor the quantities calculated have any obvious special features. The information presented is of a similar character to ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Falush D. Reviewer Report For: Molecular data analysis of selected housekeeping and informational genes from nineteen Campylobacter jejuni genomes [version 1; peer review: 2 not approved]. F1000Research 2013, 2:87 (https://doi.org/10.5256/f1000research.1102.r3557)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
Views
51
Cite
Reviewer Report 02 May 2013
David Ussery, Center for Biological Sequence Analysis, Department of Systems Biology, The Technical University of Denmark, Lyngby, Denmark 
Not Approved
VIEWS 51
This short article describes a comparison of 'housekeeping genes' with 'information processing genes', across 19 Campylobacter jejuni genomes. I cannot fully evaluate this paper as the data is not available. I understand that this is a new journal, and as ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Ussery D. Reviewer Report For: Molecular data analysis of selected housekeeping and informational genes from nineteen Campylobacter jejuni genomes [version 1; peer review: 2 not approved]. F1000Research 2013, 2:87 (https://doi.org/10.5256/f1000research.1102.r921)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.

Comments on this article Comments (0)

Version 1
VERSION 1 PUBLISHED 14 Mar 2013
Comment
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions
Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.