ALL Metrics
-
Views
-
Downloads
Get PDF
Get XML
Cite
Export
Track
Research Article

Campylobacter jejuni genomes exhibit notable GC variation within housekeeping genes

[version 1; peer review: 2 not approved]
PUBLISHED 14 Mar 2013
Author details Author details
OPEN PEER REVIEW
REVIEWER STATUS

Abstract

Campylobacter jejuni (C. jejuni) is a rapidly evolving bacterial species with massive genetic recombination potential to generate niche specific genotypes. Generally the housekeeping gene lineage has been evidenced to undergo lateral gene transfer and recombination quite frequently compared to the information processing gene lineage. During such exchanges, genetic amelioration takes place over time acquiring the host genomes’ molecular characteristics. In this study, fifty genes that comprised twenty five metabolic housekeeping lineage genes and twenty five information processing lineage genes from nineteen C. jejuni genomes were studied. These nineteen genomes included seven C. jejuni isolates that belonged to the same genotype or multilocus sequence type ST-474 that were sequences in New Zealand. The genes from both lineages were tested for recombination and the guanine-cytosine (GC) variation. There was a positive correlation between the GC variance and the number of recombination events amongst the metabolic housekeeping genes. Genes that showed wider GC variance had a relatively high number of recombination events. In contrast, although recombination was evident in all of the informational genes, there was no correlation between the GC variance and recombination. The observation of a positive correlation between the GC variance and the recombination events in the metabolic housekeeping genes may reflect the recent events of exchange of DNA and the regions that are constantly dynamic to undergo recombination under certain circumstances. While in the case of informational genes, the demand of stringent homology between genes may be a limiting factor for the absence of such correlation, however, the sites that involved in recombination may also represent the hotspots of recombination in those genes.

Introduction

Molecular events such as mutation, deletion, recombination and gene transfer play paramount roles in shaping the evolution of prokaryotes (reviewed by1,2). As a consequence the genomes are more prone for nucleotide base compositional fluctuations. Particularly, the evolutionary forces pose a major impact on the guanine-cytosine (GC) content of bacteria at the level of genes and genomes3. Amongst all the important evolutionary forces, the impact of recombination (homologous, non-homologous or illegitimate) on the evolution of bacteria has been evidenced as the major driving force or factor of microevolution48. However, the rate of recombination may differ greatly amongst different bacterial species; while some species recombine more frequently to have multiple recombination events than mutations that render them weakly clonal, whereas in other species it appears to be a rare incident leading to distinct clonal lineages5,6,8,9. Studies of genetic diversity in the bacterial kingdom have shown that bacteria form clusters of genetically related strains and that extensive recombination among related clusters have been regarded as normal rather than exceptional events10. However, not every single gene is involved in recombination or horizontal gene transfer11,12.

The lineages of the genes were broadly classified into informational and operational or metabolic housekeeping genes. The informational genes include genes of translation (T), transcription (S), and replication (R) and also the ATPases, GTPases (G) and tRNA synthetases whereas the operational genes are those involved in cell operations such as amino acid synthesis (A), biosynthesis of co-factors (B), cell envelope proteins (C), energy metabolism (E), intermediary metabolism (I), fatty acid and phospholipid biosynthesis (L), nucleotide biosynthesis (N), and regulatory genes (Z)12. The operational genes are the most modular genes in the cells that are inclined to be horizontally transferred or recombined most often11,13. As a result of this behaviour of the housekeeping genes it is therefore prudent to speculate that housekeeping genes may exhibit notable molecular differences compared to the informational genes.

Campylobacter jejuni (C. jejuni), is a zoonotic pathogen that colonises the gut of a wide variety of birds and mammals and has been attributed to the majority of bacterial gastroenteritis cases in developed countries14. Most predominantly the disease is caused by C. jejuni and often the disease is self limiting, however on rare occasions there can be serious sequelae such as Guillain-Barré syndrome and reactive arthritis15. The natural competency and the plasticity of C. jejuni were not investigated until Dingle et al. (2001)16 designed an MLST scheme for C. jejuni which has subsequently been exploited to structure and investigate the association of C. jejuni populations with different hosts and the environment from which human clinical infection originated1721. Further, Wilson et al.22 used the population genetics-phylogenetics approach to demonstrate the massive evolutionary potential of C. jejuni inferring that recombination plays a major role in the generation of diversity at twice the rate of mutation per se.

As C. jejuni is an actively recombining bacterial species, an attempt to investigate the GC variations in a subset of operational genes and informational genes was made in an effort to understand the difference between these two lineages of genes within C. jejuni genomes. The assumptions to conduct this analysis were: (1) amelioration or coalescence of GC content or the nucleotide base composition takes relatively long time23 which means that an event of recombination in a given group of genes with lack of time for coalescence will exhibit notable GC variation compared to other genes in the population; (2) housekeeping genes are relatively more prone for recombination where interspecies recombination has been demonstrated between donor and recipient DNA molecules that differ by up to 25–30% of their nucleotide sites2426. Given the fact that C. jejuni is a competent bacterial species where generation of host adapted variants has been documented using MLST datasets by several studies2731, it may be hypothesised that the housekeeping genes may exhibit notable differences in their GC content24; (3) Informational genes require highly stringent homology for an event of recombination to occur and recombination takes place as a part of DNA repair32 and hence these genes may not exhibit such notable GC variation even in the presence of recombination events. In order to test these hypotheses, nineteen C. jejuni genomes were analysed in this study.

Methods

Analyses of metabolic and informational housekeeping genes

Housekeeping genes were located on the C. jejuni NCTC 11168 genome (GenBank accession number NC002163) and a total of 50 genes were selected for the analyses3335. The genes were further classified into operational genes (metabolic housekeeping genes) and informational genes based on their function by referring to the KEGG pathway and the gene function websites. The positions of the categorised selected subset of genes (metabolic housekeeping and informational genes) are marked on the reference C. jejuni NCTC 11168 (GenBank accession number NC 002163) circular genome and are shown in Figures 1A to Figures 1D, respectively. Here after in this paper the genes are referred as housekeeping genes (metabolic housekeeping genes) and informational genes for the purpose of plain comparison and interpretation. Twelve fully sequenced C. jejuni genomes (Table 1) were used to compare 50 selected genes. The gene sequences were downloaded from the GenBank database. Seven C. jejuni MLST ST-474 isolates were sequenced at the Institute of Veterinary, Animal and Biomedical Sciences, Massey University, Palmerston North, New Zealand. The overall GC and GC3 contents of individual genes were compared using DnaSP v536. The frequency distribution graphs of GC contents from all the 19 genomes compared were generated in R programming language37. Inferences on recombination within each gene under investigation were drawn using Dual-Brothers within Geneious v.5.3.438 and the number of recombination events, referred to as Rm was estimated using DnaSP v5. The relationship between the GC variance and the recombination events was analysed using linear models by having Rm as a dependent variable, and the log GC variance and the length as independent variables. Detailed descriptions of methods used in the article are available from a related data article39.

Table 1. Campylobacter jejuni subsp. jejuni reference genomes.

Species Strain Size (Mb) %GC ORFs Disease/source Year and place of isolation GenBank
Campylobacter jejuni subsp. jejuni NCTC 111681.6430.51643Clinical Food poisoning1977, UK AL111168
Campylobacter jejuni subsp. jejuni RM12211.7830.31838Chicken2005*, USA CP000025
Campylobacter jejuni subsp. jejuni 81-1761.630.61653Clinical Food poisoning1981, USA CP000538
Campylobacter jejuni subsp. jejuni 811161.6330.51626Clinical Food poisoning2007, UK CP000814
Campylobacter jejuni subsp. jejuni CG84211.630.41512Clinical Food poisoningns ABGQ00000000
Campylobacter jejuni subsp. jejuni HB93-131.730.61710Clinical GBS2006*, USA AANQ00000000
Campylobacter jejuni subsp. jejuni CG84861.6530.41425Clinical Food poisoningns AASY00000000
Campylobacter jejuni subsp. jejuni CF93-61.6730.51757Clinical MFS2006*, Japan AANJ00000000
Campylobacter jejuni subsp. jejuni 84-251.6730.41748Clinical Meningitis2006*, USA AANT00000000
Campylobacter jejuni subsp. jejuni 260.941.6530.51716Clinical GBSns AANK00000000
Campylobacter jejuni subsp. jejuni IA39021.6430.51718Sheep abortion2010, USA CP001876
Campylobacter jejuni subsp. doylei 269.971.8530.62094Human blood2007, UK CP000768
Campylobacter jejuni subsp. Jejuni-P179a ST-474PoultryNew Zealand
Campylobacter jejuni subsp. Jejuni-P110b ST-474PoultryNew Zealand AEIO00000000.1
Campylobacter jejuni subsp. Jejuni-P694a ST-474PoultryNew Zealand
Campylobacter jejuni subsp. Jejuni-P569a ST-474PoultryNew Zealand
Campylobacter jejuni subsp. Jejuni-H22082 ST-474HumanNew Zealand AEIP01000008.1
Campylobacter jejuni subsp. Jejuni-H704a ST-474HumanNew Zealand
Campylobacter jejuni subsp. Jejuni-H73020 ST-474HumanNew Zealand

ST: Multilocus sequence sequence type; GBS: Guillain-Barré syndrome; ORFs: Open reading frames; Mb: Mega bases; GC: Guanine: Cytosine; ns: not stated; *:Date of start of project.

Results and Discussion

GC variance in housekeeping genes

Guanine-Cytosine (GC) content ranged between 31.9% and 36.4% across the metabolic housekeeping genes in general, where both the high and low ranges of the GC contents was evident in the C. jejuni subsp. doylei (CJJD269.97269.97) genome. The GC content of the housekeeping alleles varied among genomes with the MLST scheme alleles the tkt and gltA alleles showing a relatively wider variation followed by glyA, glmM, aspA, glnA and uncA alleles. GC contents were identical amongst all genes across all the ST-474 genomes except for two genes, namely fumC and trpC however, the GC3 content of the pycA gene varied between the ST-474 genomes. Analysis of recombination events showed that the infB gene possessed the highest number of 27 sites (Rm [the number of recombination events] = 27) (Table 2a) while the trpB and uncA possessed the least number of sites (n = 1) to be involved in recombination. The linear model showed that the number of recombination events was positively correlated with the GC variance, where the genes that showed a wider GC variance showed a high number of recombination events. However, the infB and gltB genes showed relatively high numbers of recombination events (n = 27 and n = 26, respectively) and they did not reveal convincing GC variation between genomes as it was observed in other genes. While the GC variance and the number of recombination events was found to be positively correlated (p value = 0.009), the length was not found to influence the number of recombination events significantly (p value = 0.7). Figures 1A and Figures 1B illustrate the relationship between the GC variance and the recombination events, and the relationship between the length and the recombination events, respectively. The frequency distribution graphs of the GC contents for all the 25 genes are provided in Mohan et al.37 in figures 2–5.

Table 2a. GC content range and recombination sites identified in the housekeeping genes.

Genes Sites Rm GC range
argF(114,163) (168, 195) (195, 207) (207, 231) (231, 282) (282, 291) (291, 303) (303, 308)
(308, 339) (420, 465)
100.331-0.351
aroE(381, 392) (392, 435) (514, 645) (645, 654) (654,702) (702, 717)60.288-0.303
aspA(198, 336) (336, 414) (414, 615) (627, 897) (1203,1251)50.330-0.334
atpD(348, 408) (408, 651) (924,1047)30.351-0.354
dapE(171, 216) (216, 230) (231, 321) (321, 351) (351, 357) (357, 387) (387, 453) (474, 522)
(531, 543) (543, 560) (609, 690)(701, 867) (867, 942) (942, 945) (990,1035)
150.350-0.362
ftsZ(90,138) (348, 561) (645, 681) (840, 903) (903, 936)50.351-0.358
fumC(45,108) (168, 303) (354, 357) (576, 591) (624, 687) (687, 711) (772, 909) (930, 987)
(987,1038) (1038,1227) (1281,1311) (1311,1335) (1335,1380)
130.358-0.366
gapA(288, 381) (381, 501) (501, 567) (570, 594) (594, 732) (819, 921) (948, 961)70.351-0.358
glnA(180, 230) (231, 258) (258, 279) (358, 448) (448, 513) (711, 768) (768, 945)
(945, 1188) (1188,1260)
90.319-0.325
gltA(312, 325) (325, 351) (351, 372) (372, 453) (453, 654) (654, 756) (780, 921) (921, 1089)
(1089, 1140) (1203, 1305)
100.331-0.335
gltB(159, 675) (741, 855) (1236,1401) (1401,1581) (1581,1605) (1605,1902) (1974, 2067)
(2151, 2178) (2226, 2325) (2349, 2412) (2412, 2418) (2418, 2700) (2700,2838) (2853, 2982)
(3204, 3210) (3210, 3318) (3387, 3462) (3462, 3636) (3651, 3861) (3885, 4023) (4023, 4092)
(4092, 4098) (4098, 4119) (4119, 4149) (4149, 4305) (4324, 4356)
260.352-0.364
glyA(165, 219) (474, 490) (492, 562) (612, 618) (618, 657) (684, 744) (966,1156)70.347-0.358
groEL(1008,1023) (1023,1029) (1029,1035) (1200,1236) (1285,1290)50.328-0.336
hemN(82,174) (174, 207) (210, 372) (544, 584) (585,783) (828, 903) (903, 927) (930, 975) (975, 996)
(996,1050) (1050,1065) (1104,1140) (1140,1163)
130.364-0.379
ilvD(249, 300) (300, 399) (492, 576) (654, 777) (777, 816) (864, 891) (1098,1137) (1170,1263)
(1263,1296) (1323,1347) (1389,1416) (1416,1530) (1530,1665)
130.30-0.306
InfB(1128,1140) (1140,1188) (1230,1263) (1263,1284) (1284,1365) (1365,1392) (1392,1416)
(1476,1551) (1596,1692) (1743,1824) (1824,1845) (1845,1914) (1914,1938) (1938,1962)
(1962, 2124) (2124, 2127) (2127, 2235) (2235, 2259) (2268, 2286) (2286, 2301) (2319, 2376)
(2376, 2394) (2394, 2436) (2436, 2478) (2478, 2487) (2487, 2490) (2538, 2562)
270.367-0.381
lysA(198, 222) (222, 672) (706, 790) (790, 825) (825, 944) (945,1035) (1035,1122) (1122,1171)80.349-0.385
nuoD(78, 93) (105, 249) (249, 267) (393, 453) (453, 483) (543, 566) (660, 678) (780, 852) (852, 864)
(875, 894) (894, 954) (1008,1050)
120.32-0.326
glmM(171, 291) (291, 366) (366, 450) (492, 573) (573, 636) (689, 798) (897, 972) (972,1020) (1119,1286)90.355-0.363
pycA(78,194) (194, 546) (672, 768)30.33-0.34
sdhA(684, 686) (686, 693) (693, 700) (700, 701) (775, 787) (787, 848) (1051,1066) (1066,1141)
(1219,1291) (1291,1329)
100.37-0.373
tkt (54,102) (159, 237) (237, 297) (297, 342) (342, 366) (366, 660) (966,1029) (1092,1107)
(1125,1138) (1138,1254) (1254,1287) (1320,1486) (1525,1749) (1749,1761) (1761,1791)
(1791,1827) (1827,1876)
170.355-0.361
trpB(1053, 1119)10.345-351
trpC(138,195) (210, 294) (567, 750)30.315-326
uncA(678, 1065)10.352-0.355

Rm: Number of recombination sites.

c3d1b8ef-5853-4fea-bdf6-42c9d568fc91_figure1a.gif

Figure 1A. epresents the relationship between the GC variance and the number of recombination events in housekeeping genes.

c3d1b8ef-5853-4fea-bdf6-42c9d568fc91_figure1b.gif

Figure 1B. Represents the relationship between the length and the number of recombination events in housekeeping genes.

GC variance in informational genes

The overall guanine-cytosine (GC) content varied greatly between informational genes in general, where genes such as mfd, ogt, polA, recJ, recN and xseA showed lower ranges of GC contents (from 0.265 to 0.298) whereas the remainder showed a relatively higher GC content with rplB showing the highest GC content of 0.392. The analysis of informational genes for recombination events showed that ogt gene and ssb possessed a single recombination site, while mutS was found to possess the highest number of recombination events (n = 38) (Table 2b describes the recombination events that occurred in the informational genes). The linear model showed that Rm was dependent on the length of the genes (p value = 0.05) rather than on the GC variance (p value = 0.9). Figures 1C and Figures 1D illustrate the relationship between the GC variance and the number of recombination events, and the relationship between the length and recombination events, respectively. (Frequency distribution graphs of GC contents for the informational genes are provided in Mohan et al.37 in figures 6–9).

Table 2b. GC content range and recombination sites identified in the informational genes.

Genes Sites Rm GC range
dnaE(210, 376) (376, 396) (396, 501) (714, 762) (828, 917) (963, 981) (981, 1023) (1131, 1347)
(1347, 1449) (1890, 1923) (1992, 2028) (2037, 2048) (2048, 2049) (2049, 2217) (2226, 2409)
(2409, 2679) (2688, 2778) (2778, 3016) (3177, 3240) (3240, 3285) (3285, 3303) (3303, 3423)
(3426, 3535)
230.319-0.324
gidA(345,365) (365,399) (419,441) (444,546) (624,669) (669,687) (687,768) (768,771) (771,858)
(912,930) (939,942) (942,955) (955,959) (959,1020) (1020,1127) (1127,1143) (1158,1233)
(1233,1245) (1245,1281) (1281,1347) (1347,1389) (1446,1577) (1578,1585) (1602,1611)
(1635,1746)
250.329-0.341
guaA(132,174) (210, 223) (226, 232) (232, 331) (331, 351) (351, 378) (402, 471) (489, 496) (496, 516)
(570, 582) (663, 687) (687, 723) (723, 765) (765, 801) (825, 858) (879, 960) (960, 1116)
(1116, 1140) (1140, 1179) (1179, 1230) (1230, 1254) (1332, 1392) (1398, 1410) (1410, 1428)
(1482, 1521)
250.347-0.361
gyrA(243, 257) (257, 357) (891, 1017) (1017, 1476) (1770, 1994) (1994, 2142) (2283, 2421)
(2472, 2478) (2478, 2547)
90.333-0.335
gyrB(165, 324) (324, 360) (360, 375) (375, 813) (852, 924) (952,1053) (1053,1107) (1113,1143)
(1167,1209) (1209,1360) (1449,1551) (1570,1662) (1662,1728) (2169, 2277)
140.329-0.332
ligA(125,188) (188, 558) (558, 579) (681, 708) (708, 720) (734, 787) (819, 832) (832,1032)
(1050,1104) (1104,1245) (1332,1371) (1371,1431) (1449,1500) (1500,1539) (1561,1587)
(1587,1649) (1665,1717) (1743,1752) (1752,1815) (1815,1878)
200.335-0.346
mfd (25, 87) (87,174) (195, 291) (465, 531) (693, 729) (729, 867) (867, 876) (888, 919) (936,1071)
(1071,1185) (1350,1371) (1372,1410) (1464,1506) (1506,1512) (1536,1554) (1554,1767)
(1767, 2070) (2080, 2713)
180.287-0.29
mutS(120, 132) (198, 210) (210, 225) (225, 316) (339, 372) (393, 421) (421, 423) (435, 474)
(474, 489) (489, 591) (599, 636) (636, 729) (774, 823) (843, 849) (852, 867) (867, 876)
(1026, 1028) (1028,1107) (1134,1243) (1243,1266) (1269,1305) (1305,1326) (1339,1365)
(1497,1548) (1548,1596) (1596,1626) (1626,1680) (1710,1869) (1869,1914) (1975,1978)
(1978,1983) (1984,1996) (1996, 2014) (2071,2077) (2077, 2089) (2095, 2116) (2152, 2185)
(2185, 2203)
380.324-0.33
mutY(75,141) (141,261) (294,327) (327,336) (336,351) (363,393) (438,501) (501,553) (553,564)
(613,670) (670,715) (766,847)
120.308-0.318
ogt -10.269-0.286
polA(21,142) (240,274) (274,327) (369,540) (600,649) (649,672) (1028,1083) (1083,1121)
(1253,1278) (1278,1305) (1305,1347) (1347,1368) (1380,1464) (1578,1611) (1683,1720)
(1721,1743) (1747,1860) (1893,1911) (2256,2262) (2472,2487) (2487,2523)
210.289-0.294
pyrC(5,246) (252,312) (312,396) (396,402) (402,723) (723,747) (747,975) (975,987)80.304-0.312
pyrG(915,945) (945,1018) (1146,1392) (1419,1445) (1445,1479)50.343-0.35
recA(226,351) (447,570) (570,612) (612,624) (807,825) (861,912)60.359-0.382
recJ(250, 300) (318, 355) (355, 375) (678, 688) (688, 972) (972,1059) (1059,1062) (1062,1245)
(1245,1530)
90.294-0.303
recN(87,111) (111,192) (243,396) (399,421) (421,539) (540,738) (759,834) (834,870) (870,879)
(900,1068) (1257,1272) (1272,1311)
120.272-0.278
recR(78,123) (282,432) (432,447)30.3-0.309
rplB(123,228) (240,258) (384,444) (468,588) (675,762)50.384-0.394
rpoB(606,1263) (1791,1812) (3363,3366) (3366,3417)40.333-0.337
rpoD(129,156) (156,174) (174,435) (495,735) (735,822) (822,858) (1272,1326) (1326,1758)100.304-0.309
ruvA(96,123) (147,264) (264,387)30.33-0.341
ssb (234, 448)10.33-0.346
uvrA(27,105) (105,132) (132,177) (222,243) (255,291) (291,294) (351,372) (372,384) (498,540)
(579,621) (651,681) (681,804) (921,1031) (1101,1170) (1170,1255) (1326,1425) (1425,1671)
(2058,2094) (2094,2139) (2202,2276) (2276,2397) (2433,2442) (2442,2532) (2610,2628)
(2643,2700) (2700,2715)
260.342-0.348
uvrB(36,72) (120,165) (165,168) (327,378) (420,594) (696,714) (714,751) (757,777) (777,784)
(816,856) (876,924) (924,928) (936,948) (948,988) (988,1005) (1042,1068) (1068,1074)
(1083,1086) (1086,1122) (1122,1182) (1182,1221) (1221,1248) (1248,1266) (1272,1284)
(1284,1356) (1362,1383) (1437,1530) (1530,1533) (1533,1534) (1536,1593) (1701,1776)
(1779,1785) (1785,1791) (1791,1866) (1881,1890) (1914,1938)
360.333-0.341
xseA(90,186) (402,669) (832,930) (930,961)40.281-0.288

Rm: Number of recombination sites.

c3d1b8ef-5853-4fea-bdf6-42c9d568fc91_figure1c.gif

Figure 1C. Represents the relationship between the GC variance and the number of recombination events in informational genes.

c3d1b8ef-5853-4fea-bdf6-42c9d568fc91_figure1d.gif

Figure 1D. Represents the relationship between the length and the number of recombination events in informational genes.

Differences in the nucleotide base composition of a gene and/or genome is a fundamental element shaping the genomic evolution which in turn, directly influences the GC contents of genes and/or genomes40. GC variation has been thought to be driven both by neutral mutational effects and adaptive selection pressures41,42. In this study, GC contents of the housekeeping and informational genes were measured across 19 C. jejuni genomes. Variation in the GC contents amongst the housekeeping genes investigated in this study within the 19 genomes was clearly evident. However, it should be noted that the GC contents of the housekeeping genes within the seven ST- 474 genomes were identical except for two genes, the fumC and trpC, with a variation in the GC3 content of the pycA gene (but showed an identical overall GC content across all the ST-474s). There was an association between the GC variance and the number of recombination sites that occurred within different housekeeping genes, where the majority of the genes (investigated in this study) that possessed a wider GC variance showed a higher number of recombination sites.

The variation in base composition is a consequence of differences in the patterns of evolutionary events43 where variation in the GC content is dependent on the mutational patterns and/or the evolutionary events that had occurred in a given nucleotide sequence43,44. GC content has also been shown to be correlated with various biological factors45 and is a potential research area where there has been a significant level of research carried out3,41,4652 as well as is ongoing. Previous reports that investigated the causes for the differences in the GC content have evidenced that the GC content at the third codon position (GC3) and the conversion of GC to AT and AT to GC at this position to occur in favour of a selection for the GC content of any given genome47,53. Further, this selection pressure possesses a great impact on evolution47,53. Furthermore, the major changes in the GC content has been shown to predict the future direction of the evolutionary changes of the genomic GC content54. The tRNA abundances in a genome has been shown to be yet another important selective pressure that determines the synonymous codon usage (change in the GC3 position) which in turn reflects the differential evolution in organisms50,55. Hence, in the light of previous research reports it will be prudent to hypothesise that since the housekeeping genes are involved in metabolic processes in a genome this may be a reflection of the different metabolic evolutionary pressures that acted upon these genes as a measure of adaptation to the prevailing environmental conditions. Also, the base composition may have changed as a result of recombination between the same species and /or between bacteria with a similar base composition and tRNA pools. It is very intriguing to determine the driving forces for the change in the base composition which may help to better understand the biological reasons behind the frequent nucleotide changes within housekeeping genes in C. jejuni.

Further, there may be few additional explanations for the variation in the GC contents that might have arisen as a result of recombination such as (1) they may not have ameliorated or coalesced after an event of recombination; or (2) may not have had ample time for amelioration after recombination; or (3) those sites may be the hotspots or active spots on the genes that are dynamic and continuously engaging in recombination.

Reports using MLST datasets have revealed mosaic alleles within the seven housekeeping alleles and have also suggested and/or raised the possibilities of both host adaptation and convergence of C. jejuni and Campylobacter coli (C. coli) species27,30,31. There may be an additional influence which may be relatively biological that triggers such recombination between C. jejuni isolates present in a host and/or convergence of C. jejuni with C. coli that may in turn enable better survival of the bacteria in certain hosts. Hybrid alleles of tkt and gltA have been frequently documented in previous reports30,31 where in our study, these two genes showed wider GC variance within genomes. In contrast, atpD and trpB showed the least number of recombination sites where, the GC content did not vary as evidenced in the tkt and the gltA alleles across the investigated genomes. However, there was one exception, the infB gene that showed a high number of recombination sites showed relatively small GC variance.

In the case of informational genes, the overall GC content varied greatly amongst genes investigated within the genomes. Although there was variation in the GC contents within genes, it was not positively correlated with the number of recombination sites, whereas it was correlated with the length of genes which is not surprising. It may be speculated that since an event of recombination in informational genes demand a relatively high degree of homologies56, recombination in this subset of genes may have occurred between sequences with high homologies. Moreover, according to the complexity theory11,12 the repair and ribosomal genes belong to the lineages of higher complex, which may not allow them to compromise or tolerate GC variation during recombination. Further, apart from the differences between individual genomes, the overall GC variation amongst the informational genes investigated in this study, (lower GC content in mfd, ogt, polA, recJ, recN and xseA genes and higher GC contents in the remainder) reflect the differences in the functional conservation and complexity. For example, rplB showed the highest GC content of 39.2% which is relatively complex and conserved as it is a 50S ribosomal protein – L2, involved in several discrete steps of polypeptide synthesis such as peptidyl transferase activity, binding of aminoacyl-tRNA to A and P-sites57.

Future research

Even though C. jejuni has been shown over the past decade through a significant amount of research to be a promiscuous and a competent bacterial species, the biological triggers behind recombination which leads to host adaptation and emergence of new variants are very unclear. Further, most of the studies are based on the seven housekeeping alleles that cover the internal fragments (approximately 400–500 base pairs) of those genes used in the MLST scheme for typing the isolates16,58. The nucleotide differences outside the typing region always get neglected which is important when a functional gene is to be evaluated for evolution. Housekeeping genes involved in various metabolic functions and cellular processes play a critical role in the overall integrity and survival of micro-organisms in general. The correlation between GC variance and recombination indicates the vulnerability of housekeeping genes to evolutionary forces and further it also shows how dynamic the regions on these genes are to continuously respond to such stimuli. Although base composition varies with mutation and recombination events (which is an expected biological plausibility) it is intriguing to speculate (1) how frequently the nucleotide base composition change in C. jejuni; (2) how much nucleotide changes can a C. jejuni gene and/or genome tolerate; (3) what are the biological triggers that generate new C. jejuni variants; (4) in the event of recombination how long does the exchanged gene portion take to coalesce; and (5) what are the functional genes mostly affected by evolutionary forces?

Even though our study has made an effort to differentiate between two lineages of genes in C. jejuni and to substantiate and speculate the possible triggers for GC variation and its relationship with recombination, the data we used is very small to make concrete statements. However, larger genomic datasets will be able to provide distinctive resolution on the hypotheses formulated in our study and will also provide answers to the questions that we are raising in this paper.

Comments on this article Comments (0)

Version 1
VERSION 1 PUBLISHED 14 Mar 2013
Comment
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
F1000Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
Mohan V and Stevenson M. Campylobacter jejuni genomes exhibit notable GC variation within housekeeping genes [version 1; peer review: 2 not approved]. F1000Research 2013, 2:89 (https://doi.org/10.12688/f1000research.2-89.v1)
NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?
Key to Reviewer Statuses VIEW
ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions
Version 1
VERSION 1
PUBLISHED 14 Mar 2013
Views
49
Cite
Reviewer Report 07 May 2013
Trudy M. Wassenaar, Molecular Microbiology and Genomics Consultants, Zotzenheim, Germany 
Not Approved
VIEWS 49
This paper compared 50 genes, obtained from 12 Campylobacter genomes, to test two hypotheses. I rephrase them for clarity here as: (1) Does GC content provide evidence of recombination events, and can this shed light on the history of recombinational ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Wassenaar TM. Reviewer Report For: Campylobacter jejuni genomes exhibit notable GC variation within housekeeping genes [version 1; peer review: 2 not approved]. F1000Research 2013, 2:89 (https://doi.org/10.5256/f1000research.1103.r932)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
Views
50
Cite
Reviewer Report 02 May 2013
David Ussery, Center for Biological Sequence Analysis, Department of Systems Biology, The Technical University of Denmark, Lyngby, Denmark 
Not Approved
VIEWS 50
This short article examines variation in GC content and recombination in two different sets of genes, across nineteen Campylobacter jejuni genomes. After reading the abstract several times, I'm still not sure exactly what hypothesis is being tested here. I am ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Ussery D. Reviewer Report For: Campylobacter jejuni genomes exhibit notable GC variation within housekeeping genes [version 1; peer review: 2 not approved]. F1000Research 2013, 2:89 (https://doi.org/10.5256/f1000research.1103.r922)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.

Comments on this article Comments (0)

Version 1
VERSION 1 PUBLISHED 14 Mar 2013
Comment
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions
Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.