Needle lost in the haystack: multiple reaction monitoring fails to detect Treponema pallidum candidate protein biomarkers in plasma and urine samples from individuals with syphilis

Background: Current syphilis diagnostic strategies lack a sensitive means of directly detecting Treponema pallidum antigens. A diagnostic test that could directly detect T. pallidum antigens in individuals with syphilis would be of considerable clinical utility, especially for the diagnosis of reinfections and for post-treatment serological follow-up. Methods: In this study, 11 candidate T. pallidum biomarker proteins were chosen according to their physicochemical characteristics, T. pallidum specificity and predicted abundance. Thirty isotopically labelled proteotypic surrogate peptides (hPTPs) were synthesized and incorporated into a scheduled multiple reaction monitoring assay. Protein extracts from undepleted/unenriched plasma (N = 18) and urine (N = 4) samples from 18 individuals with syphilis in various clinical stages were tryptically digested, spiked with the hPTP mixture and analysed with a triple quadrupole mass spectrometer. Results: No endogenous PTPs corresponding to the eleven candidate biomarkers were detected in any samples analysed. To estimate the Limit of Detection (LOD) of a comparably sensitive mass spectrometer (LTQ-Orbitrap), two dilution series of rabbit cultured purified T. pallidum were prepared in PBS. Polyclonal anti-T. pallidum antibodies coupled to magnetic Dynabeads were used to enrich one sample series; no LOD improvement was found compared to the unenriched series. The estimated LOD of MS instruments is 300 T. pallidum/ml in PBS. Conclusions: Biomarker protein detection likely failed due to the low (femtomoles/liter) predicted concentration of T. pallidum proteins. Alternative sample preparation strategies may improve the detectability of T. pallidum proteins in biofluids.


List of abbreviations
hPTPs: Isotopically labelled proteotypic surrogate peptides
LOD: Limit of detection
MSM: Men who have sex with men

Introduction
Treponema pallidum ssp. pallidum (T. pallidum), a culturable 1 microaerophilic spirochete, is responsible for more than 8 million new cases of syphilis per year 2 . There has been a resurgence of syphilis in a number of world regions over the last two decades [2][3][4] . In Europe 3 and North America 4 , this increase has been most marked in men who have sex with men (MSM). A striking feature of these outbreaks has been the increasing proportion of cases that are occurring in patients with a previous diagnosis of syphilis 5,6 . Patients with reinfections are more likely to present with asymptomatic or less symptomatic disease 5 , hence the diagnosis of reinfection is wholly dependent on subtle changes in serological tests 7 . Two types of serological tests are used to diagnose syphilis: treponemal tests detect antibodies to T. pallidum and non-treponemal tests, such as the Rapid plasma reagin (RPR) test, detect agglutination secondary to the presence of anti-lipoidal antibodies reactive to material released from damaged host cells and possibly cardiolipin released from T. pallidum 8 . Treponemal tests remain positive for life and are therefore of no use in the diagnosis of reinfection. Non-treponemal tests are used for syphilis post-treatment follow-up and diagnosis of reinfection. A wide range of factors can result in increases in test titers, causing syphilis to be over-diagnosed and unnecessarily treated 7,9-11 . Direct T. pallidum detection techniques, including various nucleic acid amplification tests, have been developed, but apart from testing of primary ulcer specimens the sensitivity of these techniques is low 12 . Even in the setting of secondary syphilis, when there is a high T. pallidum load in the blood 13 , the sensitivity of polymerase chain reaction (PCR) tests reaches only 52 % on serum specimens 12,14 .
The T. pallidum genome, through evolutionary reduction, is one of the smallest of the human bacterial pathogens, with a predicted 1044 open reading frames 15 . Approximately half of the predicted proteins have been detected through MS techniques 16,17 , including the semi-quantification of T. pallidum proteins using spectral counting 17 . A T. pallidum transcriptome study demonstrated that almost all genes were expressed during peak rabbit experimental infection 18 . This maximum utilization of the genome, well characterized proteome, and swift invasion of the organism into the bloodstream (within 24 hours after infection 19 ) make this pathogen an ideal candidate for antigen diagnostic assay development. A variety of antigen tests against other pathogens have been designed for clinical samples such as blood, cerebrospinal fluid, faeces and urine; and these have proven their utility in the diagnosis and assessment of therapeutic response in a number of infections, including Helicobacter pylori 20 , Cryptococcus neoformans 21 , Cryptosporidium spp. 22 , Entamoeba histolytica 23 , Ebola virus 24 and Mycobacterium tuberculosis 25 . If a highly sensitive and specific test could be developed that is able to confirm the presence or absence of T. pallidum in the body then this would be of considerable utility in the diagnosis of syphilis reinfections and in assessing therapeutic response. It could also be useful for the diagnosis of neuro- and congenital syphilis, two diagnoses where contemporary tests are suboptimal 26 .
During the last decade, advanced MS-based proteomics platforms have emerged as mainstay bioanalytical tools for a broad range of clinical applications, including targeted protein identification 27 and bacteria identification and typing 28 . In particular, the AQUA workflow 29,30 , with its use of stable isotopically labelled standard proteotypic peptides (henceforth referred to as 'heavy' PTPs or hPTPs) and selected/multiple reaction monitoring-mass spectrometry (SRM/MRM MS), has emerged as a powerful technique for the fast determination of multiple protein concentrations in highly complex sample matrices such as urine (reviewed by Mermelekas et al. 31 ) and plasma (reviewed by Pernemalm and Lehtiö 32 ). Precise quantitation of proteins is possible by using hPTPs as internal standards that correspond to endogenous peptides created during the enzymatic digestion of the sample of interest. When combined, the endogenous and synthetic peptides elute together chromatographically and ionize with the same efficiency. Since the quantity of the labelled peptide is known, the absolute quantity of the targeted native protein can be determined by comparing MRM hPTP/endogenous peak areas. The precision and utility of this highly sensitive multiplexed method has been demonstrated on undepleted/unenriched plasma for the detection of a panel of human cardiovascular disease 33 and cancer 34 biomarkers with a detection capability of four orders of magnitude (10^3-10^4 range in protein concentration) and up to femtomolar level sensitivity in plasma 35 .
In this study, we investigated if T. pallidum proteins could be detected in plasma and urine samples from individuals with syphilis using a targeted proteomics (MRM) approach. Successful development of a T. pallidum antigen test will most likely be contingent upon the simultaneous detection of multiple protein biomarkers to comprehensively cover different stages of disease. Eleven T. pallidum protein biomarkers were chosen based on predicted specificity, high predicted abundance, and physicochemical properties. Thirty surrogate hPTPs were synthesized corresponding to the eleven candidate T. pallidum biomarkers. Analysis of eighteen plasma and four urine samples revealed no detectable MRM signal for the endogenous peptides from the biomarkers of interest. This is likely due to the extremely low (femtomoles per liter) predicted concentration of bacterial proteins in the samples of interest, or the fact that the biomarkers are not expressed during infection. T. pallidum spiking experiments established an MS detection limit of 300 bacteria/ml in PBS; polyclonal anti-T. pallidum magnetic bead enrichment did not improve the protein detectability.
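The internal-standard principle described in the Introduction can be illustrated with a minimal sketch. It is not the TargetLynx workflow used in this study: the function name, the peak areas and the spiked amount are hypothetical, and equal ionization efficiency of the endogenous (light) and hPTP (heavy) forms is assumed, as stated in the text.

```python
def endogenous_amount_fmol(light_area: float, heavy_area: float,
                           heavy_spike_fmol: float) -> float:
    """Estimate the endogenous peptide amount from a light/heavy peak-area ratio.

    Assumes the light and heavy peptides co-elute and ionize equally, so the
    area ratio equals the molar ratio (hypothetical helper, illustration only).
    """
    if heavy_area <= 0:
        raise ValueError("heavy (hPTP) peak area must be positive")
    return (light_area / heavy_area) * heavy_spike_fmol


# Hypothetical values: one transition, 100 fmol of hPTP spiked into the digest.
example = endogenous_amount_fmol(light_area=2.5e4, heavy_area=1.0e5,
                                 heavy_spike_fmol=100.0)
print(f"{example:.1f} fmol")  # prints "25.0 fmol"
```

When no endogenous signal is observed, the light peak area is zero and the estimate is zero, which in practice means the peptide is below the detection limit of the assay.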

Study participants
Between January 2014 and August 2015, 120 patients attending the Institute of Tropical Medicine Antwerp clinic, over the age of 17 years, and in whom a new diagnosis of syphilis was made and had not received antibiotics in the preceding thirty days, were recruited into the cohort study. Thirty HIV-positive controls, in whom the diagnosis of syphilis was excluded via serological and PCR testing, were also recruited. The diagnosis and staging of syphilis was according to the Centers for Disease Control and Prevention classification 43 , and treatment was administered according to European guidelines 44 . All patient sera were tested for syphilis using a RPR test (BD Macro-Vue RPR card test, Becton, Dickinson and Co., Sparks, MD, United States of America (USA)) and an antibody detection Treponema pallidum Particle Agglutination test (SERODIA-TPPA Fujirebio Inc., Tokyo, Japan). A PCR test targeting T. pallidum polA was also performed on serum 45 and whole blood samples were tested for multiple gene targets 46 , as previously described. Selection criteria of participants from the cohort study for the MRM assay analysis included a range of syphilis clinical stages and prioritized predicted high bacterial loads, as demonstrated by positive PCR tests and/or high RPR titres. Patients with early stage syphilis (primary, secondary, early latent) that were plasma and/or whole blood PCR positive for T. pallidum were expected to have the highest bacterial load 12,13 .
Plasma and urine sample processing
Plasma was collected immediately before Benzathine Penicillin G intramuscular injection using 7.5 ml EDTA-coated blood collection tubes (Sarstedt Monovette, Nümbrecht, Germany). We refer to these samples as the pre-penicillin samples. A random selection of patients participated in an additional blood draw three hours after penicillin treatment, since studies have shown penicillin to be fast acting on T. pallidum, leading to consequent cell lysis and antigen release 47 . These samples are termed the post-penicillin samples. Plasma was chosen for the MRM assay according to HUPO guidelines 48 . Protease inhibitors were not added to the plasma samples since previous studies did not demonstrate a significantly higher protein yield with treated samples 49 and peptides could inadvertently be modified 50 . Plasma samples were subjected to dual centrifugation in an Eppendorf 22331 centrifuge (Hamburg, Germany) in an effort to minimize cellular contamination: whole blood was centrifuged at 2000 g for 10 minutes at ambient temperature, followed by transfer of the plasma fraction to a 50 ml falcon tube and centrifugation at 2400 g for 15 minutes. All plasma samples were processed and aliquoted into cryovials for storage at -80 °C in a long-term freezer unit (Eppendorf U725-G Innova New Brunswick, Hamburg, Germany) until further testing. Mid-stream random-void urine samples were collected and processed following HUPO guidelines 51 , including centrifugation for 10 minutes at 2000 g at ambient temperature in order to remove insoluble contents such as cells and casts. Urine was aliquoted into 15 ml falcon tubes and stored at -80 °C until further testing. All plasma and urine samples were processed within three hours of collection and were only subjected to one freeze-thaw cycle.
was applied: 5 % MP-B during 1 min and from 5 to 35 % MP-B in 5 min, followed by a steep increase to 100 % MP-B in 1 min, all at a flow rate of 300 μL/min.
Based on the specific retention times of each peptide, three scheduled MRM runs of 10 minutes were generated, each of them containing 20 MS1 channels (10 endogenous (T. pallidum) PTPs without isotopic label and 10 channels with a synthetic hPTP equivalent). At least three transitions (ion pairs) were selected for each peptide of interest. For each scheduled MRM analysis, 50 μg of peptides (injection loop of 5 μL) per plasma/urine sample were loaded onto the analytical column. In addition to an extensive needle wash after each injection, a blank run was performed between two subsequent clinical samples to prevent carry-over effects. Data acquisition was controlled by MassLynx version 4.1, while targeted datasets were analysed by TargetLynx, which is part of MassLynx (Waters Corporation, Milford, MA, USA). All Xevo TQ MS raw spectral files are available at PeptideAtlas 54 with the identifier PASS00978.
Magnetic bead antibody-based enrichment of T. pallidum proteins and approximation of the MS LOD for T. pallidum protein detection
T. pallidum protein enrichment was performed using magnetic beads (Dynabeads® M-270, Life Technologies, CA, USA) coated with biotin-conjugated polyclonal T. pallidum-specific antibodies (PA1-73103, Thermo Fisher Scientific, CA, USA) through streptavidin-biotin conjugation. According to the manufacturer's protocol, 10 μg of antibody was used to bind 1 mg of beads (approximately 5 × 10^7 beads).
In vivo rabbit cultured purified T. pallidum DAL-1 strain extracts 55,56 were kindly provided by the group of David Šmajs from the Masaryk University, Czech Republic. The original concentration of the T. pallidum extract was approximately 10^6 bacteria/ml as quantified under darkfield microscopy using an Olympus BX41 (Olympus Corporation, Tokyo, Japan) equipped with darkfield microscope condenser DCW 1.4-1.2; magnification 10×40. Samples were stored in 1 ml phosphate buffered saline (PBS) and only subjected to one freeze-thaw cycle. Two dilution series of T. pallidum were prepared, each time starting in 1 ml of PBS and finally equating to eight approximate bacterial concentrations: 10^4, 10^3, 300, 100, 33, 10, 3 and 0 bacteria/ml.
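As a rough illustration of the dilution arithmetic behind such a series, the sketch below computes how much stock each concentration would require if every sample were diluted directly from the approximately 10^6 bacteria/ml extract into a final volume of 1 ml (C1·V1 = C2·V2). The direct-dilution scheme and all names are assumptions made for illustration; the text does not state how the dilutions were actually carried out.

```python
STOCK_PER_ML = 1e6          # approximate stock concentration (darkfield count)
FINAL_VOLUME_UL = 1000.0    # 1 ml of PBS per sample
TARGETS_PER_ML = [1e4, 1e3, 300, 100, 33, 10, 3, 0]  # series listed in the text


def stock_volume_ul(target_per_ml: float,
                    stock_per_ml: float = STOCK_PER_ML,
                    final_volume_ul: float = FINAL_VOLUME_UL) -> float:
    """Stock volume (µl) needed for a direct dilution, from C1*V1 = C2*V2."""
    return target_per_ml / stock_per_ml * final_volume_ul


for target in TARGETS_PER_ML:
    # Print the target concentration and the stock volume it would require.
    print(f"{target:>8g} bacteria/ml -> {stock_volume_ul(target):.3f} µl stock")
```

Under this assumption the lowest concentrations would need sub-microlitre volumes of stock, which is why a series like this would normally be prepared stepwise from intermediate dilutions.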
For one dilution series, each of the eight fractions was incubated with a constant amount (~10^5) of magnetic beads coated with polyclonal anti-T. pallidum antibodies. After incubation for two hours at 4 °C and magnetic separation, the supernatant was discarded and beads were washed three times with PBS. To lyse the antibody-bound bacteria, 1 ml of PBS was added to each bead sample and the samples were sonicated on ice using a Sonics Vibra Cell VC130 (Sonics and Materials Inc., Newtown, CT, USA) (two times 30 seconds with an amplitude of 50 %). The bead fraction (retentate) was retained after sonication by using magnetic separation. Released proteins were precipitated by adding ice-cold acetone and incubating overnight at -20 °C. Tryptic digestion was performed, following the aforementioned procedure, on both the precipitated proteins (supernatant) and directly "on-bead" (retentate), to test for proteins that may not have been released during sonication. For the second dilution series (unenriched), 1 ml was directly drawn from each of the eight samples. The samples from this series were also sonicated on ice (two times 30 seconds with an amplitude of 50 %) to lyse the bacteria. Released proteins were then acetone precipitated and subsequently digested, following the same procedure as for the parallel series. All LTQ-Orbitrap MS/MS raw spectral data are available at PeptideAtlas 54 with the identifier PASS00978.

Study subject inclusion
Eighteen syphilis-infected study participants were selected for the MRM assay analyses (Table 1). All participants were male and identified as MSM. A third of the participants (6/18; 33 %)  63 and is partially homologous to Tpr E/J. According to pBLAST analysis, none of the chosen biomarker proteins or corresponding PTPs demonstrated high homology with proteins from other pathogens, non-pathogenic commensal bacteria or humans (data not shown). One to three corresponding well-suited PTPs were selected for each biomarker, for a total of 30 PTPs. Details pertaining to these are provided in Table 2.

Multiple reaction monitoring assay optimization
The This was likely a false-positive non-specific peptide secondary to rabbit protein contamination since this short peptide sequence is closely homologous to the Oryctolagus cuniculus (rabbit) 60 kDa heat shock protein, or could have originated from the beads or antibodies. As a result, it has been excluded from the analysis. Three T. pallidum proteins detected in both the enriched and unenriched sample series were also biomarker candidates tested in the MRM assay experiments: Flagellar filament core protein

Figure 2. Work-flow diagram describing the estimation of T. pallidum protein MS LOD experiments.
In total, eight different concentrations of T. pallidum (from 10^4 to 0 bacteria/ml PBS) were treated in three different ways: i) T. pallidum was enriched using magnetic beads coated with polyclonal anti-T. pallidum antibodies and lysed by sonication for release of T. pallidum proteins in the supernatant. Acetone precipitated proteins were trypsinized; ii) in order to detect any remaining protein on the beads, the beads were also trypsinized (retentate on-bead trypsinization); iii) as a control, non-enriched samples were sonicated and immediately trypsinized. *-proteins selected as candidate biomarkers in this study. All samples were analysed by an LTQ-Orbitrap mass spectrometer.
Supplementary File 4).

Discussion
The T. pallidum MRM assay designed in this study failed to detect any of the 30 targeted proteotypic peptides related to eleven candidate T. pallidum protein biomarkers in eighteen plasma and four urine samples from individuals with syphilis. A number of explanations are possible. The foremost is the extremely low predicted concentration of bacterial proteins compared to host proteins. To a large extent our estimates of T. pallidum bacterial load in blood are based on molecular studies. In one of the largest studies, Tipple et al. found that the median copy numbers of Lipoprotein antigen Tp47 (TP_0574) DNA detectable per milliliter of whole blood were 127, 516 and 70 in primary, secondary and latent syphilis, respectively 13 . Other studies have produced comparable results 47,64,65 , with the exception of a recent study that found a median of 1.4 × 10^5 T. pallidum/ml in whole blood from patients with secondary syphilis 66 .
The concentration of T. pallidum in blood according to these PCR-based studies is lower than our estimated LOD in a shotgun experiment on diluted samples (300 T. pallidum/ml). Moreover, a 500-fold higher concentration would be needed in a clinical sample to reach this LOD, since the same amount of protein from 300 T. pallidum would have to be present in 2 μl rather than in 1 ml (see Supplementary File 4). Despite this outcome, we were hoping to detect T. pallidum proteins in the plasma or urine of some syphilis patients because i) MRM measurements are generally more sensitive than shotgun experiments since scanning times are drastically reduced and ii) the amounts from Tipple et al. 13 were averages, so we hypothesized that some patients (especially those with secondary syphilis) might have high T. pallidum levels detectable by MRM. These results could then motivate us to develop an (immuno)assay capable of detecting the proteins even at low concentrations.
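The 500-fold factor quoted above can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes that the factor is simply the ratio of the two volumes mentioned (1 ml versus 2 μl) and that one Tp47 DNA copy corresponds to one bacterium, neither of which is stated explicitly in the text. All variable names are hypothetical.

```python
LOD_PBS_PER_ML = 300          # estimated shotgun LOD in PBS (this study)
PROCESSED_VOLUME_UL = 1000.0  # 1 ml of PBS processed per dilution sample
ANALYSED_VOLUME_UL = 2.0      # volume quoted in the text for comparison

# Fold difference between the two volumes, and the concentration a clinical
# sample would need so that 2 µl contains the protein of 300 bacteria.
scale = PROCESSED_VOLUME_UL / ANALYSED_VOLUME_UL
required_per_ml = LOD_PBS_PER_ML * scale

# Median Tp47 DNA copies per ml of whole blood reported by Tipple et al.
median_copies_per_ml = {"primary": 127, "secondary": 516, "latent": 70}

# How many times higher the required concentration is than each median load.
shortfall = {stage: required_per_ml / copies
             for stage, copies in median_copies_per_ml.items()}

print(f"scale factor: {scale:.0f}x, required: {required_per_ml:.0f} bacteria/ml")
for stage, fold in shortfall.items():
    print(f"{stage}: about {fold:.0f}-fold below the required concentration")
```

Under these assumptions the required concentration is about 1.5 × 10^5 bacteria/ml, several hundred to a few thousand times the median loads reported by Tipple et al.; only the outlying estimate of 1.4 × 10^5 T. pallidum/ml 66 comes close to it.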
Little difference in T. pallidum abundance has been found between whole blood, plasma or serum 12 . Not much is known about the persistence of T. pallidum in the human urinary tract and to our knowledge no studies have quantified T. pallidum in the urine of syphilis-infected patients. However, even if T. pallidum does not consistently persist in the urinary tract, bacterial proteins present in the blood could be filtered through the glomerulus, ending up in the urine either intact or as peptide fragments, depending on the size of the protein and state of proteolysis 67 .
These considerations suggest that detection of T. pallidum proteins in human biofluids may not be possible without additional steps such as front-end immunoaffinity depletion 68 , two-dimensional LC separation 69 and/or selective enrichment of target proteins/peptides (as reviewed by Shi et al. 70 ). These techniques, or combinations thereof, have allowed the detection of low abundance proteins down to the low- to sub-nanogram/ml level 70,71 in clinical samples. For example, to reduce the wide dynamic range of plasma proteins, multicomponent single-step immunoaffinity depletion of high-abundant (host) proteins can allow up to a 10-20-fold enrichment of low-abundant proteins due to the depletion of 90-95 % of the total protein mass 68 . However, of particular concern with this approach is the possibility of concomitant removal of low-abundance proteins due to protein binding to the antibodies or high-abundant proteins, as shown in a study that systematically analysed the antibody-bound (high-abundant) protein fraction and found that this fraction contained 101 proteins at a high degree of confidence 72 . T. pallidum has a high binding affinity for constituents of serum and host cells, including laminin 73 , fibronectin 74,75 and albumin 76 , which may lead to unintentional depletion of targeted proteins if human protein-specific immunodepletion were applied. Furthermore, targeted mass spectrometric immunoassays (MSIA) that use surface-immobilized antibodies to affinity retrieve proteins from biological samples have proven their utility for clinical applications 77-79 . In our study, magnetic bead-coupled polyclonal anti-T. pallidum antibodies failed to detect significantly more T. pallidum proteins compared to the unenriched dilution series. Antibody effectiveness is dictated by binding affinity; we used commercial antibodies that were, to our knowledge, not previously characterized as to their binding affinity or targeted proteins.
Furthermore, it is unlikely that the polyclonal antibodies would bind a large range of proteins since few (<5 %) T. pallidum proteins are immunogenic 16,80 . The fact that T. pallidum can remain in 'plain sight' without invoking immune defences 81 , together with the very low amount of outer membrane proteins compared to other human pathogens 82 , also suggests that antibody enrichment of whole organisms and/or proteins would probably not be an effective strategy. Peptide-level immunoenrichment, also known as the 'Stable Isotope Standards and Capture by Anti-Peptide Antibodies' (SISCAPA) method developed by Anderson et al. 83 , has shown considerable promise as a high-throughput, automated, highly multiplexed approach for protein biomarker quantification, with MRM application detection limits in the low picogram/ml range of protein concentration in plasma 84 . If a selection of T. pallidum peptides could be definitively demonstrated to be present in plasma or urine, then this could be an attractive analytical approach with a strong potential for yielding the detection capabilities and precision needed for clinical applications.
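The 10-20-fold enrichment quoted above for immunoaffinity depletion follows directly from removing 90-95 % of the total protein mass. The short sketch below reproduces that arithmetic. It assumes, optimistically, that the low-abundance proteins are fully retained; as noted in the discussion, co-depletion may make the real gain smaller. The function name is hypothetical.

```python
def fold_enrichment(fraction_removed: float) -> float:
    """Relative enrichment of retained proteins after depleting a mass fraction.

    If a fraction f of the total protein mass is removed and the proteins of
    interest are fully retained, their share of the remaining mass rises by
    a factor of 1 / (1 - f).
    """
    if not 0.0 <= fraction_removed < 1.0:
        raise ValueError("fraction_removed must be in [0, 1)")
    return 1.0 / (1.0 - fraction_removed)


for removed in (0.90, 0.95):
    # 90 % removal gives about 10-fold, 95 % removal about 20-fold.
    print(f"{removed:.0%} depleted -> about {fold_enrichment(removed):.0f}-fold")
```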
However, apart from the low abundance in plasma or urine, other factors could explain why the T. pallidum proteins were not detected in our MRM experiments:
1. The LOD T. pallidum spiking experiments were performed in PBS buffer as opposed to a highly complex plasma or urine matrix background.
2. Variations in gene expression and structural components of proteins could also account for the lack of T. pallidum protein detection. Fluctuations in gene expression may explain why we did not find TprG, a protein implicated in phase variation which has been shown to be expressed at varying levels during infection due to changes in the number of guanine nucleotide repeats immediately upstream of its transcriptional start site 85 . Heterogeneous T. pallidum protein sequence sites 15,17,86 could also confound rigid MRM assay detection parameters. Such heterogeneity has been shown 17 to be present in one candidate biomarker, TP_0922, although this variable site was not present in the PTPs incorporated in this MRM assay. Poor proteolytic cleavage can stem from structural features of the protein, different digestion kinetics and post-translational modifications. For example, phosphorylated residues within two amino acids of the point of cleavage can hinder proteolysis 87 . Little is known about the extent of T. pallidum protein post-translational modification aside from a study that demonstrated glycosylation of the Flagellar core proteins (FlaBs) as reported by antibody and glycan staining techniques 88 ; however, the exact modification sites and extent of modification remain unknown. Other proteomics studies of L. interrogans have demonstrated likely roles for protein acetylation and methylation in virulence mechanisms 89,90 .
3. We only tested eleven out of more than a thousand predicted proteins in the T. pallidum proteome 57 , a selection largely based on spectral counting 17 as an estimation of protein abundance. We cannot assume, however, that this indirect manner of quantifying T. pallidum protein levels in a rabbit testicle model directly recapitulates T. pallidum protein expression levels in plasma samples of syphilis-infected patients. One of the reasons for this is that protein expression may vary according to host and disease stage. Antigen detection during latent stage disease will be especially challenging since T. pallidum has been shown to sequester itself in protected niches such as eyes, hair follicles and nerves 91 . Other T. pallidum proteins may be more suitable diagnostic biomarkers, given that they are reflective of the disease stages studied and that they are consistently present in the biofluids of interest. For example, Lipoprotein Tp47, which could still be identified in the most diluted T. pallidum sample (300 T. pallidum/ml) in this study, could be an interesting biomarker for future studies.
4. Various technical limitations such as a possible suboptimal chromatographic gradient length, modifiable proteotypic residues and protein degradation secondary to sample processing could have impeded biomarker detection. Other studies have reported chromatographic gradient lengths of 30 minutes or longer 33,34,36,39 , thus implementation of longer gradients could be considered in future studies in order to improve peptide resolution. In this study, chromatographic separations were performed as three separate runs using shorter 10-minute gradients in order to optimize the sample throughput without the loss of MS sensitivity due to overlapping transition windows. Co-eluting peptides were therefore split over different chromatographic runs, since plasma protein availability was not a limiting factor. Oxidizable proteotypic residues, namely cysteine, methionine and tryptophan, can cause artifactual modifications during processing or storage, resulting in multiple forms of targeted peptides. That said, the PTP selection process requires a balance between many different parameters, whereby selection of peptides containing suboptimal amino acid residues can sometimes remain the most favourable option. Ribosomal protein TP_0250b was only represented by one PTP, which may have limited detectability; future assays should ideally incorporate more than one peptide per protein.
5. Sample processing may have also contributed to protein degradation; therefore prompt analysis of fresh non-frozen biological specimens, if possible, is recommended. Moreover, alternative sample processing procedures, such as the use of molecular weight cut off filters to concentrate urine could improve protein detectability 40 .
6. Lastly, only a limited number of clinical samples were analysed, especially urine, and the study was a single-centre study with only MSM participants; the findings are therefore not generalizable.
An improvement for future studies would be the incorporation of isotopically labelled (non-T. pallidum) reference standards, which have been shown to improve analytical precision, detect variations in instrument performance and aid in detecting chemical interferences 92 .

Conclusions
In an effort to identify promising T. pallidum diagnostic biomarkers, we designed a scheduled MRM assay incorporating 141 MRM ion pairs corresponding to 30 PTPs from 11 T. pallidum proteins. Factors such as the extremely low (femtomoles per liter) predicted T. pallidum protein concentration in biofluids, possible variable protein expression according to host/disease stage and potential presence of protein post-translational modifications likely contributed to the lack of signal detection for all candidate biomarkers investigated. Since the proteins targeted in this study were likely buried in the proverbial haystack of plasma proteins, alternative sample preparation and analysis strategies are warranted. With the rapidly progressing innovations of MS applications and technology, we believe clinical proteomics is far from its pinnacle of potential.

Data availability
The datasets supporting the conclusions of this article are available in the PeptideAtlas 54 repository, with the identifier PASS00978, in addition to being provided within the article and its Supplementary Files.

Consent and ethics approval
The This paper by Van Raemdonck describes the use of mass spectroscopy to identify T. pallidum proteins from plasma and urine from infected patients. If successful, such a method would be very useful in syphilis diagnostics, particularly with regard to reinfection. Thus, the work addresses a significant problem. However, although they could detect isotopically labeled peptides spiked into the samples, they could not detect T. pallidum proteins from the infecting organisms. Limit of detection experiments suggest this is due to the very low concentrations of T. pallidum proteins in plasma and serum samples. Thus, the main goal produced a negative result. However, there is considerable useful information in this study. The limit of detection experiments with T. pallidum bacteria that have been diluted and were unenriched or enriched with antibody beads provided interesting results on which proteins could be detected and how many bacteria per ml were needed for detection. In addition, the MRM experiments appear to be carefully designed and provide important limit of detection information for future studies. The discussion provides a useful assessment of limiting factors in the direct detection of T. pallidum antigen proteins.

Comments:
1. Some of the description of LOD experiments with dilutions of T. pallidum on page 7, right column, found in both the unenriched and enriched retentate dilution series in one or more of the concentrations analyzed". This refers to the proteins that were commonly found in both experiments, regardless of concentration. The details of which are provided in Supplementary  Table 3.
The other sentence was also reworded for clarity: "Ten unique proteins were found in T. pallidum the highest concentration (10 bacteria/ml) four in the enriched retentate sample (N = 10) and non-enriched sample (N = 10), two proteins detected were unique to either the enriched or unenriched samples (Figure 3)."
2. Page 7, right column, paragraph 2. Tp47 is discussed twice in the paragraph with different gene names each time, i.e., Tp47 (TP_0547) and Tp47 (TP_0574).
Thanks for pointing this out; this has been rectified to the actual ORF (TP_0574).
The synthetic peptides were added after SPE clean-up. Why weren't they added before SPE to determine losses? Please supply more information on how LOD was calculated; was it based on the dilution series? The readers can use the explanation of calculations, if provided. The spiking experiments should have been done in real matrix?
Please comment more on PTP selection? Are they unique? What about labile residues? Was anything done to look at the methionine oxidation? Deamidations? etc. The synthetic peptides were 95% pure. Were they quantified? (AAA?) The paper makes an interesting list of its shortcomings in the discussion, which is helpful. A lot of the critique is already self-proclaimed. The overall conclusion of the manuscript is: "A lot of effort and fine-tuning of sample prep/method development will be needed for biomarker discovery and validation." Biomarker validation is time-consuming and challenging; perhaps some orthogonal experiments should have been done (such as western blotting) to be able to know if the global data acquired using spectral counting were good enough before moving on to the MRM experiments. Nonetheless, the authors have done a great job in discussing the shortcomings and in writing the paper.

If applicable, is the statistical analysis and its interpretation appropriate? Yes
Are all the source data underlying the results available to ensure full reproducibility? Yes
Why were only 4 urine samples analyzed? Which protein marker(s) should supposedly be detected in urine, and which one(s) in plasma? Please consider adding the details and rationale.
As this was an exploratory biomarker study, we did not stratify our biomarker selection by biofluid type; thus the potential biomarkers mentioned in this study were theoretically applicable to both urine and blood. No previous studies of this type have been performed, hence our selection, as described, was based on our shotgun proteomics studies of Treponema pallidum during rabbit infection, literature inferences from previous microarray studies, and physicochemical characteristics that would be amenable to MRM detection. Admittedly, four urine samples is a small number. After analyzing the initial set of samples, it became apparent that our experimental strategy was not working, so we decided not to go further. Despite this small number, we believe this information might be useful for other groups considering employing similar methods.
Authors have used microflow for the experiments on quadrupole, which has lower sensitivity. Perhaps targeted experiments using a nanoflow setup, as was done for experiments using orbitrap, will get better sensitivity.
Indeed, one would expect higher sensitivity with the nanoflow set-up. However, the microflow set-up is more advantageous for analyzing larger volumes of patient material, which might also increase the sensitivity. Moreover, targeted microflow LC-MS/MS experiments offer the benefits of robustness and increased throughput (the initial goal was to develop a method to analyze larger sample cohorts in a short time).
The synthetic peptides were added after SPE clean-up. Why weren't they added before SPE to determine losses?
In this exploratory study, multiple candidate biomarkers were included in the targeted setup to evaluate their potency. Therefore, it was not yet clear at which final concentration these synthetic peptides should be spiked into the samples. At this stage of the study, the synthetic peptides were also not exactly quantified (i.e. AQUA Basic peptides), which would make the determination of losses during sample preparation imprecise.
Please supply more information on how LOD was calculated; it was based on the dilution series? The readers can use the explanation of calculations, if provided.
The LOD calculations were based on the dilution series of T. pallidum in PBS, which were either enriched or unenriched with magnetic beads coupled with polyclonal antibodies directed against T. pallidum. These were then subjected to LTQ Orbitrap analyses. Two unique T. pallidum proteins, Cytoplasmic filament protein A (TP_0748) and Lipoprotein antigen Tp47 (TP_0574), were found in the 300 bacteria/ml fraction in the enriched and unenriched samples, respectively. Therefore, the LOD based on a high-resolution LTQ-Orbitrap instrument was approximately 300 bacteria/ml PBS for both the antibody enriched and unenriched samples, meaning there was no significant improvement in LOD using bead enrichment. These results are detailed in Supplementary File 3. Furthermore, rough concentration calculations based on previous studies were presented in Supplementary
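The criterion described in this response (the lowest dilution at which at least one T. pallidum protein is still identified) can be sketched as follows. This is only a minimal illustration, not the analysis pipeline used in the study: the function name `estimate_lod` is hypothetical, and every concentration and protein set other than the two identifications reported at 300 bacteria/ml is an invented placeholder.

```python
from typing import Dict, Optional, Set


def estimate_lod(detections: Dict[float, Set[str]]) -> Optional[float]:
    """Return the lowest concentration (bacteria/ml) with at least one
    identified T. pallidum protein, or None if nothing was identified."""
    positive = [conc for conc, proteins in detections.items() if proteins]
    return min(positive) if positive else None


# Placeholder dilution series. Only the 300 bacteria/ml entries reflect the
# identifications reported in the response; all other values are invented.
unenriched = {
    30000.0: {"TP_0574", "TP_0748"},  # hypothetical
    3000.0: {"TP_0574"},              # hypothetical
    300.0: {"TP_0574"},               # Tp47, as reported
    30.0: set(),                      # hypothetical: nothing identified
}
enriched = {
    30000.0: {"TP_0574", "TP_0748"},  # hypothetical
    3000.0: {"TP_0748"},              # hypothetical
    300.0: {"TP_0748"},               # Cytoplasmic filament protein A, as reported
    30.0: set(),                      # hypothetical: nothing identified
}

lod_unenriched = estimate_lod(unenriched)
lod_enriched = estimate_lod(enriched)
print(lod_unenriched, lod_enriched)
```

With equal values for both series, the comparison mirrors the statement that bead enrichment gave no LOD improvement. A stricter variant could additionally require that all higher concentrations are also positive.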

The spiking experiments should have been done in real matrix?
Indeed, the final dilution series of the labeled synthetic peptides, which would be used to determine the absolute concentration of the candidate biomarkers, would have been done in real matrix. However, at this point of the study the goal was to evaluate the abundance of the selected proteotypic peptides before using absolutely quantified labeled peptides (e.g. AQUA Ultimate). Therefore, it was decided to tune and optimize the LC-MS/MS parameters of each labeled peptide without any matrix to determine the optimal instrument settings.
Please comment more on PTP selection? Are they unique? What about labile residues? Was anything done to look at the methionine oxidation? Deamidations? etc.
Due to the lack of available MS datasets about the Treponema pallidum proteome (no library available), proteotypic peptides of each candidate protein biomarker were predicted in silico. As described, ESP predictor (Fusaro et al., 2009, Nat. Biotechnology) was used to find the most suitable proteotypic peptides based on 550 physico-chemical parameters, including potential modifications (e.g. oxidation of methionine, deamidation, phosphorylation etc.). The best-scoring peptides were selected for each of the proteins. Moreover, the selected PTPs were subjected to BLAST analyses to confirm their uniqueness.
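To make the reviewer's question about labile residues concrete, the sketch below shows a simple rule-based pre-filter of tryptic peptides. It is not the ESP predictor and does not reproduce the authors' selection: the protein sequence is an invented toy example, the function names are hypothetical, and the length window (7 to 25 residues) and the N-G/Q-G deamidation motif are common rules of thumb assumed here for illustration.

```python
import re
from typing import List


def tryptic_peptides(sequence: str, min_len: int = 7, max_len: int = 25) -> List[str]:
    """In silico trypsin digest: cleave C-terminal to K or R unless the next
    residue is P, then keep peptides within the given length window."""
    pieces = re.split(r"(?<=[KR])(?!P)", sequence)
    return [p for p in pieces if min_len <= len(p) <= max_len]


def labile_flags(peptide: str) -> List[str]:
    """Flag residues or motifs prone to artefactual modification."""
    flags = []
    if "M" in peptide:
        flags.append("Met oxidation")
    if re.search(r"[NQ]G", peptide):
        flags.append("deamidation-prone motif")
    return flags


# Invented toy sequence, not a T. pallidum protein.
toy_sequence = "MSTLLDAEKVNGSYTELRPAQWFDGTLKAAVEEGIYSR"

peptides = tryptic_peptides(toy_sequence)
candidates = [p for p in peptides if not labile_flags(p)]

for p in peptides:
    print(p, labile_flags(p))
```

Peptides passing such a filter would still need a uniqueness check (e.g. the BLAST analysis mentioned in the response), which is outside the scope of this sketch.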
The synthetic peptides were 95% pure. Were they quantified? (AAA?)
During this exploratory study, AQUA Basic peptides (Thermo Fisher Scientific) were used to evaluate the abundance of the selected proteotypic peptides. Although the quantity of the PTPs was specified in the leaflet, the peptides were purchased in a lyophilized formulation as one aliquot. Therefore, they are not suited as a reference for absolute quantification. In a next step, AQUA Ultimate peptides (with a high concentration precision) would have been used to determine the absolute abundance of the protein biomarkers.

None
Competing Interests: