Interrogation of an ovine serum peptide spectral library to annotate ambiguous clinicopathological biomarkers using data-independent acquisition

Saul Chemonges

doi:10.12688/f1000research.128316.1

Home Browse Interrogation of an ovine serum peptide spectral library to annotate...

ALL Metrics

-

Views

-

Downloads

Get PDF

Get XML

Export

▬

✚

Research Article

Interrogation of an ovine serum peptide spectral library to annotate ambiguous clinicopathological biomarkers using data-independent acquisition

[version 1; peer review: 2 approved with reservations]

Saul Chemonges ^1,2

PUBLISHED 05 Dec 2022

Author details Author details

¹ School of Veterinary Science, The University of Queensland, Gatton, QLD, 4343, Australia
² Proteomics and Small Molecule Mass Spectrometry, Central Analytical Research Facility, Queensland University of Technology, Brisbane, QLD, 4000, Australia

Saul Chemonges
Roles: Conceptualization, Data Curation, Formal Analysis, Funding Acquisition, Investigation, Methodology, Project Administration, Resources, Software, Supervision, Validation, Visualization, Writing – Original Draft Preparation, Writing – Review & Editing

OPEN PEER REVIEW

REVIEWER STATUS

Abstract

Background: The use of data-independent data acquisition mass spectrometry (DIA-MS) on biological samples from domestic animals is still uncommon. Here, sequential window acquisition of all theoretical mass spectra (SWATH-MS) – a variant of DIA-MS was used to analyse serum peptides of healthy sheep as compared with serum of sick sheep by interrogating a novel peptide spectral library (PSL). This approach enabled the detection and annotation of a wide range of proteins, than conventional clinical pathology protein assays.
Methods: Serum samples from healthy sheep were obtained from a commercial source and normalised to represent a healthy sheep proteome background and then compared with serum samples of sheep suffering from a range of naturally-acquired illnesses submitted to The University of Queensland, Australia. Purified tryptic peptides were subjected to liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS) on a quadrupole time-of-flight instrument (TripleTOF 5600+, SCIEX) set in a cyclic data-independent acquisition (DIA) mode using a generic (SWATH™, SCIEX) acquisition method. Data were processed using PeakView® v2.2 software with SWATH™ Acquisition MicroApp 2.0 (SCIEX) and MarkerView™ v1.3 software (SCIEX) pipeline to generate protein lists for downstream gene ontology annotation and pathway analysis of identified proteins.
Results: There were distinct differences in peptide chromatographic features of sick sheep samples compared to those from healthy sheep. Healthy and sick sheep serum samples yielded 335 and 236 protein identifications (IDs), respectively. There were 96 protein IDs unique to sick sheep serum. A total of 431 protein IDs were annotated by combining healthy control and sick sheep protein IDs.
Conclusions: SWATH analysis successfully aided in the detection some established clinicopathological serum biochemical analytes. This approach enabled the distinction of protein profiles of sick sheep samples from a healthy control sample, thereby providing a promising future perspective for the application of SWATH analysis in veterinary clinical use.

Keywords

Veterinary SWATH analysis, peptide spectral library, sheep serum proteomics, TICs, annotation of proteins in serum, nanoLC-nanoESI-MS/MS, sick sheep, healthy sheep

Corresponding author: Saul Chemonges

Competing interests: No competing interests were disclosed.

Grant information: This work was undertaken as part of author’s PhD which was financially supported by an Australian Postgraduate Award scholarship through The University of Queensland and personal resources. All the facilities for the experimental work were funded by a collaborative arrangement with Queensland University of Technology Central Analytical Research Facility (QUT-CARF) arranged by the author. Academic supervisors, laboratory personnel and other personnel involved in this work were paid employees of their respective institutions.

Copyright: © 2022 Chemonges S. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

How to cite: Chemonges S. Interrogation of an ovine serum peptide spectral library to annotate ambiguous clinicopathological biomarkers using data-independent acquisition [version 1; peer review: 2 approved with reservations]. F1000Research 2022, 11:1433 (https://doi.org/10.12688/f1000research.128316.1) First published: 05 Dec 2022, 11:1433 (https://doi.org/10.12688/f1000research.128316.1) Latest published: 05 Dec 2022, 11:1433 (https://doi.org/10.12688/f1000research.128316.1)

List of abbreviations

ACN: Acetonitrile

ALP: Alkaline phosphatase

AST: Aspartate aminotransferase

ATP: Adenosine triphosphate

AUC: Area under curve

BCA: Bicinchoninic acid assay

CARF: Central Analytical Facility

CCKR: Cholecystokinin receptors

CK: Creatine Kinase

DIA: Data independent acquisition

DTT: Dithiothreitol

EBI: European Bioinformatics Institute

EGF: Epidermal growth factor

EMBL: European Molecular Biology Laboratory

FA: Formic acid

FGF: Fibroblast growth factor

GGT: Gamma-glutamyl transferase

GO: Gene ontology

IAM: Iodoacetamide

ID: Identification

LC: Liquid chromatography

MS/MS: tandem mass spectrometry

MS: Mass spectrometry

nanoLC-nanoESI-MS/MS: nano liquid chromatography nano electrospray ionisation tandem mass spectrometry

PANTHER: Protein ANalysis THrough Evolutionary Relationships

PCA: Principal component analysis

PRIDE: Proteomics identifications

PSL: Peptide spectral library

QUT: Queensland University of Technology

RT: Room temperature

SWATH: Sequential acquisition of all theoretical fragrant mass spectra

TEMED: Tetramethylethylenediamine

TFA: Trifluoroacetic acid

TOF: Time-of-flight

TP: Total protein

UQ: The University of Queensland

Introduction

Until now, the use of sequential window acquisition of all theoretical fragment mass spectra (SWATH-MS) analysis¹ to complement traditional shotgun MS-based proteomics² on samples from species of veterinary importance has been uncommon. Only a limited number of published reports have applied this approach on animal experimental subjects, for example in the quantification of hepatic proteins of chicken exposed to heat stress,³ analysis of seminal plasma protein of pigs,⁴ identification of proteins involved in nutritional stress in goats,⁵ quantification of protein alterations in plasma of acutely endotoxaemic sheep,⁶ and detecting proteins in peptide spectral libraries of bovids.⁷ Otherwise, much of the work in this area has been centred upon in-vitro studies on cell lysates⁸^–¹¹ and body fluids obtained from mice.¹² Despite its promising nature in protein analysis, SWATH-MS analysis remains largely untested on actual clinical samples from veterinary patients.

In this report, SWATH-MS was used to analyse serum samples obtained from sheep suffering from a range of naturally-acquired illnesses and compared with samples from healthy sheep, by interrogating a novel peptide spectral library (PSL) constructed from the ovine circulating acellular proteome.⁶^,¹³ The sick sheep serum samples were of actual clinical cases submitted to The University of Queensland (UQ) (Gatton, Australia). Serum samples from twenty healthy sheep were obtained from a commercial source (Serum Australis Pty Ltd.) and normalised to represent an analytical normal proteome background. Tryptic peptides were purified by StageTip technique,¹⁴^,¹⁵ prior to nano-liquid chromatography nano-electrospray ionisation tandem mass spectrometry (nanoLC-nanoESI-MS/MS) in a cyclic data-independent acquisition (DIA) mode, using a generic SWATH-MS acquisition method (SWATH™, SCIEX). The acquired data were processed in PeakView® software with SWATH™ Acquisition MicroApp 2.0 (SCIEX) pipeline to generate protein lists for downstream gene ontology annotation and pathway analysis of identified proteins.

The use of SWATH-MS analysis enabled protein profiles of individual sick sheep to be differentiated as compared to the normalised healthy sheep serum aliquots. This approach detected some established clinical biochemical analytes and possibly other candidate disease markers. These observations suggest that it is potentially feasible to use SWATH-MS in conjunction with the novel PSL to determine relative protein alterations in clinical samples from veterinary patients in future.

Methods

Ethics

The experiments included in this study were conducted in accordance with the Australian Code of Practice for the Care and Use of Animals for Scientific Purposes.¹⁶ Animal tissue samples for the generation of the PSL were acquired following approval from Queensland University of Technology (QUT) Confirmation number: 1400000591, QV reference number: 46375 and Ethics number TRIM 10/8428 issued to Serum Australis Pty Ltd (http://www.serumaustralis.com.au), for bleeding live sheep and supply of blood products. Tissue from live animal experiments had animal ethics approval obtained from the University Animal Ethics Committee of QUT (Reference No. 0800000555), which was ratified by The University of Queensland (UQ).

Experimental samples and workflow

Ex-diagnostic serum samples of seven sheep that presented with a range of clinical manifestations each, were opportunistically obtained from UQ laboratory archive for analysis. The accompanying clinical details of these sheep are presented in Additional file 1.⁴⁸ In this set of experiments, the individual code-identified samples from each sheep (SCi502, SCi503, SCi504, SCi507, SCi508, SCi509 and SCi510), a pooled sample of all of them (SCi511) and a pooled sample of aliquots from 20 healthy sheep obtained from Serum Australis Pty Ltd (SA) (SCi512) as the normalised analytical control for the experiments, were prepared and subjected to proteomic analysis using the SWATH-MS (SCIEX) analysis pipeline (Figure 1).

Figure 1. Schematic experimental workflow showing serum samples and SWATH-MS analysis pipeline.

Serum samples derived from seven sick sheep (SWATH Sample ID: SCi502, SCi503, SCi504, SCi507, SCi508, SCi509 and SCi510), a pooled sample from all the sick sheep (pooled sick) (SWATH Sample ID SCi511) and a normalised sample pooled from 20 healthy sheep (pooled normal) (SWATH Sample ID SCi512). The acquired SWATH data from each sample were processed in PeakView® v2.2 software with SWATH™ Acquisition MicroApp 2.0 (SCIEX) – SWATH pipeline. Data were exported into MarkerView™ v1.3 software (SCIEX) for statistical analysis protein list generation. Protein lists were exported to spreadsheet (Microsoft® Excel™), which facilitated comparative analysis using a Venn diagram (BioVenn software).

Sample preparation

Samples were prepared as in a previously described method.¹³ In brief, frozen sheep serum samples were thawed and kept under refrigeration conditions with an added protease inhibitor (cOmplete™, Roche) according to the manufacturer’s instructions. Each sample was agitated vigorously for half a minute before centrifuging at 13,000 g for 20 minutes. Only the supernatant was retained for downstream analysis, after determining the protein concentration with bicinchoninic acid (BCA) protein assay method (BCA Protein Assay Kit, Pierce™) and a spectrophotometer (NanoDrop 2000; Thermo Scientific).

Proteins in the sample supernatants were precipitated by adding 4 × (v:v %) of cold acetone (20 °C) and then incubated at this temperature for 16 h, followed by centrifugation at 4,000 g for 2 minutes. The pellet was then retained and washed using 1 mL of cold acetone by agitating vigorously to break the pellet. The suspension was then centrifuged at 4,000 g for 5 min at 4 °C and sediment was retained. This washing procedure was performed once more. The pellet was dissolved in aqueous fresh 8 M urea in 25 mM ammonium bicarbonate (NH₄HCO₃) and then centrifuged at 4,000 g for 5 min at 4 °C. The protein concentration of the supernatant was then determined by BCA protein assay.

While still under refrigeration conditions, sample aliquots comprising 100 μg of protein were reduced with 20 mM DTT (titrated to 5 mM final concentration) and then incubated for 1h at room temperature (RT), followed by alkylation with 55 mM iodoacetamide (IAM) (resulting in 14 mM final concentration) and incubated again for 20 min in the dark at RT. Alkylation was suppressed using dithiothreitol (DTT) prior to further incubation for 5 min in the dark after which the solution was diluted with 25mM NH₄HCO₃ buffer followed by titrating aqueous 70 mM CaCl₂ into the samples (aiming at 10 mM final concentration). Trypsin (Promega) was added at a ratio of 1:50 (enzyme to protein concentration) and the contents were incubated for 16 h at 37 °C with gentle shaking and then cooled to RT. Protein digestion was curtailed using 10% trifluoroacetic acid (TFA) before vacuum drying the contents. Peptides were then dissolved in aqueous 0.1% TFA/2% acetone nitrile (ACN) and routinely desalted using octadecyl carbon chain (C₁₈) pipette tips (ZipTip® Pipette Tips, Millipore), vacuum dried, reconstituted in 10 uL of aqueous 2% ACN/0.1% FA for nanoLC-nanoESI-MS/MS analysis.

nanoLC-nanoESI-MS/MS analysis

Chromatography

Quantities of about 400 ng – 1 μg of peptides per sample were subjected to reversed-phase chromatography setup in a trap and elute arrangement across a 90 min gradient at 40 °C. Two mobile phases A and B were used, prepared from aqueous 0.1% formic acid (FA) and ACN/0.1% FA, respectively on a nanoLC-nanoESI-MS/MS system (TripleTOF® 5600+ instrument (SCIEX)).

Data independent acquisition (DIA)

Spectral data of purified eluted peptides were extracted by cyclic DIA using a generic SWATH™ acquisition method described elsewhere.¹^,³ The mass spectrometer was operated using 0.1 s for the survey MS run, followed by tandem mass spectrometry on all precursors in a recurring manner with an accumulation time of 0.1 s per SWATH window across 36 windowsl, each 26 m/z units wide for total cycle duration of 3.75 s in order to sample at least 6 data points for each chromatographic peak per peptide to ensure improved quantification precision.

Data analysis

The proprietary raw data files (.wiff format) were concurrently imported into PeakView® v.2.2 software with integrated SWATH™ Acquisition MicroApp 2.0.1 (SCIEX) for spectral alignment and targeted data extraction from the PSL to be interrogated. The PSL was assembled from hundreds of samples derived from serum and plasma of sick and healthy sheep that had hitherto been analysed by nanoLC-nanoESI-MS/MS by data dependent acquisition (DDA) on the same instrument as described in earlier.⁶^,¹³ Extracted ion chromatograms (XICs) from MS/MS spectra for targeted peptides were generated by the SWATH™ Acquisition MicroApp 2.0.1, by assimilating peak areas from the SWATH™ data files as detailed in the vendor’s technical note (http://tinyurl.com/j6w83pu).

The parameters for SWATH™ data extraction were set as follows: five peptides per protein, five transitions per peptide, peptide confidence level of >95%, exclude shared peptides, and extracted ion chromatograph (XIC) width of 50 ppm. Processed data were exported as quantitative output for the peak area under the intensity curve for individual ions, the summed intensity of individual ions for a given peptide, and the summed intensity of peptides for a given protein in .txt format. Data were then statistically analysed using MarkerView™ software (SCIEX). Data were then exported in.xlxs format (Microsoft® Excel™) into spreadsheet for inspection and comparative visual analysis using BioVenn software.¹⁷

There are other open-access alternative software that can be used to for analysis of the data used in this article, for example the Trans-Proteomic Pipeline (TPP),¹⁸ Skyline¹⁹ and OpenSWATH.²⁰^,²¹

Access to raw proteomics data files

The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium²² via the PRIDE²³ partner repository with the dataset identifiers PXD005002 and PXD005077.

Results

Chromatography of peptides, protein identification and comparisons between sick and healthy sheep

The total ion chromatograms of each of the analysed peptides derived from serum samples of sick sheep were processed in PeakView® software and compared with that of healthy sheep (Additional file 2⁴⁸). A total of 335 proteins (Additional file 3(a)⁴⁸) were identified in the healthy sheep sample pool (SCi512) that was derived from aliquots of 20 sheep, representing the normal proteome background. Protein identifications (IDs) from individual samples of sick sheep were compared to the IDs of SCi512 to reveal shared and unshared overlaps in protein IDs, and total combined protein IDs in each individual clinical case experiment as shown in Figure 2A. A comparison of protein IDs from pooled normal sheep serum (SCi512), pooled sick sheep (SCi511) and those unique to sick sheep only is presented Figure 2B. There were proteins that were unique to each of the sick sheep samples, as compared to healthy sheep pool. These unique proteins numbered 55, 95, 61, 54, 80, 61, 30, and 70 for SCi502, SCi503, SCi504, SCi507, SCi508, SCi509, SCi510, and SCi511, respectively. The UniProtKB entries for the individual proteins unique to each sick sheep sample are presented in Additional file 3 (b), (c), (d), (e), (f), (g), (h) and (i)⁴⁸, respectively.

Figure 2. Proteins identified using SWATH-MS in sick and healthy sheep serum samples.

A - Shows the number of protein IDs obtained using SWATH™ processing of individual serum samples SCi504, SCi503, SCi504, SCi507, SCi508, SCi509 and SCi510 from sick sheep, and a pooled sample from all the preceding samples (SCi511), that were compared with a pooled serum sample from healthy sheep (SCi512). B - Illustrates a composite analysis of all the samples that yielded 431 protein IDs, 335 of these IDs were attributed to SCi512, 210 IDs were from the pooled sick sheep sample (SCi511) and 96 IDs were exclusive to sick sheep serum only. The 26 protein IDs that were not detected in SCi511 were identified by analysing proteins that were exclusive to sick sheep (sick sheep only fractions) of the individual samples in A.

The UniProtKB entries for the 195 proteins that were identified as being unique to the healthy sheep sample pool (i.e. proteins that were not detected in sick sheep samples illustrated in Figure 2B), are listed in Additional file 3 (j).⁴⁸ Also listed, are the 96 protein IDs (Additional file 3 k⁴⁸) that were unique to sick sheep and the 26 protein IDs (Additional file 3 (l)⁴⁸) that were not detected in the sick sheep pool (SCi511) compiled from protein IDs exclusive to the sick sheep samples. The UniProtKB entries for all of the 431 proteins identified in the entire workflow comprising of serum samples from sick and healthy sheep in Figure 2B are listed in Additional file 3 (m).⁴⁸

Differential abundance of proteins

There was differential abundance in proteins identified across all the analysed samples, including protein IDs from healthy sheep. The relative intensities of the top 10 most abundant protein IDs in healthy sheep serum pool in descending order, alongside protein IDs from the seven individual sick sheep samples (SCi502, SCi503, SCi504, SCi507, SCi508, SCi509 and SCi510), and the pooled sample from sick sheep (SCi511) are illustrated in Figure 3. The relative abundance of many proteins was variable between the different samples as presented in Additional file 4.⁴⁸ There were notable differences in protein intensities between the healthy sheep serum pool and the sick sheep sample pool as shown in Figure 4.

Figure 3. Ten most abundant proteins identified by SWATH-MS in sick and healthy sheep serum samples.

Protein intensities from a pooled sample of healthy sheep (SCi512) in descending order were compared with intensities from sick sheep samples (SCi502, SCi503, SCi504, SCi507, SCi508, SCi509, SCi510 and SCi511 (pooled sick)) and corresponding proteins.

Figure 4. Disposition of protein intensities in healthy versus sick serum samples.

A – Shows the relative summed protein intensities of all proteins identified by SWATH-MS analysis in a pooled sample from healthy sheep (SCi512) as compared with a pooled sample (SCi511) derived from 7 samples (SCi502, SCi503, SCi504, SCi507, SCi508, SCi509 and SCi510) from sick sheep. B – Shows the quantitative comparison between the two samples representing healthy sheep and sick sheep was possible, however only 140 identified proteins were common between SCi512 and SCi511. In (B), taking the protein intensities calibration plot from SCi512 as the normalised control (or benchmark), protein IDs in sample SCi511 falling above the blue line were considered upregulated, whilst those below this line were designated as downregulated. Note the wide dynamic range in the intensities of some proteins in both (A) and (B).

Of the 335 proteins identified in the healthy sheep serum pool, the ten most abundant proteins (see Figure 2B and Additional file 3(a)⁴⁸) in descending order were W5PU57 (Nuclear envelope pore membrane protein POM 121C), W5PBH6 (RAD54 like 2), W5QGI6 (Exonuclease 3'-5' domain containing 1), W5P860 (Dual-specificity kinase), W5Q2U7 (Plectin), O02762 (Apolipoprotein A1), W5Q7I2 (I Ig-like domain-containing protein), C5IS96 (Lecithin-cholesterol acyltransferase), W5PI54 (Rabphilin 3A) and W5P8F9 (BPI1 domain-containing protein). The ten least abundant proteins in this healthy sheep sample pool were W5P0B2 (LDL receptor related protein 2), W5QBV7 (CD44 antigen), W5PAS6 (EvC ciliary complex subunit 2), W5QCV8 (Cadherin EGF LAG seven-pass G-type receptor 2), W5P640 (Lamin A/C), W5PQ96 (Ciliary rootlet coiled-coil, rootletin), W5P7B1 (Sirtuin 2), W5QGV5 (Dedicator of cytokinesis 10), W5QGX2 (Spectrin beta, non-erythrocytic 5) and A2P2G4 (VH region).

Of the 210 proteins identified in the sick sheep serum sample pool (SCi511)(Figure 2B), the ten most abundant proteins in descending order were W5QIV1 (Protein S100), W5QHZ8 (Ig-like domain-containing protein), W5PGA9 (Nicastrin), W5PI92 (Apolipoprotein C-IV), W5NV45 (Hephaestin like 1), A2P2I4 (VH region), W5P640 (Lamin A/C), B0FZL9 (Pre-mRNA splicing factor SRP20-like protein), W5PVM3 (Myocilin) and W5NQ83 (RNA binding motif protein 25). The ten least abundant proteins in this sample were W5PJG0 (Serum amyloid A protein), W5NX96 (Attractin), W5PV45 (Centrosomal protein of 162 kDa), W5PBY0 (Complement component 4 binding protein alpha), W5P860 (Dual-specificity kinase), P12303 (Transthyretin), W5QFP0 (Thrombospondin 1), W5P3J3 (Complement C1s), W5NXP3 (Serpin A3-6-like) and W5PHP8 (Leucine rich alpha-2-glycoprotein 1).

Gene ontology annotation and pathway analysis of identified proteins

All the 236 proteins identified from a composite of all sick sheep serum samples and the 335 proteins from healthy sheep pool (Figure 2B) — taking into consideration overlapping protein IDs, were subjected to gene ontology (GO) classification, within the domain terms of molecular function, biological process, cellular component, protein class using the PANTHER (Protein ANalysis THrough Evolutionary Relationships) classifications system²⁴ (Figure 5). Using the UniProtKB gene identification and mapping tool, 215 of the 236 proteins identified in sick sheep serum were mapped to 206 named gene IDs of sheep. Of these identified sheep genes, only 172 of them were recognised by the PANTHER tool after aligning them to Bos taurus — the species with a genome most closely homologous that of sheep. Likewise, 299 of the 335 proteins identified in healthy sheep serum pool were mapped to 290 sheep genes, however, only 251 genes were recognised in PANTHER based on bovine homologous entries.

Figure 5. Gene ontology (GO) term annotation of proteins identified in serum of sick versus healthy sheep serum.

Note that the number of genes and individual GO term hits were different between sick and healthy sheep pool.

Apart from the number of distinct genes and individual GO term hits, the fractional representation (percentages) of the contributing elements within the domains between sick and healthy sheep serum were comparable (Figure 5). Catalytic activity, binding, structural molecule and receptor activity had comparatively larger representations in the molecular function domain. Cellular process, metabolic process, response to stimulus and localisation had the largest representation in the biological process domain. In the cellular component domain, cell part, extracellular region, organelle and macromolecular complex had the largest representation. As for the protein class domain, enzyme modulator, hydrolase, signalling molecule and receptor terms had the largest representation of the GO terms. It was evident from the molecular function domain that large representations of proteins are involved in catalytic activity (36%) and binding (35%). The biological process domain is interesting for these case studies that involve pathology in that it illustrates inclusion of immune process (5%) and response to stimulus (9%), alongside metabolic (20%) and cellular processes (25%). Cell part (33%), extracellular region (22%) and organelle (20%) GO terms were predominant in the cellular component domain. Meanwhile defense/immunity comprised of only 3%, compared to hydrolase (13%) signalling molecule (12%), receptor (9%) and oxidoreductase (4%) terms in the protein class domain.

Protein pathway analysis was conducted on all the identified proteins from healthy sheep serum sample pool and sick sheep serum using PANTHER (Figure 6). Differences were observed in the enriched protein pathways between sick and healthy sheep. In the results from serum derived from sick sheep, the predominantly represented protein pathways were ATP synthesis, interleukin signalling, angiogenesis, Alzheimer disease-presenilin, integrin signalling, inflammation mediated by chemokine and cytokine signalling, gonadotropin-releasing hormone receptor, cytoskeletal regulation by Rho GTPase, blood coagulation, Huntington disease, p53 pathway, Wnt signalling pathway and Glycolysis pathways. As for the serum derived from healthy sheep, the predominantly represented protein pathways were Alzheimer disease-presenilin pathway, integrin signalling, inflammation mediated by chemokine and cytokine signalling pathway, Huntington disease, Wnt signalling pathway, glycolysis, Parkinson disease, cytoskeletal regulation by Rho GTPase, cadherin signalling, blood coagulation, cholecystokinin receptors (CCKR) signalling map and gonadotropin-releasing hormone receptor pathways.

Figure 6. Pathway analysis of proteins identified in serum of sick, versus healthy sheep serum.

Note the differences in the enriched protein pathways between samples from sick and healthy sheep.

A note about the sick sheep serum samples used for this experimental workflow

The sick sheep serum samples utilised for these experiments were submitted by referring veterinarians for analysis at UQ using automated methods for routine veterinary diagnostic work up. The samples were accompanied by brief presenting clinical case histories presented in Additional file 1.⁴⁸ Follow-up information on these cases was not available in laboratory entries obtained for this report. With the clinical case history data at hand, the most important of them were signalment (except for two samples (SCi504 and SCi508)) whose ages were missing and the laboratory findings from the analysis of the samples.

Discussion

The experiments for generating this report utilised SWATH-MS technology to primarily analyse proteins in ex-diagnostic serum samples of sheep suffering from a range of different ailments in comparison with serum from healthy sheep to detect any differences in their protein profiles. The strategy adopted involved testing and validating the potential application of the nascent peptide spectral library built from circulating acellular proteome of sheep.⁶ The overall intention here was to target protein analytes in significant laboratory findings of the submitted sheep serum samples based on the accompanying clinical data from UQ’s School of Veterinary Science laboratory. The relevant analytes for proteomics studies for this report are found in Additional file 1.⁴⁸ Among the analytes of interest were gamma-glutamyl transferase (GGT), total protein (TP), serum albumin (Alb), aspartate aminotransferase (AST), alkaline phosphatase (ALP), creatine kinase (CK), fibrinogen and some other analytes that were listed in the reference ranges, for example globulin and serum amyloid. Another important analyte – at least in ruminants – that was not listed in the laboratory panel is haptoglobin.²⁵ These analytes may have appeared as isoforms of their known protein moieties. In some cases however, the genes coding for these proteins were not well defined, possibly due to the consideration that the sheep genome still remains to be fully annotated.²⁶^,²⁷ Nevertheless, it is feasible and practical to use draft genome sequences for the interpretation of MS/MS proteomics data without the need for tedious genome annotation.²⁸ This does not negate the point that a well-annotated genome is necessary in order to name genes and proteins unambiguously.²⁹^–³²

Differentially expressed serum proteins between healthy and sick sheep were either known or considered to be potential biomarker candidates for diagnosing and monitoring disease states, and this could also provide leads for further research endeavours. There are two important layers to determining the significance of the differences between sick and healthy sheep serum proteomes observed in this work: a) the degree of confidence in the observed difference from an analytical standpoint and, b) the biological plausibility that such differences truly correlate with a disease state. The discussion that follows looks at these preceding two aspects.

A problem of working with the present approach of identifying proteins is that when protein isoforms are not explicitly named and identified, it introduces ambiguity that makes it challenging to identify even well-known proteins as in the instances presented in this report. For example, in the set of samples utilised in this study, Hb was identified as A0A0F6YFJ0 (Beta-C globin). There were also two protein IDs representing GGT which comprised of W5QCX2 (Transglutaminase 1) and W5PB04 (Transglutaminase 3). Serum albumin was identified as W5PWE9 which is yet to be characterised for sheep in UniProtKB. It should be noted that one isoform of albumin, P14639 (Albumin), that was not identified in this set of experiments, has already been characterised in UniProtKB. The ID feature for Aspartate aminotransferase (AST or GOT) was W5PS88. There was no protein ID directly matching ALP in this dataset. However, there were three other phosphatases including W5P195 (Dual specificity phosphatase 14), W5P3B0 (Phosphatidylinositol-3,4,5-trisphosphate 5-phosphatase) and W5QE45 (Serine/threonine-protein phosphatase) that were identified. Meanwhile, CK was represented by two IDs: W5PJ69 (Creatine kinase) and W5NQ67 (Creatine kinase). Fibrinogen was represented by four proteins including W5Q5H8 (Fibrinogen alpha chain), W5NQ46 (Fibrinogen beta chain), W5Q5A6 (Fibrinogen gamma chain) and W5PH03 (Fibrinogen like 1). Serum amyloid was represented by W5PJG0 (Serum amyloid A protein), P42819 (Serum amyloid A protein) and W5PJR0 (Serum amyloid A protein). Haptoglobin was identified as W5P0Q4 (Haptoglobin).

In veterinary clinical pathology, the determination of the quantity of protein in plasma or serum has always been traditionally based on the amount of albumin and globulin fractions,³³^,³⁴ and yet there is more to the two broad protein groups when it comes to deep proteomic analysis. For example, in the present set of protein identifications, the globulin fraction was represented by at least 12 protein families or groups. The globulin protein fraction comprised of W5P812 (Protein AMBP), W5PSC1 (Ig-like domain-containing protein), W5Q0R1 (Sex hormone binding globulin), W5NQW4 (Alpha-1-macroglobulin-like), W5PGT9 (Ig mu chain C region), W5QI15 (Ig-like domain-containing protein), W5PSK4 (Ig-like domain-containing protein), A4ZVY9 (Beta-2-microglobulin) W5NSA6 (Alpha-2-macroglobulin), W5PPQ8 (Joining chain of multimeric IgA and IgM), P49920 (Corticosteroid-binding globulin) and the various VH region immunoglobulins such as A2P2G1 and A2P2I1. Being able to identify more proteins in the different fractions of serum proteins gives this approach an edge over traditional clinicopathological protein assays.

The experiments in this report utilised samples from single sheep case reports as proof-of-concept that SWATH-MS technology can be applied to identify vast numbers of proteins and their alterations in clinical samples of veterinary patients. The comparison of clinical cases (sick sheep serum), versus normalised serum from a large number of healthy sheep is a classical approach typical of biomarker discovery or detection studies.³⁵ With this foundation in place, it is possible to establish a standard for routine proteomic analysis of serum samples submitted to laboratories in future, once a specific baseline serum proteome of sheep has been optimised using far much larger numbers of samples. This method also has the potential to be used for identifying different protein species that show differences between samples. The pooling of samples from healthy sheep was conducted on the premise that the pooled sample would provide a representative proteome of normal sheep serum. Similarly, the rationale of using a pooled sample from sick sheep followed the same premise that that this could provide a representative picture of all the proteins present in the sick sheep, particularly proteins that might only be abundant in specific disease states that could not be detectable in serum samples from healthy individuals. The inherent downside of pooling samples is that this strategy disregards the biological variation of the individuals the samples are drawn from,³⁶ by capturing for example, the ‘average’ proteome profile across a population. And also in the present results, 26 protein IDs were not detected when samples from sick sheep were pooled, as opposed to them being analysed individually (Figure 2B). The reason behind this observation is not immediately clear, but batch effects during sample analysis could be advanced as a contributing factor,³⁷ or possibly unknown effects of pooling samples for downstream analysis. It also follows that the use of pooled serum, at least from normal individuals to act as a control, is an accepted scientific practice for normalising samples that has also been widely used in some human studies.³⁵^,³⁶

The evaluation of chromatographic features of peptides from samples – TICs in this case – was a practical, inexpensive and straightforward visual way of comparing ion intensities between small numbers of samples as previously reported elsewhere.³⁸ Here, only two sample TICs were loaded per displayed panel on PeakView®: one from a sick sheep versus a normalised serum of normal sheep (Additional file 2⁴⁸). The TICs represented a measure of relative abundance of the peptides detected.³⁹ It follows therefore that during interpretation of the TICs, it should be considered that they represented summed signals from the sample as well as background noise.⁴⁰ The use of TICs in addition to SWATH-MS data extraction has a high depth of analysis whilst factoring in the wide dynamic range in analytes, is a useful strategy especially when considerable differences are expected between samples³⁸ as in the present study. There were distinct differences between TIC profiles of sick sheep serum samples compared to serum profiles from normal sheep; this difference was more marked in the sample from the sheep with scabby mouth lesions (Additional file 2 ⁴⁸). In the majority of the analysed samples, the TIC intensities of sick sheep serum samples were generally higher than those of normal sheep serum, except for the ill-thrift and the ill lamb cases. A possible explanation for this observation could be malnutrition/starvation in the chronically ill sheep, since decreased food intake depletes protein generally and this was even more crucial in the neonatal sheep (ill lamb) that was likely still reliant on colostrum.⁴¹^–⁴³ Proteins in circulation are either derived from synthesis or from degradation, but considering that the small intestine is the most important organ for synthesis and absorption of proteins in ruminants,⁴⁴ the detected proteins in starving sheep are therefore most likely to have been derived from tissue degradation. As a result, there was probably a comparatively very little protein reserve to elicit the higher protein intensities compared to the other relatively acute disease case samples. As for the analysis of the pooled sample of sick sheep (SCi511), the TIC profile was generally above that of pooled normal serum up to approximately 38 min when it switched below the pooled normal serum TIC, before peaking again at the hydrophobic end of the chromatogram. An explanation for this observation remains to be established and is open to further interpretation.

The differences in protein species between individual serum samples from sick sheep could have been due to the different aetiological and pathological factors of the presenting condition in the different sheep that could have stimulated the production of different proteins or their alterations. Without follow-up clinical data on the cases or a definitive diagnosis, it is not possible to fully determine and to explain these differences. Also, clinical cases were from diverse populations which may have contributed to different serum proteomic profiles as proteomes are known to be dynamic.⁴⁵

Different numbers of proteins were identified from the different samples and to some extent, different protein species. The sample from the sheep with scabby foot and mouth lesions had the highest number of protein IDs (Figure 2A). A substantial number of protein IDs were common between sick and the normalised sample from healthy sheep as represented by the ‘pooled healthy control and sick overlap’ legend item in Figure 2A. The protein yield of 210 IDs from the pooled sick sheep sample (SCi511) was much lower than expected as compared to the IDs from all other individual samples considered together. It is not immediately evident as to why this was the case. This relatively low numbers of protein IDs could potentially have been due to sub-optimal tryptic digestion of this particular pooled sample, loss of peptides during sample preparation or probably due to unknown factors that influence the pooling of samples.

There were differences in protein intensities between the samples as evident in the ten most abundant proteins in each analytical case (Figure 3). The comparison of protein intensities of the pooled serum from healthy sheep (SCi512) versus pooled serum from sick sheep (SCi511) revealed considerable differences for some proteins, with each of the points on the graph representing a protein (Figure 4A). Fewer proteins (210 IDs) were identified in sick sheep sample pool, as compared to 335 IDs in healthy sheep serum pool. It should have also been possible to have a relative quantitation of the proteins centred on their intensities benchmarked on the calibration of the intensities from healthy sheep serum based on the principle of area under curve of the TICs⁴⁶^,⁴⁷ (Additional file 2⁴⁸). Using this approach, could have demonstrated the feasibility of determining what proteins were either upregulated or downregulated in the clinical serum samples from sick sheep.

The GO term classification provided an overall picture of the identified proteins in both sick and healthy sheep serum (Figure 5). An important observation from the protein pathway analysis is that it provided 11 clearer protein pathways that peaked in sick sheep serum (Figure 6). The roles of some of these protein pathways in sheep remain to be determined, but they have been studied considerably in humans and other model species such as mice. It is not unreasonable therefore to suggest that with homology, there is a translational potential in the observations made in this study from sick sheep that can be learnt from.

There were limitations in this experimental study. The first one was that there was only one biological sample for each clinical case study — which is not unusual for case studies. Since each clinical case was unique, to be able to adequately test current observations, three or more technical replicates (or MS injections for that matter) out of each biological sample would have improved the meaningfulness of the data score plot. The second limitation was that it was not possible to re-analyse samples drawn from the same sheep after they recovered (i.e. to act as their own controls), and/or at different stages on the disease to determine if chronicity had an effect on protein species or their alteration – considering that sick sheep samples were opportunistically utilised from laboratory archives at UQ.

Conclusion

The use of SWATH-MS analysis successfully identified proteins and enabled protein profiles from serum samples of sick and healthy sheep to be distinguished. It was possible to detect some established clinical biochemical analytes, for example aspartate aminotransferase, creatinine kinase, haemoglobin, serum amyloid A, fibrinogen, haptoglogin, members of gamma-glutamyl transferase group, serum albumin and various proteins of the globulin fraction. Protein pathway analysis provided useful information on the expression profiles of protein groups between sick and healthy sheep by giving them a biological meaning. There was a downside to this approach, for example the detection of alkaline phosphatase – an important analyte that was not obviously identifiable in this dataset. Some detected proteins of sheep such haemoglobin, haptoglobin and some isoforms of serum amyloid did not have distinct gene names as compared to homologous counterparts in related species. Nevertheless, this proof-of-concept study has demonstrated that with a known benchmark or standard samples from healthy individuals, it is feasible to determine relative protein alterations in clinical samples. This approach could have a place in veterinary clinical pathology in future.

Author’s contributions

The author originated the concept of the study, conducted the experiments with assistance from staff of The University of Queensland and Queensland University of Technology, and wrote the final manuscript.

Data availability

Underlying data

PRIDE: Towards a comprehensive targeted proteogenomic assay repository for the liquid fraction of sheep blood. Accession number PXD005002; https://www.ebi.ac.uk/pride/archive/projects/PXD005002.

PRIDE: Application of a novel peptide spectral library using swath analysis for the quantitation of proteins in ex-diagnostic clinical serum samples from sheep. Accession number PXD005077; https://www.ebi.ac.uk/pride/archive/projects/PXD005077.

Extended data

figshare: Supplementary data: Interrogation of an ovine serum peptide spectral library to annotate ambiguous clinicopathological biomarkers using data-independent acquisition. https://doi.org/10.6084/m9.figshare.21546999.v8 ⁴⁸

This project contains the following extended data:

‐ Additional file 1. Clinical data of sick sheep.docx
‐ Additional file 2. Chromatographic features of sheep serum peptide samples.png
‐ Additional file 3. UniProtKB entries of identified proteins.docx
‐ Additional file 4. Relative abundance of proteins in healthy and sick sheep serum.docx
‐ Additional file descriptive notes.txt

Data are available under the terms of the Creative Commons Attribution 4.0 International license (CC-BY 4.0).

Acknowledgements

Thank you to the Australian Red Cross Blood Service for making available the primary samples used in this manuscript under Material Supply Agreement 15-03QLD-19 and staff of The University of Queensland and Queensland University of Technology who provided research degree supervisory oversight for this study.

References

1. Gillet LC, Navarro P, Tate S, et al.: Targeted data extraction of the MS/MS spectra generated by data-independent acquisition: a new concept for consistent and accurate proteome analysis. Mol. Cell. Proteomics. 2012; 11: O111.016717. PubMed Abstract | Publisher Full Text
2. Aebersold R, Mann M: Mass spectrometry-based proteomics. Nature. 2003; 422: 198–207. Publisher Full Text
3. Tang X, Meng Q, Gao J, et al.: Label-free Quantitative Analysis of Changes in Broiler Liver Proteins under Heat Stress using SWATH-MS Technology. Sci. Rep. 2015; 5: 15119. PubMed Abstract | Publisher Full Text
4. Perez-Patiño C, Barranco I, Parrilla I, et al.: Extensive dataset of boar seminal plasma proteome displaying putative reproductive functions of identified proteins. Data Brief. 2016; 8: 1370–1373. PubMed Abstract | Publisher Full Text
5. Hernández-Castellano LE, Ferreira AM, Nanni P, et al.: The goat (Capra hircus) mammary gland secretory tissue proteome as influenced by weight loss: A study using label free proteomics. J. Proteome. 2016; 145: 60–69. PubMed Abstract | Publisher Full Text
6. Chemonges S: Development of a novel encyclopaedic peptide spectral library using the liquid fraction of sheep blood. The University of Queensland, School of Veterinary Science;2018. PhD. Thesis.
7. Noor Z, Paramasivan S, Ghodasara P, et al.: Leveraging homologies for cross species plasma proteomics in ungulates using data-independent acquisition. J. Proteome. 2021; 250: 104384. Publisher Full Text
8. Huang Q, Yang L, Luo J, et al.: SWATH enables precise label-free quantification on proteome scale. Proteomics. 2015; 15: 1215–1223. PubMed Abstract | Publisher Full Text
9. Yan Z, Zhou Y, Shan Y, et al.: Label-free quantification of differentially expressed proteins in mouse liver cancer cells with high and low metastasis rates by a SWATH acquisition method. Sci. China. Chem. 2014; 57: 718–722. Publisher Full Text
10. Furini G, Schroeder N, Huang L, et al.: FP013QUANTITATIVE PROTEOMICS BY SWATH-MS REVEALS AN ENDOSOMAL TRANSPORT HUB OF PROTEINS WHICH INTERACT WITH TG2 IN A MODEL OF EXPERIMENTAL KIDNEY FIBROSIS. Nephrol. Dial. Transplant. 2015; 30: iii70. Publisher Full Text
11. Williams EG, Wu Y, Jha P, et al.: Systems proteomics of liver mitochondria function. Science. 2016; 352. Publisher Full Text
12. Enk VM, Baumann C, Tho LKC, et al.: Regulation of highly homologous major urinary proteins in house mice quantified with label-free proteomic methods. Mol. BioSyst. 2016; 12: 3005–3016. PubMed Abstract | Publisher Full Text | Free Full Text
13. Chemonges S, Gupta R, Mills PC, et al.: Characterisation of the circulating acellular proteome of healthy sheep using LC-MS/MS-based proteomics analysis of serum. Proteome Sci. 2017; 15: 11.
14. Kulak NA, Pichler G, Paron I, et al.: Minimal, encapsulated proteomic-sample processing applied to copy-number estimation in eukaryotic cells. Nat. Methods. 2014; 11: 319–324. PubMed Abstract | Publisher Full Text
15. Yu Y, Smith M, Pieper R: A spinnable and automatable StageTip for high throughput peptide desalting and proteomics.2014.
16. Australian code of practice for the care and use of animals for scientific purposes. http
17. Hulsen T, de Vlieg J , Alkema W: BioVenn - a web application for the comparison and visualization of biological lists using area-proportional Venn diagrams. BMC Genomics. 2008; 9: 488. PubMed Abstract | Publisher Full Text
18. Deutsch EW, Mendoza L, Shteynberg D, et al.: A Guided Tour of the Trans-Proteomic Pipeline. Proteomics. 2010; 10: 1150–1159. PubMed Abstract | Publisher Full Text
19. MacLean B, MacCoss MJ, Tomazela DM, et al.: Skyline: an open source document editor for creating and analyzing targeted proteomics experiments. Bioinformatics. 2010; 26: 966–968. PubMed Abstract | Publisher Full Text
20. Röst HL, Rosenberger G, Navarro P, et al.: OpenSWATH enables automated, targeted analysis of data-independent acquisition MS data. Nat. Biotechnol. 2014; 32: 219–223. PubMed Abstract | Publisher Full Text
21. Gillet LC, Navarro P, Tate S, et al.: Targeted data extraction of the MS/MS spectra generated by data-independent acquisition: a new concept for consistent and accurate proteome analysis. Mol. Cell. Proteomics. 2012; 11(O111): O111.016717. Publisher Full Text
22. Vizcaino JA, Deutsch EW, Wang R, et al.: ProteomeXchange provides globally coordinated proteomics data submission and dissemination. Nat. Biotechnol. 2014; 32: 223–226. PubMed Abstract | Publisher Full Text
23. Vizcaino JA, Cote RG, Csordas A, et al.: The PRoteomics IDEntifications (PRIDE) database and associated tools: status in 2013. Nucleic Acids Res. 2013; 41: D1063–D1069. PubMed Abstract | Publisher Full Text
24. Mi H, Poudel S, Muruganujan A, et al.: PANTHER version 10: expanded protein families and functions, and analysis tools. Nucleic Acids Res. 2016; 44: D336–D342. PubMed Abstract | Publisher Full Text
25. Lepherd ML, Canfield PJ, Hunt GB, et al.: Haematological, biochemical and selected acute phase protein reference intervals for weaned female Merino lambs. Aust. Vet. J. 2009; 87: 5–11. PubMed Abstract | Publisher Full Text
26. International Sheep Genomics CArchibald AL, Cockett NE, et al.: The sheep genome reference sequence: a work in progress. Anim. Genet. 2010; 41: 449–453. PubMed Abstract | Publisher Full Text
27. Jiang Y, Zhang W, Stanton J-A, et al.: The sheep genome illuminates biology of the rumen and lipid metabolism. Science (New York, N.Y.). 2014; 344: 1168–1173. PubMed Abstract | Publisher Full Text
28. Armengaud J, Trapp J, Pible O, et al.: Non-model organisms, a species endangered by proteogenomics. J. Proteome. 2014; 105: 5–18. PubMed Abstract | Publisher Full Text
29. Goll J, Montgomery R, Brinkac LM, et al.: The Protein Naming Utility: a rules database for protein nomenclature. Nucleic Acids Res. 2010; 38: D336–D339. PubMed Abstract | Publisher Full Text
30. Ban N, Beckmann R, Cate JH, et al.: A new system for naming ribosomal proteins. Curr. Opin. Struct. Biol. 2014; 24: 165–169. PubMed Abstract | Publisher Full Text
31. Gray KA, Yates B, Seal RL, et al.: Genenames.org: the HGNC resources in 2015. Nucleic Acids Res. 2015; 43: D1079–D1085. Publisher Full Text
32. Fundel K, Zimmer R: Gene and protein nomenclature in public databases. BMC Bioinformatics. 2006; 7: 372. PubMed Abstract | Publisher Full Text
33. Alberghina D, Giannetto C, Vazzana I, et al.: Reference intervals for total protein concentration, serum protein fractions, and albumin/globulin ratios in clinically healthy dairy cows. J. Vet. Diagn. Investig. 2011; 23: 111–114. PubMed Abstract | Publisher Full Text
34. Alberghina D, Casella S, Vazzana I, et al.: Analysis of serum proteins in clinically healthy goats (Capra hircus) using agarose gel electrophoresis. Vet. Clin. Pathol. 2010; 39: 317–321. Publisher Full Text
35. Dayon L, Kussmann M: Proteomics of human plasma: A critical comparison of analytical workflows in terms of effort, throughput and outcome. EuPA Open Proteom. 2013; 1: 8–16. Publisher Full Text
36. Beck HC, Overgaard M, Melholt Rasmussen L: Plasma proteomics to identify biomarkers – application to cardiovascular diseases. Translational Proteomics. 2015; 7: 40–48. Publisher Full Text
37. Čuklina J, Lee CH, Williams EG, et al.: Diagnostics and correction of batch effects in large-scale proteomic studies: a tutorial. Mol. Syst. Biol. 2021; 17: e10240. Publisher Full Text
38. Schulze WX, Usadel B: Quantitation in mass-spectrometry-based proteomics. Annu. Rev. Plant Biol. 2010; 61: 491–516. Publisher Full Text
39. Di Girolamo F, Lante I, Muraca M, et al.: The Role of Mass Spectrometry in the “Omics” Era. Curr. Org. Chem. 2013; 17: 2891–2905. PubMed Abstract | Publisher Full Text
40. Cairns DA, Thompson D, Perkins DN, et al.: Proteomic profiling using mass spectrometry--does normalising by total ion current potentially mask some biological differences? Proteomics. 2008; 8: 21–27. PubMed Abstract | Publisher Full Text
41. Yvon M, Levieux D, Valluy MC, et al.: Colostrum protein digestion in newborn lambs. J. Nutr. 1993; 123: 586–596. PubMed Abstract | Publisher Full Text
42. Mellor DJ, Cockburn F: A comparison of energy metabolism in the new-born infant, piglet and lamb. Q. J. Exp. Physiol. 1986; 71: 361–379. PubMed Abstract | Publisher Full Text
43. Massimini G, Peli A, Boari A, et al.: Evaluation of assay procedures for prediction of passive transfer status in lambs. Am. J. Vet. Res. 2006; 67: 593–598. PubMed Abstract | Publisher Full Text
44. Neutze SA, Gooden JM, Oddy VH: Measurement of protein turnover in the small intestine of lambs. 1. Development of an experimental model. J. Agric. Sci. 1997; 128: 217–231. Publisher Full Text
45. Peng J, Gygi SP: Proteomics: the move to mixtures. J. Mass Spectrom. 2001; 36: 1083–1091. PubMed Abstract | Publisher Full Text
46. Kemperman RF, Horvatovich PL, Hoekman B, et al.: Comparative urine analysis by liquid chromatography-mass spectrometry and multivariate statistics: method development, evaluation, and application to proteinuria. J. Proteome Res. 2007; 6: 194–206. PubMed Abstract | Publisher Full Text
47. Milac TI, Randolph TW, Wang P: Analyzing LC-MS/MS data by spectral count and ion abundance: two case studies. Statistics and its interface. 2012; 5: 75–87. PubMed Abstract | Publisher Full Text
48. Chemonges S: Supplementary data: Interrogation of an ovine serum peptide spectral library to annotate ambiguous clinicopathological biomarkers using data-independent acquisition. figshare. Journal contribution. 2022. Publisher Full Text

Comments on this article Comments (0)

Version 1

VERSION 1 PUBLISHED 05 Dec 2022

Author details Author details

¹ School of Veterinary Science, The University of Queensland, Gatton, QLD, 4343, Australia
² Proteomics and Small Molecule Mass Spectrometry, Central Analytical Research Facility, Queensland University of Technology, Brisbane, QLD, 4000, Australia

Saul Chemonges
Roles: Conceptualization, Data Curation, Formal Analysis, Funding Acquisition, Investigation, Methodology, Project Administration, Resources, Software, Supervision, Validation, Visualization, Writing – Original Draft Preparation, Writing – Review & Editing

Competing interests

No competing interests were disclosed.

Grant information

This work was undertaken as part of author’s PhD which was financially supported by an Australian Postgraduate Award scholarship through The University of Queensland and personal resources. All the facilities for the experimental work were funded by a collaborative arrangement with Queensland University of Technology Central Analytical Research Facility (QUT-CARF) arranged by the author. Academic supervisors, laboratory personnel and other personnel involved in this work were paid employees of their respective institutions.

Article Versions (1)

version 1

Published: 05 Dec 2022, 11:1433

https://doi.org/10.12688/f1000research.128316.1

Copyright

© 2022 Chemonges S. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Download

Export To

metrics

	Views	Downloads
F1000Research	-	-
PubMed Central Data from PMC are received and updated monthly.	-	-

Citations

0

SEE MORE DETAILS

CITE

how to cite this article

Chemonges S. Interrogation of an ovine serum peptide spectral library to annotate ambiguous clinicopathological biomarkers using data-independent acquisition [version 1; peer review: 2 approved with reservations]. F1000Research 2022, 11:1433 (https://doi.org/10.12688/f1000research.128316.1)

NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.

track

receive updates on this article

Track an article to receive email alerts on any updates to this article.

Open Peer Review

Version 1

VERSION 1

PUBLISHED 05 Dec 2022

Views

5

Reviewer Report 01 Mar 2024

Uma Aryal, Purdue University, West Lafayette, Indiana, USA

Approved with Reservations

https://doi.org/10.5256/f1000research.140892.r243204

Data Independent Acquisition (DIA) is gaining traction in recent years and more and more laboratories are adopting these methods instead of the most widely used DDA method. Thus, analysis of ovine serum peptides spectral library is timely manuscript.
Overall, ... Continue reading

Data Independent Acquisition (DIA) is gaining traction in recent years and more and more laboratories are adopting these methods instead of the most widely used DDA method. Thus, analysis of ovine serum peptides spectral library is timely manuscript.
Overall, the manuscript is written well and easy to follow.
However, I have several comments or concerns, which I hope author will find useful and addressing them will improve the quality of the manuscript.
First, I found number of proteins identified in this study significantly low compared to other similar studies in other animals. The study identified only 335 and 236 proteins in healthy and sick serum samples, which seems significantly low. In our own experience using DDA, we can easily profile close to 1000 proteins in mouse or dog plasma or serum samples. Some clarification in the low number of protein IDs would be helpful. My understanding is that DIA will give more IDs compared to DDA method.

My second concern is the healthy and sick samples. It appears that healthy samples were obtained commercially while sick samples were obtained clinically that presented a range of clinical situations. I am not sure comparing commercial samples with the clinically collected samples is a good way to profile protein differences. This may present significant heterogeneity. Based on my own experience, animals raised in different environment and fed different food exhibit significant differences in serum proteome profile. It would be nice to describe these limitations briefly in a separate paragraph.
It was not clear to me how protein families grouped together. Was this done based on unique and/or shared peptides sequences?
Also, the term "unique proteins" may be misleading. It would be better to refer them as "proteins identified only in xxx, etc.
Also, low overlap of proteins between healthy and sick sheep also caught my attention. I assume some of these differences could have been originated from samples coming from totally different environment and in addition to disease states, other factors might have contributed these discrepancies. Again, a separate section summarizing all these limitations will be good service to the readers.
Details of downstream data analysis and statistical analysis to identify differentially abundant proteins was not provided. I suggest the author to provide this information in detail.

Is the work clearly and accurately presented and does it cite the current literature?

Yes
Is the study design appropriate and is the work technically sound?

Partly
Are sufficient details of methods and analysis provided to allow replication by others?

Partly
If applicable, is the statistical analysis and its interpretation appropriate?

Partly
Are all the source data underlying the results available to ensure full reproducibility?

Yes
Are the conclusions drawn adequately supported by the results?

Yes

References

1. Messner CB, Demichev V, Bloomfield N, Yu JSL, et al.: Ultra-fast proteomics with Scanning SWATH.Nat Biotechnol. 2021; 39 (7): 846-854 PubMed Abstract | Publisher Full Text

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: Quantitative proteomics, phosphoproteomic, application of proteomics to senescence or aging biology

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

CITE

Report a concern

Respond or Comment

Views

9

Reviewer Report 15 Feb 2024

Bart Van Puyvelde, Ghent University, Ghent, Belgium

Approved with Reservations

https://doi.org/10.5256/f1000research.140892.r238118

In the manuscript titled "Interrogation of an ovine serum peptide spectral library to annotate ambiguous clinicopathological biomarkers using data-independent acquisition," the author outlines a SWATH data analysis workflow for serum samples from 20 healthy and 7 diseased sheep. While the ... Continue reading

In the manuscript titled "Interrogation of an ovine serum peptide spectral library to annotate ambiguous clinicopathological biomarkers using data-independent acquisition," the author outlines a SWATH data analysis workflow for serum samples from 20 healthy and 7 diseased sheep. While the application of Data-Independent Acquisition (DIA) has been predominantly explored in human specimens, its adoption in veterinary contexts remains limited. I commend the author for the comprehensive detailing of the data analysis workflow; however, the reliance on vendor-commercialized software may constrain its widespread applicability. It is noteworthy to consider the availability of several freely accessible academic pipelines such as DIA-NN and EncyclopeDIA, which have demonstrated superior performance compared to the SWATH MicroApp in terms of identified proteins. Addressing these alternative tools and their potential advantages could enhance the manuscript's relevance. Additionally, given the usage of a pre-existing spectral library, acknowledging its inherent limitations would provide a more comprehensive perspective. A suggestion to expand the impact of the publication involves comparing the performance of various pipelines, including the mentioned academic alternatives, utilizing predicted spectral libraries. This approach has the potential to significantly augment the number of identified peptides and proteins. By incorporating such comparisons, the relevance and impact of the research could be further elevated.

While reviewing the total ion chromatograms of various runs, a significant variance in sample loading caught my attention. Notably, sample B appears to have a sample load approximately 10 times higher than the other samples. Without any batch correction, such as normalization, performing a meaningful differential analysis becomes challenging. Moreover, the inclusion of only seven serum samples from individuals with different underlying diseases, as emphasized by the author, raises concerns about the study's robustness. In my view, a more controlled and expansive disease cohort is essential to draw conclusive insights regarding the annotation of clinicopathological biomarkers from this study. If the authors intend to include such a claim in the title, it would necessitate substantial additional work, which may often be impractical. Therefore, I recommend modifying the paper's title to accurately reflect the study's scope and limitations.

Some of the figures (Figure 2A, 5 and 6) can also be moved to supplementary as they don't really add a lot of information.

Minor comments:
-Page 5: cold acetone (20°C) => -20°C
-Page 5: is centrifugation at 4,000g for 2 minutes sufficient to precipitate the proteins?
-Page 5: Aceto nitrile => acetonitrile
-Page 5: What kind of nanoLC instrument was used? Provide some more details
-Page 5: 36 windowsl => remove the -l
-Page 5: instrument as described in earlier => remove the "in"
- Page 14 :Why did the author decide to pool the samples prior to trypsin digestion instead of pooling them after digestion?"

Is the work clearly and accurately presented and does it cite the current literature?

Partly
Is the study design appropriate and is the work technically sound?

Partly
Are sufficient details of methods and analysis provided to allow replication by others?

Yes
If applicable, is the statistical analysis and its interpretation appropriate?

Not applicable
Are all the source data underlying the results available to ensure full reproducibility?

Yes
Are the conclusions drawn adequately supported by the results?

Yes

References

1. Van Puyvelde B, Willems S, Gabriels R, Daled S, et al.: Removing the Hidden Data Dependency of DIA with Predicted Spectral Libraries.Proteomics. 2020; 20 (3-4): e1900306 PubMed Abstract | Publisher Full Text
2. Searle BC, Swearingen KE, Barnes CA, Schmidt T, et al.: Generating high quality libraries for DIA MS with empirically corrected peptide predictions.Nat Commun. 2020; 11 (1): 1548 PubMed Abstract | Publisher Full Text
3. Demichev V, Messner CB, Vernardis SI, Lilley KS, et al.: DIA-NN: neural networks and interference correction enable deep proteome coverage in high throughput.Nat Methods. 2020; 17 (1): 41-44 PubMed Abstract | Publisher Full Text
4. Searle BC, Pino LK, Egertson JD, Ting YS, et al.: Chromatogram libraries improve peptide detection and quantification by data independent acquisition mass spectrometry.Nat Commun. 2018; 9 (1): 5128 PubMed Abstract | Publisher Full Text

Competing Interests: No competing interests were disclosed.

Reviewer Expertise: SWATH-MS, Viral detection, Plasma peptide/protein biomarker screening

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

CITE

Report a concern

Respond or Comment

Comments on this article Comments (0)

Version 1

VERSION 1 PUBLISHED 05 Dec 2022

Open Peer Review

Reviewer Status

Reviewer Reports

	Invited Reviewers
	1	2
Version 1 05 Dec 22	read	read

Bart Van Puyvelde, Ghent University, Ghent, Belgium
Uma Aryal, Purdue University, West Lafayette, USA

Comments on this article

All Comments(0)

Add a comment

Sign up for content alerts

Browse by related subjects

Back to all reports

Reviewer Report

5 Views

01 Mar 2024 | for Version 1

Uma Aryal, Purdue University, West Lafayette, Indiana, USA

5 Views Cite this report Responses(0)

Approved With Reservations

Data Independent Acquisition (DIA) is gaining traction in recent years and more and more laboratories are adopting these methods instead of the most widely used DDA method. Thus, analysis of ovine serum peptides spectral library is timely manuscript.
Overall, the manuscript is written well and easy to follow.
However, I have several comments or concerns, which I hope author will find useful and addressing them will improve the quality of the manuscript.
First, I found number of proteins identified in this study significantly low compared to other similar studies in other animals. The study identified only 335 and 236 proteins in healthy and sick serum samples, which seems significantly low. In our own experience using DDA, we can easily profile close to 1000 proteins in mouse or dog plasma or serum samples. Some clarification in the low number of protein IDs would be helpful. My understanding is that DIA will give more IDs compared to DDA method.

My second concern is the healthy and sick samples. It appears that healthy samples were obtained commercially while sick samples were obtained clinically that presented a range of clinical situations. I am not sure comparing commercial samples with the clinically collected samples is a good way to profile protein differences. This may present significant heterogeneity. Based on my own experience, animals raised in different environment and fed different food exhibit significant differences in serum proteome profile. It would be nice to describe these limitations briefly in a separate paragraph.
It was not clear to me how protein families grouped together. Was this done based on unique and/or shared peptides sequences?
Also, the term "unique proteins" may be misleading. It would be better to refer them as "proteins identified only in xxx, etc.
Also, low overlap of proteins between healthy and sick sheep also caught my attention. I assume some of these differences could have been originated from samples coming from totally different environment and in addition to disease states, other factors might have contributed these discrepancies. Again, a separate section summarizing all these limitations will be good service to the readers.
Details of downstream data analysis and statistical analysis to identify differentially abundant proteins was not provided. I suggest the author to provide this information in detail.

Is the work clearly and accurately presented and does it cite the current literature?

Yes
Is the study design appropriate and is the work technically sound?

Partly
Are sufficient details of methods and analysis provided to allow replication by others?

Partly
If applicable, is the statistical analysis and its interpretation appropriate?

Partly
Are all the source data underlying the results available to ensure full reproducibility?

Yes
Are the conclusions drawn adequately supported by the results?

Yes

References

1. Messner CB, Demichev V, Bloomfield N, Yu JSL, et al.: Ultra-fast proteomics with Scanning SWATH.Nat Biotechnol. 2021; 39 (7): 846-854 PubMed Abstract | Publisher Full Text

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

Quantitative proteomics, phosphoproteomic, application of proteomics to senescence or aging biology

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

Respond to this report

Responses (0)

Back to all reports

Reviewer Report

9 Views

15 Feb 2024 | for Version 1

Bart Van Puyvelde, Ghent University, Ghent, Belgium

9 Views Cite this report Responses(0)

Approved With Reservations

In the manuscript titled "Interrogation of an ovine serum peptide spectral library to annotate ambiguous clinicopathological biomarkers using data-independent acquisition," the author outlines a SWATH data analysis workflow for serum samples from 20 healthy and 7 diseased sheep. While the application of Data-Independent Acquisition (DIA) has been predominantly explored in human specimens, its adoption in veterinary contexts remains limited. I commend the author for the comprehensive detailing of the data analysis workflow; however, the reliance on vendor-commercialized software may constrain its widespread applicability. It is noteworthy to consider the availability of several freely accessible academic pipelines such as DIA-NN and EncyclopeDIA, which have demonstrated superior performance compared to the SWATH MicroApp in terms of identified proteins. Addressing these alternative tools and their potential advantages could enhance the manuscript's relevance. Additionally, given the usage of a pre-existing spectral library, acknowledging its inherent limitations would provide a more comprehensive perspective. A suggestion to expand the impact of the publication involves comparing the performance of various pipelines, including the mentioned academic alternatives, utilizing predicted spectral libraries. This approach has the potential to significantly augment the number of identified peptides and proteins. By incorporating such comparisons, the relevance and impact of the research could be further elevated.

While reviewing the total ion chromatograms of various runs, a significant variance in sample loading caught my attention. Notably, sample B appears to have a sample load approximately 10 times higher than the other samples. Without any batch correction, such as normalization, performing a meaningful differential analysis becomes challenging. Moreover, the inclusion of only seven serum samples from individuals with different underlying diseases, as emphasized by the author, raises concerns about the study's robustness. In my view, a more controlled and expansive disease cohort is essential to draw conclusive insights regarding the annotation of clinicopathological biomarkers from this study. If the authors intend to include such a claim in the title, it would necessitate substantial additional work, which may often be impractical. Therefore, I recommend modifying the paper's title to accurately reflect the study's scope and limitations.

Some of the figures (Figure 2A, 5 and 6) can also be moved to supplementary as they don't really add a lot of information.

Minor comments:
-Page 5: cold acetone (20°C) => -20°C
-Page 5: is centrifugation at 4,000g for 2 minutes sufficient to precipitate the proteins?
-Page 5: Aceto nitrile => acetonitrile
-Page 5: What kind of nanoLC instrument was used? Provide some more details
-Page 5: 36 windowsl => remove the -l
-Page 5: instrument as described in earlier => remove the "in"
- Page 14 :Why did the author decide to pool the samples prior to trypsin digestion instead of pooling them after digestion?"

Is the work clearly and accurately presented and does it cite the current literature?

Partly
Is the study design appropriate and is the work technically sound?

Partly
Are sufficient details of methods and analysis provided to allow replication by others?

Yes
If applicable, is the statistical analysis and its interpretation appropriate?

Not applicable
Are all the source data underlying the results available to ensure full reproducibility?

Yes
Are the conclusions drawn adequately supported by the results?

Yes

References

1. Van Puyvelde B, Willems S, Gabriels R, Daled S, et al.: Removing the Hidden Data Dependency of DIA with Predicted Spectral Libraries.Proteomics. 2020; 20 (3-4): e1900306 PubMed Abstract | Publisher Full Text
2. Searle BC, Swearingen KE, Barnes CA, Schmidt T, et al.: Generating high quality libraries for DIA MS with empirically corrected peptide predictions.Nat Commun. 2020; 11 (1): 1548 PubMed Abstract | Publisher Full Text
3. Demichev V, Messner CB, Vernardis SI, Lilley KS, et al.: DIA-NN: neural networks and interference correction enable deep proteome coverage in high throughput.Nat Methods. 2020; 17 (1): 41-44 PubMed Abstract | Publisher Full Text
4. Searle BC, Pino LK, Egertson JD, Ting YS, et al.: Chromatogram libraries improve peptide detection and quantification by data independent acquisition mass spectrometry.Nat Commun. 2018; 9 (1): 5128 PubMed Abstract | Publisher Full Text

Competing Interests

No competing interests were disclosed.

Reviewer Expertise

SWATH-MS, Viral detection, Plasma peptide/protein biomarker screening

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

Respond to this report

Responses (0)

[1] 1. Gillet LC, Navarro P, Tate S, et al.: Targeted data extraction of the MS/MS spectra generated by data-independent acquisition: a new concept for consistent and accurate proteome analysis. Mol. Cell. Proteomics. 2012; 11: O111.016717. PubMed Abstract | Publisher Full Text

[2] 2. Aebersold R, Mann M: Mass spectrometry-based proteomics. Nature. 2003; 422: 198–207. Publisher Full Text

[3] 3. Tang X, Meng Q, Gao J, et al.: Label-free Quantitative Analysis of Changes in Broiler Liver Proteins under Heat Stress using SWATH-MS Technology. Sci. Rep. 2015; 5: 15119. PubMed Abstract | Publisher Full Text

[4] 4. Perez-Patiño C, Barranco I, Parrilla I, et al.: Extensive dataset of boar seminal plasma proteome displaying putative reproductive functions of identified proteins. Data Brief. 2016; 8: 1370–1373. PubMed Abstract | Publisher Full Text

[5] 5. Hernández-Castellano LE, Ferreira AM, Nanni P, et al.: The goat (Capra hircus) mammary gland secretory tissue proteome as influenced by weight loss: A study using label free proteomics. J. Proteome. 2016; 145: 60–69. PubMed Abstract | Publisher Full Text

[6] 6. Chemonges S: Development of a novel encyclopaedic peptide spectral library using the liquid fraction of sheep blood. The University of Queensland, School of Veterinary Science;2018. PhD. Thesis.

[7] 7. Noor Z, Paramasivan S, Ghodasara P, et al.: Leveraging homologies for cross species plasma proteomics in ungulates using data-independent acquisition. J. Proteome. 2021; 250: 104384. Publisher Full Text

[8] 8. Huang Q, Yang L, Luo J, et al.: SWATH enables precise label-free quantification on proteome scale. Proteomics. 2015; 15: 1215–1223. PubMed Abstract | Publisher Full Text

[9] 9. Yan Z, Zhou Y, Shan Y, et al.: Label-free quantification of differentially expressed proteins in mouse liver cancer cells with high and low metastasis rates by a SWATH acquisition method. Sci. China. Chem. 2014; 57: 718–722. Publisher Full Text

[10] 10. Furini G, Schroeder N, Huang L, et al.: FP013QUANTITATIVE PROTEOMICS BY SWATH-MS REVEALS AN ENDOSOMAL TRANSPORT HUB OF PROTEINS WHICH INTERACT WITH TG2 IN A MODEL OF EXPERIMENTAL KIDNEY FIBROSIS. Nephrol. Dial. Transplant. 2015; 30: iii70. Publisher Full Text

[11] 11. Williams EG, Wu Y, Jha P, et al.: Systems proteomics of liver mitochondria function. Science. 2016; 352. Publisher Full Text

[12] 12. Enk VM, Baumann C, Tho LKC, et al.: Regulation of highly homologous major urinary proteins in house mice quantified with label-free proteomic methods. Mol. BioSyst. 2016; 12: 3005–3016. PubMed Abstract | Publisher Full Text | Free Full Text

[13] 13. Chemonges S, Gupta R, Mills PC, et al.: Characterisation of the circulating acellular proteome of healthy sheep using LC-MS/MS-based proteomics analysis of serum. Proteome Sci. 2017; 15: 11.

[14] 14. Kulak NA, Pichler G, Paron I, et al.: Minimal, encapsulated proteomic-sample processing applied to copy-number estimation in eukaryotic cells. Nat. Methods. 2014; 11: 319–324. PubMed Abstract | Publisher Full Text

[15] 15. Yu Y, Smith M, Pieper R: A spinnable and automatable StageTip for high throughput peptide desalting and proteomics.2014.

[16] 16. Australian code of practice for the care and use of animals for scientific purposes. http

[17] 17. Hulsen T, de Vlieg J , Alkema W: BioVenn - a web application for the comparison and visualization of biological lists using area-proportional Venn diagrams. BMC Genomics. 2008; 9: 488. PubMed Abstract | Publisher Full Text

[18] 18. Deutsch EW, Mendoza L, Shteynberg D, et al.: A Guided Tour of the Trans-Proteomic Pipeline. Proteomics. 2010; 10: 1150–1159. PubMed Abstract | Publisher Full Text

[19] 19. MacLean B, MacCoss MJ, Tomazela DM, et al.: Skyline: an open source document editor for creating and analyzing targeted proteomics experiments. Bioinformatics. 2010; 26: 966–968. PubMed Abstract | Publisher Full Text

[20] 20. Röst HL, Rosenberger G, Navarro P, et al.: OpenSWATH enables automated, targeted analysis of data-independent acquisition MS data. Nat. Biotechnol. 2014; 32: 219–223. PubMed Abstract | Publisher Full Text

[21] 21. Gillet LC, Navarro P, Tate S, et al.: Targeted data extraction of the MS/MS spectra generated by data-independent acquisition: a new concept for consistent and accurate proteome analysis. Mol. Cell. Proteomics. 2012; 11(O111): O111.016717. Publisher Full Text

[22] 22. Vizcaino JA, Deutsch EW, Wang R, et al.: ProteomeXchange provides globally coordinated proteomics data submission and dissemination. Nat. Biotechnol. 2014; 32: 223–226. PubMed Abstract | Publisher Full Text

[23] 23. Vizcaino JA, Cote RG, Csordas A, et al.: The PRoteomics IDEntifications (PRIDE) database and associated tools: status in 2013. Nucleic Acids Res. 2013; 41: D1063–D1069. PubMed Abstract | Publisher Full Text

[24] 24. Mi H, Poudel S, Muruganujan A, et al.: PANTHER version 10: expanded protein families and functions, and analysis tools. Nucleic Acids Res. 2016; 44: D336–D342. PubMed Abstract | Publisher Full Text

[25] 25. Lepherd ML, Canfield PJ, Hunt GB, et al.: Haematological, biochemical and selected acute phase protein reference intervals for weaned female Merino lambs. Aust. Vet. J. 2009; 87: 5–11. PubMed Abstract | Publisher Full Text

[26] 26. International Sheep Genomics CArchibald AL, Cockett NE, et al.: The sheep genome reference sequence: a work in progress. Anim. Genet. 2010; 41: 449–453. PubMed Abstract | Publisher Full Text

[27] 27. Jiang Y, Zhang W, Stanton J-A, et al.: The sheep genome illuminates biology of the rumen and lipid metabolism. Science (New York, N.Y.). 2014; 344: 1168–1173. PubMed Abstract | Publisher Full Text

[28] 28. Armengaud J, Trapp J, Pible O, et al.: Non-model organisms, a species endangered by proteogenomics. J. Proteome. 2014; 105: 5–18. PubMed Abstract | Publisher Full Text

[29] 29. Goll J, Montgomery R, Brinkac LM, et al.: The Protein Naming Utility: a rules database for protein nomenclature. Nucleic Acids Res. 2010; 38: D336–D339. PubMed Abstract | Publisher Full Text

[30] 30. Ban N, Beckmann R, Cate JH, et al.: A new system for naming ribosomal proteins. Curr. Opin. Struct. Biol. 2014; 24: 165–169. PubMed Abstract | Publisher Full Text

[31] 31. Gray KA, Yates B, Seal RL, et al.: Genenames.org: the HGNC resources in 2015. Nucleic Acids Res. 2015; 43: D1079–D1085. Publisher Full Text

[32] 32. Fundel K, Zimmer R: Gene and protein nomenclature in public databases. BMC Bioinformatics. 2006; 7: 372. PubMed Abstract | Publisher Full Text

[33] 33. Alberghina D, Giannetto C, Vazzana I, et al.: Reference intervals for total protein concentration, serum protein fractions, and albumin/globulin ratios in clinically healthy dairy cows. J. Vet. Diagn. Investig. 2011; 23: 111–114. PubMed Abstract | Publisher Full Text

[34] 34. Alberghina D, Casella S, Vazzana I, et al.: Analysis of serum proteins in clinically healthy goats (Capra hircus) using agarose gel electrophoresis. Vet. Clin. Pathol. 2010; 39: 317–321. Publisher Full Text

[35] 35. Dayon L, Kussmann M: Proteomics of human plasma: A critical comparison of analytical workflows in terms of effort, throughput and outcome. EuPA Open Proteom. 2013; 1: 8–16. Publisher Full Text

[36] 36. Beck HC, Overgaard M, Melholt Rasmussen L: Plasma proteomics to identify biomarkers – application to cardiovascular diseases. Translational Proteomics. 2015; 7: 40–48. Publisher Full Text

[37] 37. Čuklina J, Lee CH, Williams EG, et al.: Diagnostics and correction of batch effects in large-scale proteomic studies: a tutorial. Mol. Syst. Biol. 2021; 17: e10240. Publisher Full Text

[38] 38. Schulze WX, Usadel B: Quantitation in mass-spectrometry-based proteomics. Annu. Rev. Plant Biol. 2010; 61: 491–516. Publisher Full Text

[39] 39. Di Girolamo F, Lante I, Muraca M, et al.: The Role of Mass Spectrometry in the “Omics” Era. Curr. Org. Chem. 2013; 17: 2891–2905. PubMed Abstract | Publisher Full Text

[40] 40. Cairns DA, Thompson D, Perkins DN, et al.: Proteomic profiling using mass spectrometry--does normalising by total ion current potentially mask some biological differences? Proteomics. 2008; 8: 21–27. PubMed Abstract | Publisher Full Text

[41] 41. Yvon M, Levieux D, Valluy MC, et al.: Colostrum protein digestion in newborn lambs. J. Nutr. 1993; 123: 586–596. PubMed Abstract | Publisher Full Text

[42] 42. Mellor DJ, Cockburn F: A comparison of energy metabolism in the new-born infant, piglet and lamb. Q. J. Exp. Physiol. 1986; 71: 361–379. PubMed Abstract | Publisher Full Text

[43] 43. Massimini G, Peli A, Boari A, et al.: Evaluation of assay procedures for prediction of passive transfer status in lambs. Am. J. Vet. Res. 2006; 67: 593–598. PubMed Abstract | Publisher Full Text

[44] 44. Neutze SA, Gooden JM, Oddy VH: Measurement of protein turnover in the small intestine of lambs. 1. Development of an experimental model. J. Agric. Sci. 1997; 128: 217–231. Publisher Full Text

[45] 45. Peng J, Gygi SP: Proteomics: the move to mixtures. J. Mass Spectrom. 2001; 36: 1083–1091. PubMed Abstract | Publisher Full Text

[46] 46. Kemperman RF, Horvatovich PL, Hoekman B, et al.: Comparative urine analysis by liquid chromatography-mass spectrometry and multivariate statistics: method development, evaluation, and application to proteinuria. J. Proteome Res. 2007; 6: 194–206. PubMed Abstract | Publisher Full Text

[47] 47. Milac TI, Randolph TW, Wang P: Analyzing LC-MS/MS data by spectral count and ion abundance: two case studies. Statistics and its interface. 2012; 5: 75–87. PubMed Abstract | Publisher Full Text

[48] 48. Chemonges S: Supplementary data: Interrogation of an ovine serum peptide spectral library to annotate ambiguous clinicopathological biomarkers using data-independent acquisition. figshare. Journal contribution. 2022. Publisher Full Text

Interrogation of an ovine serum peptide spectral library to annotate ambiguous clinicopathological biomarkers using data-independent acquisition

Abstract

Keywords

List of abbreviations

Introduction

Methods

Ethics

Experimental samples and workflow

Figure 1. Schematic experimental workflow showing serum samples and SWATH-MS analysis pipeline.

Sample preparation

nanoLC-nanoESI-MS/MS analysis

Chromatography

Data independent acquisition (DIA)

Data analysis

Access to raw proteomics data files

Results

Chromatography of peptides, protein identification and comparisons between sick and healthy sheep

Figure 2. Proteins identified using SWATH-MS in sick and healthy sheep serum samples.

Differential abundance of proteins

Figure 3. Ten most abundant proteins identified by SWATH-MS in sick and healthy sheep serum samples.

Figure 4. Disposition of protein intensities in healthy versus sick serum samples.

Gene ontology annotation and pathway analysis of identified proteins

Figure 5. Gene ontology (GO) term annotation of proteins identified in serum of sick versus healthy sheep serum.

Figure 6. Pathway analysis of proteins identified in serum of sick, versus healthy sheep serum.

A note about the sick sheep serum samples used for this experimental workflow

Discussion

Conclusion

Author’s contributions

Data availability

Underlying data

Extended data

Acknowledgements

References

Comments on this article Comments (0)

Open Peer Review

Comments on this article Comments (0)

Open Peer Review

Reviewer Status

Reviewer Reports

Comments on this article

Browse by related subjects

Competing Interests Policy

Stay Updated