ALL Metrics
-
Views
-
Downloads
Get PDF
Get XML
Cite
Export
Track
Method Article

Accurate cytogenetic biodosimetry through automated dicentric chromosome curation and metaphase cell selection

[version 1; peer review: 2 approved]
* Equal contributors
PUBLISHED 09 Aug 2017
Author details Author details
OPEN PEER REVIEW
REVIEWER STATUS

This article is included in the Interactive Figures collection.

Abstract

Accurate digital image analysis of abnormal microscopic structures relies on high quality images and on minimizing the rates of false positive (FP) and negative objects in images. Cytogenetic biodosimetry detects dicentric chromosomes (DCs) that arise from exposure to ionizing radiation, and determines radiation dose received based on DC frequency. Improvements in automated DC recognition increase the accuracy of dose estimates by reclassifying FP DCs as monocentric chromosomes or chromosome fragments. We also present image segmentation methods to rank high quality digital metaphase images and eliminate suboptimal metaphase cells. A set of chromosome morphology segmentation methods selectively filtered out FP DCs arising primarily from sister chromatid separation, chromosome fragmentation, and cellular debris. This reduced FPs by an average of 55% and was highly specific to these abnormal structures (≥97.7%) in three samples. Additional filters selectively removed images with incomplete, highly overlapped, or missing metaphase cells, or with poor overall chromosome morphologies that increased FP rates. Image selection is optimized and FP DCs are minimized by combining multiple feature based segmentation filters and a novel image sorting procedure based on the known distribution of chromosome lengths. Applying the same image segmentation filtering procedures to both calibration and test samples reduced the average dose estimation error from 0.4 Gy to <0.2 Gy, obviating the need to first manually review these images. This reliable and scalable solution enables batch processing for multiple samples of unknown dose, and meets current requirements for triage radiation biodosimetry of high quality metaphase cell preparations.

Keywords

Ionizing Radiation, Radiation Exposure, Computer-Assisted Image Processing, Software, Quality Control, Cytogenetics, Metaphase, Mass casualty incidents

Abbreviations

ADCI, Automated Dicentric Chromosome Identifier and dose estimator; CNL, Canadian Nuclear Laboratories; DC, Dicentric chromosome; DCA, Dicentric chromosome assay; FP, False positive; HC, Health Canada; K–S, Kolmogorov–Smirnov test; MC, Monocentric chromosome; MC-DC SVM, Monocentric-Dicentric Support Vector Machine; ML, Machine learning; SCS, Sister chromatid separation; SD, Standard deviation; SVM, Support Vector Machine; TP, True positive.

Introduction

Analysis of microscopy images of metaphase cells demonstrates the damaging effects of ionizing radiation and can be used to measure the amount of radiation absorbed. The gold standard method for radiation biodosimetry, the dicentric chromosome assay (DCA), uses the frequency of aberrant dicentric chromosomes (DCs) formed after radiation exposure to determine the dose received by an individual (in Gy). While some aspects of the assay have been successfully streamlined, the overall throughput remains limited by the labour-intensive identification of DCs in many cells. This affects the timely estimation of radiation exposure, especially for testing multiple affected individuals in a large accident or a mass casualty nuclear event1,2.

The selection of images of adequate quality for accurate identification of the chromosome damage is a prerequisite to automating DCA. The decision to select or exclude particular microscope images based on the quality of metaphase cells has been performed manually, which is impractical given the increasing sizes of datasets produced by automated image capture systems. Image quality assessment has traditionally compared new data relative to reference images3, complex mathematical models4, or distortions from a training set recognized by machine learning5. Such generic approaches are not appropriate in the DCA because features tailored for ranking morphologically diverse chromosome images are not easily generalized as entropic or other measures applying frequency filters to intensity distributions. We demonstrate that quality chromosomal images can be selected for the DCA using supervised, image segmentation rules aimed at categorizing the preferred images and eliminating false positive (FP) DCs.

We previously developed the Automated Dicentric Chromosome Identifier and Dose Estimator (ADCI) software to automate DC detection and estimate radiation exposures611. Briefly, ADCI uses image segmentation techniques to extract possible chromosomes. Preprocessing image filters remove most but not all non-chromosomal objects (e.g. debris, nuclei, overlapping chromosomes). Each remaining object is regarded as a single, intact, post-replication “chromosome-like” object. Each of these objects is processed by a series of algorithms710 which create a quantitative profile measuring chromosome width from one telomere to the other. Potential centromere locations (“centromere candidates”) are identified at constrictions in the width profile (Figure 1)12. Machine learning (ML) modules then use features sourced from computer vision analysis of each chromosome to classify centromeres and dicentric chromosomes6,11. An initial Support Vector Machine (SVM) ranks potential centromere candidates in each chromosome according to their corresponding distances to the hyperplane that distinguishes centromeres from non-centromeric constrictions; then, another SVM scores the chromosome as either monocentric (MC) or dicentric (DC), using features derived from the top two centromere candidates.

53d0ac74-507e-436b-aacb-c9e5d6ca5106_figure1.gif

Figure 1. Chromosome images processed by ADCI, annotated with key segmentation features.

(A) Monocentric and (B) Dicentric chromosome. Chromosome contour overlaid in green, long-axis centreline in red. For reference, the minimum bounding box of the contour is also displayed in magenta and green. Yellow and cyan markers on the centerline indicate the top-ranked and 2nd-ranked centromere candidates, respectively, and all other candidates are indicated with a dark blue marker. For each centromere candidate, their corresponding width traceline (crossing through the candidate and running approximately orthogonal to the centerline) are displayed in dark blue. The arc lengths of width tracelines running down the centerline (not all shown) are used to construct a chromosomal width profile. Note that for the monocentric chromosome (A), the top-ranked candidate correctly labels the true centromere location, while the 2nd-ranked candidate labels a minor non-centromeric constriction. Meanwhile, for the dicentric example (B), both the top and 2nd-ranked candidates label true centromere locations. By comparing features extracted from the top 2 candidates (including width and pixel intensity information), the software will determine if the chromosome is monocentric or dicentric.

Samples from blood exposed ex vivo to known radiation doses are processed by ADCI to construct a dose-response calibration curve. The average frequency of DCs per cell in dose calibrated samples, i.e. the radiation response, is fit to a linear-quadratic function. Responses for test samples exposed to unknown radiation levels can then be analyzed with this function to estimate the corresponding doses.

We noticed that metaphase cell images of inconsistent, lower quality can affect the accuracy of dose estimation by ADCI. Previous studies evaluated the efficacy of ADCI at chromosome classification and dose estimation10,11. While the sensitivity (recall) for DCs was acceptable (~70%) and relatively constant at all radiation exposure levels, precision showed a strong dependence on dose. Chromosome misclassification, in particular FPs comprised a larger fraction of DCs at low (≤1 Gy) relative to high (3–4 Gy) doses; at 1 Gy, FPs could outnumber true positive (TP) dicentrics by a factor of 4 to 5. Consequently, ADCI-processed samples exhibited a reduced range of accurate responses to radiation compared to manually scored samples. Although use of the same algorithm to derive the calibration curve compensates for some of these differences, reliability of the dose estimation ultimately hinges on DC classification accuracy. As DCs are always greatly outnumbered by MCs in a cell (background frequency in normal, unexposed individuals is one DC per 1000 cells6), this study focuses on improving the distinction between TP and FP DCs without compromising sensitivity.

FPs reflect inadequacies in interpreting certain chromosome morphologies or non-chromosomal objects as DCs. To improve overall DC classification accuracy, FPs must be selectively identified and removed without limiting TP counts. We first investigated FPs to categorize problematic cases and devised a set of post-processing object segmentation filters to eliminate them. Then, to ensure consistent overall performance within a set of images from a sample, statistical filters were developed to remove poor quality cells. Frequently, these images either lacked any chromosomes or contained incomplete metaphase cells, misclassified interphase or micro-nuclei as metaphase cells, or incorrectly segmented sister chromatids as individual chromosomes. Chromatid separation and chromosome fragments increase the object count in an image, but the pixel areas of said objects are smaller than actual chromosomes. Chromosome-overlaps reduce the object count, but their areas tend to exceed those of discrete chromosomes. Each proposed statistical filter was tested individually, and the best performing filters were applied cumulatively, then tested on cytogenetic dosimetry data at various radiation exposures. Effects of these filters on classification performance and dose estimation were then evaluated with dose-blinded, irradiated samples obtained from biodosimetry laboratories at Health Canada (HC) and Canadian Nuclear Laboratories (CNL).

This hybrid approach selects images based on optimal metaphase cell image properties and customized segmentation, and by identification and elimination of FP DCs. These improvements in ADCI ensure timely, reproducible, and accurate quantitative assessment of acute radiation exposure.

Methods

Cytogenetic image data were obtained at biodosimetry laboratories at HC and CNL, according to International Atomic Energy Agency (IAEA) guidelines. Blood samples were irradiated by an XRAD-320 (Precision X-ray, North Branford, CT) at Health Canada and processed at both laboratories. Samples were obtained with written informed consent from anonymous donors by the HC laboratory as approved by the Health Canada and Public Health Agency of Canada’s Research Ethics Board of protocol: “Development of Biological Dosimeters for Ionizing Radiation.” Peripheral blood lymphocyte samples were cultured, fixed, and stained at each facility according to established protocols2,12. Metaphase images from Giemsa-stained slides were captured independently by each laboratory using an automated microscopy system (Metasystems, Newton, MA). One set of metaphase images from CNL and two sets from HC (Table 1) were used for development and initial testing of the proposed algorithms. After image processing by ADCI, the identified DCs were manually reviewed and of the numbers of TPs or FPs were tallied. Calibration curves were prepared based on 6 samples of known radiation dose (Table 2). An additional 6 samples11 were initially blinded to the actual radiation exposures as test samples (Table 3). Test samples were exposed to a range of radiation doses bounded by the doses of samples used to construct the calibration curve. The sample naming convention is the laboratory name followed by the sample identifier, e.g. HC1Gy signifies the 1 Gy calibration sample prepared at HC, whereas CNL-INTC03S04 represents the test sample, INTC03S04, from an international laboratory inter-comparison exercise that was prepared at CNL (which had been exposed to 1.8 Gy).

Table 1. Metaphase image sets used in development and validation of DC filters.

Dataset nameHC-mixed*
HC-lowHC-highCNL-low
Lab sourceHealth
Canada
Health
Canada
Canadian
Nuclear
Laboratories
Radiation dose (Gy)13-41
No. of images198216256
No. of
chromosomes**
8041869710583
No. of TPs2016314
No. of FPs976182

*HC-mixed refers to a combined set of all images from both the HC-low + HC-high datasets

**Defined as number of valid segmented objects defined by ADCI.

Table 2. Metaphase image samples used in construction of dose calibration curves.

Sample
(HC or
CNL)
Physical
dose
(Gy)
No. of
images,
HC
No. of
images,
CNL
0Gy0731798
0.5Gy0.510541532
1Gy11566841
2Gy21147996
3Gy312121188
4Gy49091635
5Gy51019-

Table 3. Metaphase image samples used in evaluation of dose assessment performance.

Sample
name
Physical
dose
(Gy)
No. of
images, HC
preparation
No. of
images, CNL
preparation
INTC03S013.1540500
INTC03S082.3637500
INTC03S101.4708n/a
INTC03S041.8600957
INTC03S052.811361527
INTC03S073.4477735

n/a: data were not available.

Each calibration and test sample consisted of images from the same individual. HC provided an unselected set of all metaphase cells that were automatically recognized and captured using the default classifier of the microscopy system. By contrast, CNL previously manually curated a set of 500 high quality metaphase cell images, selected according to IAEA guidelines12, which deem metaphase cells analyzable based on chromosome count, distribution and morphology.

1) ADCI settings and metaphase image data

ADCI software (V1.0)11 was used for DC detection and dose prediction, setting the tuning parameter, σ, for the MC-DC SVM to 1.5. Software libraries were initially developed as available MATLAB scripts to test segmentation filters that detected FP DCs; once validated, C++ versions of these libraries were integrated into ADCI. For validation, two low dose and one high dose dataset were used (Table 1; the combination from HC comprise the HC-mixed image set).

2) Filtering out false positive objects

Quantitative morphological filters to delineate FP DCs were created and tested (i-viii, below). Each filter is designed to detect one or more of 6 FP morphological subclasses of FPs (described in Supplementary File 1). The FPs result from either I) excessive sister chromatid separation (SCS), II) fragmented or III) overlapping chromosomes, IV) chromosomes with highly variable boundaries or contours, V) non-chromosomal cellular debris, or VI) errors in the machine learning algorithms that detect centromere candidates and distinguish MCs from DCs.

The set of N chromosomes in any metaphase image is denoted by {c1,…,cN} and c* denotes the predicted DC of interest. Each filter (designated i – viii, below) classifies c* as either a TP or FP by comparing its filter score against a heuristically-defined threshold that is independent of laboratory source. Quantitative thresholds were established for each filter to eliminate the maximum number of FPs, without compromising detection of TP. Due to the relatively low frequency of DCs in the samples, maximal detection of TPs is essential for accurate dose estimation. Since FPs generally produce lower filter scores than TPs (i.e. lower area, lower width, less oblong footprint, more asymmetrical), FPs were selected by eliminating candidate DCs with scores below each threshold. The corresponding FP filter scores were calculated for all DCs in the HC-mixed image set (Table 1), and a heuristic threshold (to 2 significant digits; see below) was set to the minimum value observed in TPs for each filter. Thresholds for filters vi, vii and viii were calculated by repeating the same procedure on a set of 244 TP chromosomes from the MC-DC SVM training set6, and the final thresholds were set to the lower of each pair of values.

  • i. Area filter: A(c) denotes the pixel area occupied by chromosome c (Figure 2B). c* was classified as FP, if A(c*)/median({A(c1),…,A(cN)}) < 0.74 or as TP otherwise. This filter targets small chromosomes commonly displaying SCS (Figure 2A) and chromosome fragments.

  • ii. Mean width filter: Wmean(c) denotes the mean value of the width profile of chromosome c (Figure 2C). c* was classified as FP if Wmean(c*)/median({Wmean(c1),…, Wmean(cN)}) < 0.80 or as TP otherwise. This filter targets SCS and chromosome fragments.

  • iii. Median width filter: Wmed(c) denotes the median value of the width profile of chromosome c (Figure 2C). c* was classified as FP if Wmed(c*)/median({Wmed(c1),…, Wmed(cN)}) < 0.77, or as TP otherwise. This filter targets SCS and chromosome fragments.

  • iv. Max width filter: Wmax(c) denotes the maximum value of the width profile of chromosome c (Figure 2C). c* was classified as FP if Wmax(c*)/median({Wmax(c1),…, Wmax(cN)}) < 0.83, or as TP otherwise. This filter targets SCS and chromosome fragments.

  • v. Centromere width filter: Wcent(c) denotes the width of chromosome c at the position of the top-ranked centromere candidate (Figure 2C). c* was classified as FP if Wcent(c*)/median({Wcent(c1),…,Wcent(cN)}) < 0.72, or as TP otherwise. This filter targets SCS and chromosome fragments.

  • vi. Oblongness filter: S(c) denotes the pair of side lengths of the minimum bounding rectangle enclosing the contour of chromosome c (Figure 2D). c* was classified as FP if 1 − min(S(c*))/max(S(c*)) < 0.28, or as TP otherwise. This filter targets acrocentric chromosomes with SCS and some cases of overlapping chromosomes.

  • vii. Contour symmetry filter: L(c) denotes the pair of arc lengths of contour halves produced by partitioning the contour of chromosome c at its centerline endpoints (Figure 2E). c* was classified as FP if min(L(c*))/max(L(c*)) < 0.51, or as TP otherwise. This filter targets SCS.

  • viii. Intercandidate contour symmetry filter: LC(c) denotes the pair of arc lengths of the contour regions of chromosome c that run between the traceline endpoints of its top 2 centromere candidates (Figure 2F). c* was classified as FP if min(LC(c*))/max(LC(c*)) < 0.42, or as TP otherwise. This filter targets SCS and some instances of overlapping chromosomes.

53d0ac74-507e-436b-aacb-c9e5d6ca5106_figure2.gif

Figure 2. A visualization of DC filter scores for a particular false positive (FP).

DC filters are defined in Methods. (A) A processed FP (chromosome with SCS), with contour in green, centerline in red, top-ranked centromere candidate and its width traceline in yellow, 2nd-ranked centromere candidate and its width traceline in cyan. (B) Filter i: Thresholded binary image of the chromosome is used to calculate pixel area (in white). (C) Filters ii–v: Width profile along centerline is shown in red (horizontal axis plots centerline location, vertical axis plots width), with mean width in green (filter ii), median width in blue (filter iii), max width in magenta (filter iv), and width of top centromere candidate in yellow (filter v). (D) Filter vi: Contour in blue and its minimum bounding rectangle in magenta and green. (E) Filter vii: Partitioning of contour at centerline endpoints (intersection of red line with contour) into two segments, green and blue. (F) Filter VIII: Traceline endpoints of top 2 centromere candidates (intersection of yellow and cyan lines with contour) are used to partition contour into 4 segments (1 blue, 1 green, 2 magenta); relative arc lengths of blue and green segments are taken into consideration.

Determination of optimal filter subset: The same chromosome segmentation features were present in different segmentation filters, usually in combination with other elements (i.e. width for filters ii–v, contour symmetry for vi–viii) and/or targeted the same morphological subclass (notably, SCS). Thus, the “optimal” filter subset (termed “FP filters”) was defined as the subset of filters that maximized reclassification of the maximum number of FPs, while minimizing redundant detection of the same FPs. The performance for a given set of filters was the cumulative percentage of FPs removed by any of its filters, based on the HC-mixed set of images (Table 1). Using a forward selection approach, individual filters were added iteratively to identify those that produced the largest improvement in performance.

Modifications to ADCI: After chromosome processing and MC-DC SVM classification11 but prior to dose determination, all DC chromosomes inferred by ADCI were analyzed with the FP filters. DCs classified as FPs by any of the filters were reclassified and the remaining TP DCs were used for dose determination. The contours of DCs that were reclassified as MCs are outlined in yellow in the ADCI metaphase image viewer11 (Figure 3; top centre).

53d0ac74-507e-436b-aacb-c9e5d6ca5106_figure3.gif

Figure 3. Cell image viewer in ADCI demonstrating example of a corrected false positive (FP) DC.

Graphical User Interface for viewing cell images within a sample processed by ADCI11. Valid segmented objects (generally chromosomes, but occasionally nuclei or debris) are shown with coloured contours. Red contours indicate predicted DCs, yellow contours indicate chromosomes that were initially classified as DC and then reclassified by the FP filters (example at 12 o’clock), green contours indicate predicted MCs, and blue contours indicate objects that could not be further processed after image segmentation. Below the cell image, options were added to allow manual inclusion or exclusion of images within a sample from dose determination.

3) Dose estimation analysis

In ADCI, a pre-computed dose-response calibration curve is also used to estimate radiation absorbed in samples with unknown whole body exposures11. For a given sample, the radiation response is the ratio of the number of DCs detected to the number of selected metaphase cells. Calibration curves can be generated either from a set of samples of known exposures either by determining the response for each sample automatically with ADCI, or by entering the corresponding response from manually scored samples, and fitting the dose-response paired data to a linear-quadratic curve by regression. Because sample preparation protocols can vary and affect results, dose estimation of test samples (of unknown exposures) were performed with calibration curves generated with data from the same laboratory11.

The impact of segmentation filters to remove FPs on calibration curves was determined for the 0, 0.5, 1, 2, 3 and 4 Gy calibration samples. Radiation doses were estimated for CNL and HC test samples using the HC calibration curve after applying the same FP filters (Table 6).

4) Effect of filtering on manually image selected HC data

We compared HC calibration curves derived from manually curated samples with the FP filters either enabled or disabled to assess the impact of image selection on dose accuracy (Table 2). The criteria for manually curated HC samples were similar to the manual image selection performed by CNL. These images required: A) a complete complement of approximately 46 chromosomes, >40 segmented objects, <5 segmented objects from different nuclei if multiple nuclei present; B) exclusion of metaphase cells with “harlequin” hemi-stained chromosomes (indicative of multiple rounds of division after radiation exposure) that distort true DC frequencies10; C) images with <5 incorrectly-segmented chromosomes (chromosome overlaps indicating poor spreading), fragments (indicating sister chromatid separation) and overly-noisy contours (indicating poor image contrast); and D) an adequate degree of chromosome condensation. Depending on the stage of metaphase arrest, the degree of chromosome condensation can differ1,13. Prometaphase cells have longer chromosomes, are less rigid, exhibit greater overlap and less well-defined centromere constrictions, all of which pose significant challenges for automated chromosome classifiers1,14. Metaphase images with longer, thinner chromosomes (roughly corresponding to >550-band level14) were also excluded.

A minimum sample size of 500 cells per dose was adopted from IAEA recommendations12. Cell images selected from HC samples with automatic morphology filtering (see Methods section #5) were compared with a high quality set of images that were manually identified using the ADCI microscope viewer. For each sample, consecutive images meeting all criteria were evaluated manually until a sufficient number of cells were accrued. DC classifications were hidden during image selection to minimize bias. After generation of the curated HC calibration curves, the radiation doses of the three HC test samples (Table 3) were re-estimated on the new curves, with and without the FP filters enabled.

5) Automated removal of suboptimal images by morphology filtering

Manual selection of images assures consistency and reliability of metaphase data, which increases accuracy in DC analysis. Exclusion of lower quality images was automated in ADCI, since it was expected to reduce the number of FP DCs, thereby more accurately estimating radiation exposures.

We derived a set of image selection filters, implemented as available Python scripts, by segmenting features (I-VI, below) that eliminate metaphase cells in a sample with characteristics that increased the number of FPs:

  • I. Length-width ratio filter (LW) is based on the average length-width ratio of all chromosomes in an image. For a given chromosome c in a given image I containing N chromosomes, L(c,I) denotes the arc length of the centerline of c, Wmean(c,I) denotes the mean value of the width profile of c, SD is the standard deviation on Wmean(c,I), and T denotes the threshold value of SD common to all of these filters that distinguishes acceptable from outlier images. MW(I) is defined as mean{L(c1,I)/Wmean(c1,I),…,L(cN,I)/Wmean(cN,I)}. I* is removed if MW(I*) > mean{MW(I1),…,MW(IM)} + T×SD{MW(I1),…,MW(IM)}.

  • II. Centromere candidate density filter (CD) counts occurrences of centromere candidates in chromosomes and eliminates images containing chromosomes with a high density of candidate centromeres. For a given chromosome c in image I containing N chromosomes, L(c,I) denotes the arc length of the centerline of c, and Ncent(c,I) denotes the number of centromere candidates along c. CD(I) is defined as the mean{Ncent(c1,I)/L(c1,I),…,Ncent(cN,I)/L(cN,I)}. I* is removed if CD(I*) > mean{CD(I1),…,CD(IM)} + T×SD{CD(I1),…,CD(IM)}.

  • III. Contour finite difference filter (FD) represents the smoothness of contours of segmented objects in an image. It eliminates images with non-chromosomal objects with smooth contours, such as nuclei or micronuclei. For a given chromosome c in a given image I containing N chromosomes, WPD(c,I) denotes the set of first differences of the normalized width profile of c (range normalized to interval [0,1]). WD(I) is defined as the mean{mean{abs{WPD(c1,I)}},…,mean{abs{WPD(cN,I)}}}. I* is removed if WD(I*) < mean{WD(I1),…,WD(IM)} – T×SD{WD(I1),…,WD(IM)}.

  • IV. Total object count (ObjCount) filter is based on the number of all objects detected in an image. Values lying outside of a threshold range are rejected to eliminate images with multiple metaphases or excessive cellular debris. Based on empirical analyses, the suggested object count range falls within the interval [40, 60].

  • V. Segmented object count (SegObjCount) filter is based on the number of objects processed by the gradient vector flow7 (GVF) algorithm in an image. It is applied in the same way as filter IV. The suggested range for the object count interval is [35, 50].

  • VI. Classified object ratio (ClassifiedRatio) filter is derived from the ratio of objects recognized as chromosomes to the total number of segmented objects. It excludes images in which ADCI fails to process the majority of chromosomes. An image is removed if the ratio is less than either 0.6 or 0.7, which is determined by the desired level of stringency for this filter.

Filters I and II detect cells in prometaphase (having relatively long and thin chromosomes), with prominent sister chromosome separation, and with highly bent and twisted chromosomes. Filter III detects overly-smooth contours characterized by images containing intact nuclei and otherwise incomplete chromosome sets. The total object count (IV) and segmented object count filters V enrich for nearly normal metaphase images of approximately 46 chromosomes. These filters are then used to exclude images with extreme object counts. Filter VI selects images based on effectiveness of chromosome recognition by ADCI.

Image level filters I-III are based on the z-scores of different properties and comprise all objects in an image. For metaphase image I* in a sample containing M images, {I1,...,IM}, {c1,…,cN} denotes the set of N chromosomes within image I*. This SD value was determined to be 1.5 by varying T and applying these filters to the HC 2Gy calibration sample (Table 2). The corresponding thresholds for filters IV-VI were also derived from testing multiple samples.

Image ranking by combining image selection filters: Applying these filters sequentially to the same image distinguished the metaphase images used for dose estimation from lower quality images. Features consisting of counts, ratios and Z-scores for image filters I-VI were linearly combined to globally assess image quality. The combined score is one representation of the degree to which a particular image deviates from the population in a sample:

Combined Z Score =w(LW)×z(LW)+w(CD)×z(CD) – w(FD) ×z(FD)+w(ObjCount)×|z(ObjCount) | +w(SegObjCount)×|z(SegObjCount) |- w(ClassifiedRatio)×z(ClassifiedRatio)

Smaller Combined Z Scores represent higher quality images. Longer and thinner chromosomes in the image will increase the LW score, whereas bent and twisted chromosomes increase the CD term. Decreased chromosome concavity results in a higher FD score. The object and segmented object counts and their respective Z scores are related to chromosome distribution, and the level of sister chromatid separation in an image. These terms contribute to higher Combined Z Scores for images exhibiting either incomplete cells, multiple cells or severe sister chromatid separation. The Classified Ratio terms produce high scores for images that the algorithm does not process accurately. Each feature has a positive free parameter, weight, to adjust its contribution to the total score. Weights are determined by evaluating many possible weights using a grid search technique, and selecting those that minimize the error in curve calibration. The optimal weights for calibration samples are expected to perform similarly on test samples exposed to unknown radiation levels, assuming that the calibration and test samples have comparable chromosome morphologies. The Combined Z Score, however, cannot be used to compare the overall qualities of different samples, as Z-scores are normalized within each sample.

Image comparisons based on chromosome length distributions: The previously described tests use image morphology as the primary consideration in assessing metaphase image quality. The most common problems in lower quality metaphase cells are severe sister chromatid separation, excessive chromosome overlap, fragments of chromosomes in image segmentation, and multiple cells or incomplete cells in the same image. These often result in changes in either the number or the sizes of segmented objects. These tests do not account for the known relationships between the chromosomes in a cell with a nearly normal karyotype.

We derived a novel quality measure based on the observation that lengths and areas of chromosome images (in pixels) are approximately proportional to the well-known base-pair counts for each human chromosome. By comparing the distribution of observed chromosome object lengths with this “gold standard” inferred from the lengths of chromosomes in the reference human genome sequence, the overall quality of chromosome segmentation can be assessed in each cell image. Excluding chromosome abnormalities, which result from radiation exposure and are randomly distributed among cells, individual chromosome lengths are approximated by their corresponding chromosome areas (in pixels), since the actual chromosome lengths are difficult to measure accurately. Once noisy non-chromosomal objects, nuclei and large overlapped chromosome clusters have been removed, the areas of each remaining object are then determined relative to the total area of all chromosomes. The chromosomes in a metaphase cell are binned into three groups corresponding to the ISCN cytogenetic classification system16: The (AB) set comprises the A and B chromosome groups, (C) contains all of group C, and (DG) includes the D, E, F, and G groups. A single chromosome in group AB contains > 2.9% of nucleotides in the complete genome (determined by the shortest B group chromosome). A chromosome in category C has < 2.9% (determined by the longest C group chromosome), but > 2% (determined by the shortest C group chromosome) of nucleotides in the complete genome. Any chromosome in category DG contains < 2% of the complete genome (determined by the longest D group chromosome). These thresholds, 2.9% and 2% of the genome length, are respectively considered to be the maximum lengths of X and Y chromosomes. These thresholds are then applied to the areas of each chromosome object to count the number of chromosomes in each category in a metaphase image. An ideal metaphase image will have 10 AB chromosomes, 16 C chromosomes and 20 DG chromosomes in a female, and 10 AB chromosomes, 15 C chromosomes and 21 DG chromosomes in a male. We find that images with many overlapping chromosomes will have increased AB chromosome counts, while images with excessive sister chromatid separation generally have elevated DG chromosome counts. The quality of a metaphase image is determined by comparing the observed quantities of chromosomes in each group to the female or male standard. In practice, the result for an image is treated as a 3-element vector (AB, C, DG) and the Euclidean distance between the observed vector and the ideal standard is determined. Larger group bin distances correspond to less satisfactory images. We find that this measure appears to be universally applicable to metaphase images from different samples.

Sorting all images in a sample by either their Combined Z Score or by chromosome area Group Bin distance ranks cells according to metaphase quality for subsequent DC analyses. Image selection models can also be created in multiple stages by first qualifying images with chromosome morphology filters and then by selecting the top scoring images according to their Combined Z Scores or Group Bin distances.

6) Sample quality confidence measurement

Cytogenetic artifacts, such as sister chromatid separation and chromosome fragmentation, interfere with correct identification of DCs, thereby compromising reliability of dose estimates. This motivated the development of criteria to evaluate how well automated cell and FP curation improves sample quality. Samples exposed to low energy transfer, whole-body irradiation exhibit DC distributions that follow a Poisson distribution17 in all cells. The number of DC occurrences in a cell is constructed as a probability model of a sample. Each DC is assumed to be independent of other DCs in the first cell division and the rate at which DCs occur is constant for a single sample at a given radiation dose. The DC distribution detected either manually or by ADCI can be approximated by the Poisson statistic, with the λ parameter corresponding to the average number of DCs per cell in a sample.

Deviation from the Poisson distribution can occur when either some TPs are not accounted for or when FP DCs have not been reclassified. We evaluated post-processing sample quality by comparing the observed distribution of DCs in each sample (manual and automated) to its corresponding Poisson distribution. Observed and Poisson DC distributions were analyzed with the Pearson chi-squared goodness-of-fit test, which indicates the likelihood of rejecting the null hypothesis that the DCs were Poisson distributed. Only samples with ≥ 1 DC were analyzed. Very low p-values at or below α = 0.005 (99.5% confidence level) reject the null hypothesis and indicate lower quality samples.

Results

Application of chromosome morphology filters to remove FPs

False positive DCs (n=98) from a set of metaphase cells exposed to low dose radiation were classified into morphological subclasses to identify and ultimately eliminate these objects (described in Supplementary File 1). FP subclasses (Figure S1; subclasses A–F) included those exhibiting high levels of sister chromatid separation (A, n=51), chromosome fragmentation (B, n=10), overlap (C, n=17), noisy contour (D, n=5), cellular debris (E, n=4), as well as inaccurate recognition by either the centromere candidate10 or MC/DC6 machine learning algorithms (F, n=11).

Segmentation filters i–viii were applied to reclassify FPs in these images. Scale-invariant filters were tested to determine thresholds that selectively removed subclasses I-III without eliminating any TPs. Of the 51 SCS cases, 35 involved short, acrocentric chromosomes. FPs were distinguished from TPs based on either their lower relative pixel area or width (filters i–v), substantially non-oblong footprint (filter vi), or substantial contour asymmetry across the centerline (filters vii and viii). For filters i-v, normalization to median scores of other objects in the same image was performed, as well as normalization to other measures of central tendency (e.g. z-score, mean, and mode after binning scores). FPs could be eliminated for each morphological subclass (Table S1), with most of the segmentation filters acting on their targeted subclass. However, the effects of each filter were not exclusive to those subclasses.

To evaluate individual filter performance, the percentage of FPs removed by each filter was calculated for the HC-mixed image set (Table 4). A two-sample Kolmogorov–Smirnov test (K–S) was also performed for each filter (α=0.05) on the same data, where one group consisted of the filter scores of all TPs (n=183) and the other group consisted of the scores of all FPs (n=158). All 8 filters rejected the null hypothesis (Table 4), suggesting that these groups are distinguishable by thresholded segmentation filters. Applying the intercandidate contour symmetry filter (filter viii) achieved the largest overall reduction of FPs (44.9%), and eliminated the most SCS-induced FPs (43 of 51) in the low dose exposure set of metaphase images (Table S1). The max width filter (filter iv) yielded the next largest reduction in FPs (27.8%) and was the most efficient filter for detecting the fragmented chromosome class of FPs (8 of 10).

Table 4. Comparison of false positive (FP) discrimination ability between proposed DC filters.

DC filter
designation**
2-sample
K–S, TPs/FPs,
p-value*
FP removed
(%)*
i: Area2.2E-1822.2
ii: Mean width9.2E-1016.5
iii: Median width3.3E-914.6
iv: Max width3.3E-827.8
v: Centromere width8.8E-313.9
vi: Oblongness1.1E-2427.2
vii: Contour
symmetry
1.2E-810.1
viii: Intercandidate
contour symmetry
4.0E-3044.9

*Calculated from HC-mixed image set from Table 1.

**See Methods section 2 for description of each filter.

FPs were eliminated cumulatively by combining multiple segmentation filters. Since individual filters were separately thresholded to avoid removal of TPs, the inclusive disjunction (logical “or” operation) of multiple filters produced a stronger FP discriminator, but was not expected to reduce the TP count. Different combinations of filters were tested using forward selection (Table S2). The best performing filter set removed 58.9% of FPs and consisted of 5 FP filters (i + iv + v + vi + viii). Of these, iv and viii accounted for 54.4% of the FPs, with the others identifying the remaining FPs. Performance was evaluated with independent sets of metaphase images (Table 5), consisting of two HC image sets at low and high dose exposures (HC-low and HC-high) and one CNL image set exposed to low dose radiation (CNL-low). On average, 55 ± 9.6% of FPs were removed among all sets; individually, the filters eliminated 52% FPs from CNL-low, 66% from HC-low, and 48% FPs from HC-high. All TPs were retained in each of the sets after FP filtering (i.e. 100% specificity).

Table 5. Performance evaluation of false positive (FP) filters* on development and validation image datasets.

Image
set**
No. of
TP DCs removed
No. of
FP DCs
removed
FP
removed
(%)
HC-low06466
HC-high02948
CNL-low04352

*FP filters refer to the subset of filters i + iv + v + vi + viii (Methods section 2).

**See Table 1 for sample details.

Table 6. Dose estimation of test samples, with and without false positive (FP) filters* enabled.

HC samples**CNL samples**
Physical dose
(Gy)
3.12.31.41.82.83.4
Estimate,
unfiltered (Gy)
3.951.700.401.402.452.95
Estimate, FP
filters (Gy)
2.651.40.202.202.803.50

*FP filters refer to the subset of filters i + iv + v + vi + viii (Methods section 2). Calibration curve image data were not curated or filtered. HC samples were unselected (INTC03S01: 540 images, INTC03S08: 637 images, and INTC03S10: 708 images) and The CNL samples were previously manually selected (INTC03S04: 448 images; INTC03S05: 500 images; INTC03S07: 385 images).

**See Table 3 for sample details.

Dose calibration curves for HC and CNL data were generated in ADCI to investigate the impact of the FP filters on dose estimation accuracy (Figure 4). Dose estimation errors, the absolute difference between dose estimate by ADCI and the known physical dose, were determined for three CNL and three HC test samples from HC; then, results for uncorrected vs. FP-filtered images were compared (Table 6). In manually curated samples from CNL, accuracy was also improved >2-fold by applying the FP filters (average error decreased from 0.43 Gy to 0.18 Gy).

53d0ac74-507e-436b-aacb-c9e5d6ca5106_figure4.gif

Figure 4. Calibration curves for HC and CNL samples.

The dose-response calibration curves for (A) HC and (B) CNL metaphase cell image sample data. Response (mean DC frequency/cell) is on the vertical axis, corresponding radiation dose (Gy) on the horizontal axis. Green curves are based on unfiltered images, cyan curves were derived by recomputing DC frequencies after applying false positive (FP) filters (filters i + iv + v + vi + viii). HC and CNL curves were constructed by fitting a linear-quadratic curve through their respective HC and CNL calibration samples (refer to Table 2). The CNL curves consistently showed a more pronounced quadratic component than the HC curves, which exhibited a nearly linear response. The curves before (green) and after applying FP filters (cyan) are shown. After application of the filters, the HC and CNL curves showed diminished response at different Gy levels, due to elimination of some FP DCs.

Surprisingly, the dose accuracy of the HC samples did not improve after application of the FP filters (mean absolute error increased from 0.85 Gy to 1.03 Gy). All objects eliminated with these filters in the three HC test samples were reviewed and manually classified as either TP or FP, and the FP specificity across the samples was determined (Table 7), where FP specificity was defined as the ratio of FPs to all filtered objects. Similar to our earlier findings, the FP filters exhibited very high specificity for FPs (97.7–100%), indicating that the filters retained high specificity for TPs in the HC samples.

Table 7. Specificity of false positive (FP) filters* in HC test samples.

Image
sample**
Total no. of
chromosomes
removed
No. of
TPs
removed
No. of
FPs
removed
Specificity
for FPs
INTC03S011930193100%
INTC03S08133313097.7%
INTC03S10143214198.6%

TP, true positive

*FP filters refer to the subset of filters i + iv + v + vi + viii (Methods #2).

**See Table 3 for sample details.

We hypothesized that a difference in image selection protocols between the two laboratories was responsible for the discrepancies seen in classification performance and dose estimation accuracy. CNL manually selected for images deemed suitable for DCA analysis, and HC image selection was done with an automated metaphase classifier that effectively eliminates only images that lack metaphase cells. Manual review of images in these HC and CNL samples suggested differences in input image quality due to these image selection protocols. In concordance with findings from our previous study1, CNL data contained more images with well-spread, minimally-overlapping chromosomes, and fewer images with extreme SCS and chromosome fragments. The HC data contained a greater percentage of high-band-level (less condensed) chromosomes, characteristic of prometaphase/early-metaphase cell images. These chromosomes were the source of many unfiltered FPs, due to the lack of a strong primary constriction at the centromere which affects automated chromosome classification15.

A new set of HC calibration curves were then generated from manually curated, selected images from calibration samples (Figure 5). Images were excluded based on IAEA criteria17, along with cells exhibiting long chromosomes in early prometaphase16. Dose estimation accuracy of the HC test samples was significantly improved by enabling the FP segmentation filters (mean absolute error on unfiltered, curated images was 0.37 Gy prior to and 0.15 Gy after filtering; Table 8). Application of FP filters to both CNL and curated HC data led to >2-fold reduction in the mean absolute error of the estimated dose (p = 0.024, paired two tailed t-test). These results motivated the development of approaches to automatically select higher quality metaphase cell images.

53d0ac74-507e-436b-aacb-c9e5d6ca5106_figure5.gif

Figure 5. Original vs. manually curated calibration curves for HC samples.

The dose-response calibration curves for HC sample data, with and without false positive (FP) filters applied, before and after curation. Response (mean DC frequency/cell) on vertical axis, corresponding radiation dose (Gy) on horizontal axis. Green curve is not curated and includes all images, cyan curve is not curated and applies FP DC filters, red curve is curated, but unfiltered, and dark blue curve is curated and FP filters have been applied. Uncurated curves were generated from 0, 0.5, 1, 2, 3 and 4Gy calibration image data (Table 2). Curated curves were generated from the same data (except 0.5Gy was excluded) after lower quality images were manually removed (Methods section 4). After manual curation, the HC curves show a stronger quadratic component, similar to the original manually curated CNL curves (Figure 4).

Table 8. Dose estimates and deviations from physical dose for HC test samples after applying image selection models.

Image selection
model
INTC03S01
3.1*
INTC03S08
2.3
INTC03S10
1.4
INTC03S04
1.8
INTC03S05
2.8
INTC03S07
3.4
All images
(unfiltered)
2.65, -0.451.3, -1.00.2, -1.23.2, +1.42.1, -0.74.5, +1.1
Combined Z Score,
weight [5, 2, 4, 3, 4,
1]^, top 250 images
2.85, -0.251.8, -0.51.05, -0.354.65, +2.852.5, -0.35, +1.6, out of
bounds
Combined Z Score,
weight [4, 3, 4, 5, 2,
1], top 250
3.0, -0.11.8, -0.50.95, -0.454.95, +3.152.35, -0.455, +1.6, out of
bounds
Combined Z Score,
weight [1, 2, 1, 5, 1,
5], top 250
1.55, -1.551.35, -0.950.4, -1.04.4, +2.62.7, -0.15, +1.6, out of
bounds
Chromosome group
bin method^, top
250
2.6, -0.52.3, +0.01.05, -0.351.6, -0.22.8, +0.02.2, -1.2
Filters I-VI2.05, -1.050.95, -1.350.35, -0.950.6, -1.22.05, -0.751.15, -2.25
Filters I-III &
chromosome group
bin method, top 250
3.0, -0.12.0, -0.30.95, -0.451.85, +0.052.95, +0.152.35, -1.05
Manual image
curation
2.85. -0.252.3, +0.01.05, -0.353.95, +2.152.45, -0.354.0, +0.6

*Sample identifier, physical dose (Gy). False positive filters were enabled. Out of bounds indicates that the estimated dose exceeded the maximum calibrated dose.

^Figure 6 indicates effect of these image selection models on DC frequency for INTC03S04, INTC03S05, INTC03S7 as a function of the number of images analyzed.

53d0ac74-507e-436b-aacb-c9e5d6ca5106_figure6.gif

Figure 6.Relationship between DC frequency (y-axis) and number of images used (x-axis) to calculate frequency.

Samples exposed to different radiation levels and generated by each laboratory can be toggled and compared using the drop down menu (top left). The static image in the portable document format displays this relationship for the HC sample exposed at 3 Gy. Images were ranked by different scoring methods (see key). DC frequencies based on unordered, unselected images (order corresponds to the alphabetized file names, which is random with respect to image quality) are indicated with a blue line, images ranked by Group Bin Distance are shown in orange, and those ranked according to Combined Z Score are shown in green. Lowest count numbers in the ranked images correspond to the highest quality and lower quality images are progressively introduced as the count increases. Graphs were generated with Plotly (https://plot.ly/).

Application of image selection models

To the best of our knowledge, assessment of metaphase cell image quality for DC analysis has not been objectively and quantitatively standardized between laboratories. Cell selection by cytogenetic experts is based on their knowledge of metaphase chromosome conformation, sensitivity, and even individual preferences in interpreting images that can sometimes be inconsistent. Therefore, image selection methods were evaluated through dose estimation of filtered test samples and comparisons with known physical exposures. Images in all calibration and test samples from the same laboratory were processed by the same image selection model. Dose estimates of test samples were calculated using a curve fit to the dose-response of calibration samples. Dose estimation errors indicate the accuracy of dicentric chromosome detection, and therefore provide a means of assessing the accuracy based on the image selection model used.

Each image in a sample was ranked based on its Combined Z Score, which is the sum of the products of the Z score for each of the filters (I – VI) and their corresponding weights. Weights were assigned integer values from 1 to 5. The optimal weights were obtained by searching all possible integer values among the set of calibration samples to determine those exhibiting smallest residual differences with the physical dose after fitting these estimated doses to the curve. This approach, while limiting the search space and reducing the computational complexity, ensured that a diverse combination of weights were used to evaluate each sample. The three optimal weight vectors resulting from this analysis, [5, 2, 4, 3, 4, 1], [4, 3, 4, 5, 2, 1], and [1, 2, 1, 5, 1, 5], were used to independently estimate doses of test samples of unknown exposure.

After images from a sample were assigned either Combined Z Scores or Group Bin Scores and sorted by rank, the 250 top ranked images were selected to determine dicentric aberration frequency for that sample. An adequate number of top ranked images are selected to provide sufficient images to generate a reproducible DC frequency for that sample. In the absence of a predicate filtering step, the ranking procedure has to effectively remove poor quality images that could distort the DC frequency. IAEA recommends >100 DCs be counted for samples with physical doses < 1 Gy17. In practice, laboratories usually score >250 images, but at least 500 cells may be required to achieve this level of DC detection and often more are required for samples with low radiation exposures. Selecting at least the 250–300 top scoring images resulted in stable dicentric frequencies for samples from both laboratories over a range of exposures (Figure 6: the interactive version allows viewing of individual calibration samples from 0 to 4 Gy exposure and three blinded samples from both the CNL and HC laboratories; the HC3Gy sample is shown in the static PDF version). Compared to the unselected, unordered images, the image selection models show a monotonic increase of DC frequency with radiation dose for image counts with stable frequencies for most samples (eg. HC2Gy, HC3Gy, HC-INTC03S10, CNL2Gy, CNL INTC03S05, CNL-INTC03S07). However, DC frequencies can differ by image selection method. For higher ranking images, the Combined Z Score more consistently eliminated cells with DCs than the Group Bin Distance Scoring method, resulting slightly lower overall DC frequencies, which may be due to more stringent selection of cells possessing fewer FPs. Dose responses for the image selection methods are generally lower for samples with large numbers of top ranked, high quality images, which gradually increase with lower image quality due to the presence of increasing numbers of unfiltered FP DCs. By contrast, unfiltered randomly sampled images from the same sample exhibit higher overall DC frequencies due to increased numbers of FP DCs. As expected, all of the DC frequencies converge to the same value when none of the images are excluded by the ranking methods.

Deviations of the estimated doses of the HC and CNL test samples from their corresponding physical exposures were determined for various image selection models (Table 8 and Table 9, respectively). For comparison, the dose estimation results of unselected, comprehensive sets of images for each sample are also shown. Deviations of ≤ 0.5 Gy from their calibrated physical dose are considered acceptable in triage biodosimetry5,12,17. For the unfiltered HC samples, the average absolute error was 0.8 Gy, and only a single sample, INTC03S01, fulfilled the triage criteria. The image selection model comprising filters I-III sorted by Chromosome Group Bin rank was the most accurate, with dose estimates for 4 HC samples (INTC03S01, INTC03S08, INTC03S10 and INTC03S05) exhibiting acceptable error tolerances (± 0.5 Gy). The Combined Z Score ranking with weights: [1, 2, 1, 5, 1, 5] had the lowest dose estimation accuracy for the HC samples (average error is ~1 Gy), with only INTC03S05 having an acceptable dose estimate. Of the 5 unfiltered, manually curated CNL samples, only INTC03S08 had an acceptable dose estimate. However, an image selection model consisting of all Z Score filters I-VI was the most accurate for CNL samples (mean absolute error of ~0.3 Gy), with 4 of 5 samples (INTC03S08, INTC03S04, INTC03S05 and INTC03S07) having acceptable estimated doses.

Table 9. Dose estimates and deviations from physical dose for CNL test samples after applying image selection models.

Image selection modelINTC03S01
3.1*
INTC03S08
2.3
INTC03S04
1.8
INTC03S05
2.8
INTC03S07
3.4
All images4, +0.9, out
of bounds
2.6, +0.32.45, +0.653.6, +0.84, +0.6, out
of bounds
Combined Z Score,
weight [5, 2, 4, 3, 4,
1]^, top 250
3.9, +0.82.8, +0.52.1, +0.33.05, +0.253.55, +0.15
Combined Z Score,
weight [4, 3, 4, 5, 2, 1],
top 250
4, +0.9, out of bounds2.75, +0.451.7, -0.13.05, +0.253.9, +0.5
Combined Z Score,
weight [1, 2, 1, 5, 1, 5],
top 250
3.6, +0.52.4, +0.10.65, -1.152.35, -0.453.05, -0.35
Chromosome group bin
method^, top 250
4, +0.9, out of bounds2.8, +0.51.8, +0.02.55, -0.254, +0.6
Filters I-VI3.75, +0.652.8, +0.51.95, +0.153.05, +0.253.4, +0.0
Filters I-III &
chromosome group bin
method, top 250
4, +0.9,
out of bounds
2.75, +0.451.65, -0.152.25, -0.553.95, +0.55
Manual image curation3.7, +0.62.85, +0.552.2, +0.42.8, +0.03.5, +0.1

*Sample identifier, physical dose (Gy). False positive filters were enabled. Out of bounds indicates that the estimated dose exceeded the maximum calibrated dose.

^Figure 6 indicates the effect of these image selection models on DC frequency for INTC03S01, INTC03S08, INTC03S10 as a function of the number of images analyzed.

While automated image selection rejects poor images and reduces FP DCs, dose estimates can only be considered reliable if sufficient numbers of images remain after filtering. Application of image filters can result in fewer than the recommended images for accurate dose estimation. Samples CNL-INTC03S08 and HC-INTC03S07, had 195 and 109 metaphase cells, respectively, after filtering and image selection. HC-INTC03S07 was of relatively lower quality, and the unfiltered set of 477 metaphase images contained fewer than the recommended minimum number after filtering (Table 10).

Table 10. Goodness of fit Poisson scores* of unfiltered, manually- and ADCI-filtered calibration and test samples.

SampleUnfiltered
images
Manual
image
selection
Automated
image
selection
HC0Gy1.3E-152.2E-01No DCs detected
HC05Gy4.1E-32Unavailable1.1E-02^
HC1Gy1.6E-189.9E-011.0E-01^
HC2Gy4.0E-642.1E-017.6E-04^
HC3Gy2.9E-024.6E-016.1E-01^
HC4Gy2.6E-042.2E-013.1E-01^
HC-INTC03S01<2.2E-308+9.1E-011.2E-01^
HC-INTC03S081.2E-014.6E-018.2E-01^
HC-INTC03S108.9E-013.9E-012.1E-01^
HC-INTC03S04<2.2E-308+<2.2E-308+5.5E-01^
HC-INTC03S051.1E-067.3E-053.5E-03^
HC-INTC03S07<2.2E-308+<2.2E-308+2.0E-04^
CNL0Gy5.2E-031.3E-013.1E-01#
CNL05Gy1.7E-1571.2E-016.0E-32#
CNL1Gy9.8E-301.5E-031.6E-06#
CNL2Gy2.3E-147<2.2E-308+4.5E-02#
CNL3Gy8.5E-076.8E-039.9E-01#
CNL4Gy5.2E-223.3E-021.8E-01#
CNL-INTC03S041.7E-601.9E-025.4E-02#
CNL-INTC03S052.7E-095.2E-023.3E-01#
CNL-INTC03S076.7E-104.2E-054.7E-01#
CNL-INTC03S01<2.2E-308+3.7E-037.6E-11#
CNL-INTC03S085.3E-167.7E-017.8E-01#

*Poisson score is the p-value of chi-square goodness of fit (without merging bins) of observed distribution of DCs/cell vs. Poisson distribution determined from average DC frequency. Filtering parameters chosen for each laboratory exhibit dose estimates that are closest to the physical dose:

^HC image sets were selected with filters I-III and ranked by chromosome group bin score;

+Minimum positive floating value in Windows operating system;

#CNL image sets were selected with filters I-VI.

Sample quality assessment after image selection

To determine if image selection improved sample quality, a Chi-squared goodness of fit test on Poisson distributed DCs was performed, both before and after automated and manual image selection (Table 10). Manual image selection for CNL samples was performed by CNL during sample preparation, while image selection for HC samples was performed on unselected datasets (samples HC-INTC03S01, HC-INTC03S08, HC-INTC03S10 were analyzed, despite <500 available images). The optimal image selection models were used for FP and image filtering for each laboratory (Table 8 and Table 9). The HC samples were selected with filters I-III and the chromosome group bin method, whereas the CNL samples were processed with filters I-VI. At the 1% significance level (i.e. Chi-square goodness-of-fit, p ≤ 0.01), 86% (19 of 22) of unfiltered samples were significantly different from the Poisson distribution, and 76% (13 of 17) of manually- and 77% (17 of 22) of automatically-selected samples did not differ. Manually curated and uncurated sample groups also significantly differed from each other (p = 0.0021; one-tailed Wilcoxon Signed-Rank Test, α=0.05, n=17). Therefore, the Poisson goodness of fit measures improvements in overall sample quality from image model selection. While the overall goodness of fit is improved for all of the automatically selected datasets, the Poisson distributions of DCs in the lowest quality samples (CNL1Gy, CNL05Gy, CNL-INTC03S01, HC-INTC03S05, HC-INTC03S07) were still rejected at a 0.5% significance threshold after filtering.

Discussion

Automated biodosimetric methods to detect DCs can produce incorrect assignments because the algorithms cannot capture the full range of morphological variability inherent in chromosome images of metaphase cells. Accuracy of these radiation exposure estimates can be improved by morphology-based chromosome image segmentation filters that eliminate suboptimal metaphase cell images and false positive DCs in the remaining images. Compared to results generated by the previous version of ADCI which did not reclassify FPs or remove any cell images11, the filters described here reduced FP DC rates by ~55% across a wide range of radiation exposure levels. Additionally, we showed that the object segmentation filters were highly specific for FPs in test image sets consisting of irradiated samples blinded to known dose (97.7–100%, n=6). Overall, the FP filters substantially improved DC classification accuracy.

The segmentation filters successfully target the majority of cells with SCS and chromosome fragments. The intercandidate contour symmetry filter is a particularly promising SCS detector, individually eliminating 84% of all SCS-induced FPs in our test dataset. Acrocentric chromosomes were disproportionally susceptible to SCS-induced errors compared to other chromosome types (69% of SCS cases, despite making up only 22% of human chromosomes). Given the rarity of acrocentric TP DCs (due to width profile inaccuracies at the extreme ends of chromosomes79), filters targeting acrocentric or small chromosomes, in general (such as filters i and vi), can also be useful for reducing SCS-induced FPs.

Certain FP subclasses were commonly targeted by multiple filters. Redundancy among the segmentation features resulted in only a subset of the filters being required to maximize FP elimination. Notably, filters ii–v eliminated FPs based on different definitions of chromosome width. The final FP filter combination consisted of only 5 of the 8 originally proposed filters. However, it should be noted that a combination of only 2 of the filters - the intercandidate contour symmetry (viii) and max width (iv) filters - achieved nearly the same level of FP detection in the test sample dataset, with the others having only incremental benefit.

The image selection filters were required to be scale-invariant, since chromosome structures may vary between cells, individual samples, and laboratory preparations. Scale invariance is also necessary to control for pixel-based chromosome measurements affected by chromosome condensation differences within a metaphase cell, and differences arising from optical magnification. This was achieved by either using image level filter scores normalized to the median “raw” score of all objects within the same cell image (i.e. filters I–V), or in which scores were determined from the ratios of pixel-based features (i.e. filters VI–VIII).

Differences in accuracy between the manually- and automatically-selected images for dose estimation revealed limitations of the current set of filters. The FP object filters in the manually curated CNL and HC image samples reduced the average dose estimation error from 0.4 Gy to <0.2 Gy (with a maximum error of 0.4 Gy), respectively. However, solely applying the FP object filters to unselected HC metaphase data was insufficient to correct this problem (average error increased by 0.15 Gy), and led to more inaccurate dose response values.

Variability in cell image quality contributed to this source of error. Some unselected HC samples contained images with high levels of SCS, which upon processing, produced large numbers of incorrectly classified chromosome fragments in some cells. While FP DC filters i–v target detection of these fragments, they were not reclassified in these cells, because they comprised the predominant chromosome morphology. For similar reasons, FP filtering was not suitable for elimination for removal of FPs in prometaphase cells containing many high resolution, long chromosomes (>700 band level). These observations suggested the need for another class of morphological filters that operated on complete images to remove those of low quality prior to dose estimation.

Image quality is a critical aspect of accurate DC detection and dose estimation. Manual inspection and quality control of metaphase selection is a common and essential practice in cytogenetic and biodosimetry laboratories, but it can be labor-intensive, and is frequently not automated. Image-level filtering automatically applies statistical thresholds to eliminate chromosomes with morphological features and non-chromosomal objects that predispose to FP DC assignments. Image scoring methods can also select a defined number of top-ranked, processed images for dose estimation. These FP filtering and image scoring methods can be applied either individually or in combination, resulting in improvements in the accuracy of DC frequency. Errors in dose estimates are considerably reduced using suitable image selection models in samples with ≥250 images. Doses were accurately estimated for most test samples within ±0.5Gy of their physical doses, as recommended17. Therefore, the image selection models presented provide reliable quality control, and can minimize manual review or DC analysis.

Automated image selection aims to simulate manual image curation. At this point, it does not quite achieve the same overall accuracy as manual image selection, especially for samples containing numerous images of lower quality. However, the respective differences in dose estimates of higher quality samples from HC and CNL, especially at exposures >2 Gy, are not significant. Automating image selection, nevertheless, offers unique advantages over manual image selection by introducing a uniform approach for chromosome analyses, ensuring both increased reliability and speed.

Data and software availability

Python code and sample data files for “Accurate cytogenetic biodosimetry through automated dicentric chromosome curation and metaphase cell selection” are available at http://doi.org/10.5281/zenodo.83353618.

MATLAB code and sample data files for “Accurate cytogenetic biodosimetry through automated dicentric chromosome curation and metaphase cell selection” are available at http://doi.org/10.5281/zenodo.83354019.

Source code license: CC-BY 4.0

Comments on this article Comments (1)

Version 1
VERSION 1 PUBLISHED 09 Aug 2017
Author details Author details
Competing interests
Grant information
Copyright
Download
 
Export To
metrics
Views Downloads
F1000Research - -
PubMed Central
Data from PMC are received and updated monthly.
- -
Citations
CITE
how to cite this article
Liu J, Li Y, Wilkins R et al. Accurate cytogenetic biodosimetry through automated dicentric chromosome curation and metaphase cell selection [version 1; peer review: 2 approved]. F1000Research 2017, 6:1396 (https://doi.org/10.12688/f1000research.12226.1)
NOTE: If applicable, it is important to ensure the information in square brackets after the title is included in all citations of this article.
track
receive updates on this article
Track an article to receive email alerts on any updates to this article.

Open Peer Review

Current Reviewer Status: ?
Key to Reviewer Statuses VIEW
ApprovedThe paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approvedFundamental flaws in the paper seriously undermine the findings and conclusions
Version 1
VERSION 1
PUBLISHED 09 Aug 2017
Views
27
Cite
Reviewer Report 31 Aug 2017
Eric Gregoire, Biological Dosimetry Laboratory (LDB), Institute for Radiological Protection and Nuclear Safety (IRSN), Fontenay-aux-Roses, France 
Approved
VIEWS 27
This paper presents an approach for the improvement of automatic detection of dicentrics and particularly the removal of some kinds of False Positives (FP). Indeed, the dicentric is the standard bio-marker in biological dosimetry. However the manual scoring of dicentric ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Gregoire E. Reviewer Report For: Accurate cytogenetic biodosimetry through automated dicentric chromosome curation and metaphase cell selection [version 1; peer review: 2 approved]. F1000Research 2017, 6:1396 (https://doi.org/10.5256/f1000research.13236.r25106)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
  • Author Response 12 Oct 2017
    Peter Rogan, Department of Biochemistry, Schulich School of Medicine and Dentistry, University of Western Ontario, London, N6A 5C1, Canada
    12 Oct 2017
    Author Response
    We agree that a direct comparison of ADCI with DCScore might be worthwhile in the future, but it was not possible to carry this out in the present study.  In ... Continue reading
COMMENTS ON THIS REPORT
  • Author Response 12 Oct 2017
    Peter Rogan, Department of Biochemistry, Schulich School of Medicine and Dentistry, University of Western Ontario, London, N6A 5C1, Canada
    12 Oct 2017
    Author Response
    We agree that a direct comparison of ADCI with DCScore might be worthwhile in the future, but it was not possible to carry this out in the present study.  In ... Continue reading
Views
34
Cite
Reviewer Report 25 Aug 2017
David Lloyd, Centre for Radiation, Chemical and Environmental Hazards (CRCE), Public Health England, Didcot, UK 
Approved
VIEWS 34
This paper presents a considerable amount of work devoted to improving the accuracy of existing automatic dicentric systems for biological dosimetry.  The authors have developed some very interesting ideas in constructing filters that detect false positive dicentrics and thereby improve ... Continue reading
CITE
CITE
HOW TO CITE THIS REPORT
Lloyd D. Reviewer Report For: Accurate cytogenetic biodosimetry through automated dicentric chromosome curation and metaphase cell selection [version 1; peer review: 2 approved]. F1000Research 2017, 6:1396 (https://doi.org/10.5256/f1000research.13236.r24918)
NOTE: it is important to ensure the information in square brackets after the title is included in all citations of this article.
  • Author Response 12 Oct 2017
    Peter Rogan, Department of Biochemistry, Schulich School of Medicine and Dentistry, University of Western Ontario, London, N6A 5C1, Canada
    12 Oct 2017
    Author Response
    Regarding the use of fluorescent probes to highlight centromeres:

    PNA-FISH or any type of FISH can achieve higher intensities and contrast at centromeres than Giemsa staining, and would presumably require fewer ... Continue reading
COMMENTS ON THIS REPORT
  • Author Response 12 Oct 2017
    Peter Rogan, Department of Biochemistry, Schulich School of Medicine and Dentistry, University of Western Ontario, London, N6A 5C1, Canada
    12 Oct 2017
    Author Response
    Regarding the use of fluorescent probes to highlight centromeres:

    PNA-FISH or any type of FISH can achieve higher intensities and contrast at centromeres than Giemsa staining, and would presumably require fewer ... Continue reading

Comments on this article Comments (1)

Version 1
VERSION 1 PUBLISHED 09 Aug 2017
Alongside their report, reviewers assign a status to the article:
Approved - the paper is scientifically sound in its current form and only minor, if any, improvements are suggested
Approved with reservations - A number of small changes, sometimes more significant revisions are required to address specific details and improve the papers academic merit.
Not approved - fundamental flaws in the paper seriously undermine the findings and conclusions
Sign In
If you've forgotten your password, please enter your email address below and we'll send you instructions on how to reset your password.

The email address should be the one you originally registered with F1000.

Email address not valid, please try again

You registered with F1000 via Google, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Google account password, please click here.

You registered with F1000 via Facebook, so we cannot reset your password.

To sign in, please click here.

If you still need help with your Facebook account password, please click here.

Code not correct, please try again
Email us for further assistance.
Server error, please try again.