Introduction

Scaphocephaly is the most common form of isolated single-suture synostosis and accounts for 40–60% of the cases of craniosynostosis [1, 2]. Premature closure of the sagittal suture results in an elongated anteroposterior and narrowed transverse dimension of the skull (“boatlike head”) [3,4,5]. A variable phenotype is seen and depends on the region of synostosis of the sagittal suture: anterior part of the suture (frontal bossing), posterior part of the suture (occipital bulging), or complete fusion (both frontal and occipital bulge) [6, 7].

Craniosynostosis alters skull shape and thereby possibly intracranial volume (ICV). An apparent contradiction has been found in previous studies regarding ICV in patients with scaphocephaly; pre-operative ICV in these patients was found to be increased when compared to mean normal values available through Lichtenberg [8,9,10]. However, other studies found normal or diminished ICV in these patients [11, 12].

Although in general the clinician can relatively easily establish the diagnosis scaphocephaly, it is more difficult to evaluate the severity and its esthetic improvement following surgery. Scaphocephaly can differ in severity and phenotype, depending on the degree of premature suture fusion [6, 7]. Cranial index (CI) is the most commonly used method for evaluation of cranial deformities in scaphocephaly. However, CI only takes length and width into account, and therefore a major disadvantage of this method is that it does not capture the frontal and occipital malformations or its correction [13].

The aim of the current study is to implement UCSQ (Utrecht Cranial Shape Quantificator) for quantification of scaphocephaly [14]. UCSQ is an outline-based method of classification of skull shape deformities. We use external landmarks (soft tissue landmarks (left portion and left and right exocanthion), visible with the bare eye) to determine a reference plane at 4 cm height on CT scan. Following, an algorithm measures distance and angle from center of mass on the plane to the skull outline, leading to sinusoid curves. The resulting curves are specific and characteristic for scaphocephaly. UCSQ has the advantage of capturing the actual skull shape variation with every 3D diagnostic system capturing the surface of the head. Additionally, since there is still controversy concerning ICV of patients with scaphocephaly, these volumes are measured and correlated to the severity of the skull deformity. UCSQ is compared to the gold standard of quantification, CI.

Material and methods

Patients

Children with computed tomography (CT) confirmed nonsyndromic scaphocephaly was eligible for inclusion in the present study. Inclusion criteria were an age of 2 years or younger and a pre-operative status. Additionally, pre-operative 3D CT scans needed to be available and needed to contain the complete skull, including orbits and ears. Patients were diagnosed and operated at the Erasmus Medical Center, Sophia Children’s Hospital Rotterdam.

The study was approved by the local Medical Ethics Review Committee (MEC-2016-467). The study was deemed a retrospective clinical study and did not require formal research ethics approval under the Medical Research Involving Human Subjects Act.

Radiological studies

As stated before, pre-operative 3D CT scans of the skull were collected. CT scans with a slice thickness of more than 3.00 mm were excluded.

Additionally, conventional skull X-rays (anterior-posterior projection and lateral projection) of the included patients were collected.

Analysis of curve

In this study, we used the UCSQ in order to create sinusoid curves of the included patients (Fig. 1) [14]. The resulting curves were analyzed. Patients with scaphocephaly have a specific and recognizable skull deformity and therefore a specific pattern of the curve. Based on the sinusoid curve, the severity of craniosynostosis can be diagnosed.

Fig. 1
figure 1

Summary of methods

First, we measured different aspects of the curve. Maximum of forehead (F) and occiput (O), minimum value of the right side of head (trough) (R), and left side (trough) (L) were measured. We calculated the width of the skull (mean of both sides) (R/2 + L/2), difference occiput and sides (O-R/2-L/2), difference forehead and sides (F-R/2-L/2), and difference forehead and occiput (F-O). Additionally, abruptness of the two descents (respectively, Do (descent occiput; first descent) and Df (descent forehead; second descent)) and two ascents (respectively, Ao (ascent occiput; second ascent) and Af (ascent forehead; first ascent)) on the Y-axis were measured between the values of respectively 0.9 and 1.1 (Fig. 2). The described variables (named: variables of the curve) were correlated with the visual score.

Fig. 2
figure 2

Visualization of the used variables. Abbreviations: Af, abruptness first ascent (forehead); Ao, abruptness second ascent (occiput); Df, abruptness second descent (forehead); Do, abruptness first descent (occiput); F, maximum of forehead; L, minimum value of left side of head; R, minimum value of right side of head; XF, X-value of maximum forehead value; XL, X-value of the minimum value of the width on the left side; XR, X-value of the minimum value of the width on the right side

Second, the following parameters were combined and used to quantify severity of scaphocephaly according to UCSQ (named: UCSQ): width of the skull and maximum occiput and forehead. Value of width of skull was given a rank number; the narrower the skull, the higher the rank number. Maximum occiput and forehead were added and given rank numbers; the higher these added values, the higher the rank. Both ranking numbers were added.

Third, in order to differentiate between the different levels of severity of scaphocephaly, we used the most distinctive variables of scaphocephaly, namely, forehead, occiput, and the width of the skull. As stated before, the craniosynostosis skull is longer and narrower than the normal skull. Therefore, the difference between the mean values of a control skull for the previous variables and those of a scaphocephaly patient is indicative for severity. We used the mean values from the control patients, as reported in our previous study [15]. The following calculation in order to determine cut-off values and the different classes of severity (mild, moderate, severe) was developed: (Occiput - 1.1) + (Forehead - 1.15) + (1.70 - Width of skull). In this calculation, the values 1.1 and 1.15 are the values for occiput and forehead in control patients. A width of skull lower than 1.70 (2 × 0.85 as value for both sides of the head) is specific for scaphocephaly, and this value will be lower in case of a severe scaphocephaly. Additionally, cut-off values for each subgroup of severity were proposed.

Visual score

For patients with an available conventional skull X-ray, this X-ray is independently evaluated for width and length of the skull, vertex height, presence of frontal bossing or occipital bulging, and temporal narrowing by three medically trained panel members. When no conventional skull X-ray was available, visual score was performed on CT scan of the skull. The following scoring system (visual score) was assigned for each of the features: score of 0 (normal), 1 (moderate deformity), and 2 (severe deformity). The six parameters were separately evaluated; thus, maximum deformity meant that all the evaluated parameters were severely deformed (a maximum of 12 points could be given). The evaluation is subjective, and no objective criteria are available. Therefore, clinical experience is essential. The normal skull is scored 0 points, the most severe form of scaphocephaly 12 points with presence of a clear frontal bossing and occipital bossing and lower vertex (horse saddle deformity) and increased length (by a fourth) and reduced width (by a fourth) at parietal and temporal level. If 1 of the 6 parameters is between normal and severe, 1 point is scored.

Cranial index

For patients with an available conventional skull X-ray, CI was measured on this skull X-ray. When unavailable, CI was measured on CT scan of the skull. CI represents the ratio of maximum cranial width to maximum cranial length multiplied by 100 (CI = biparietal diameter (width of skull)/occipitofrontal diameter × 100%). CI gives information of the head shape and the severity of the malformation of the skull.

CI can also be measured using the curve following UCSQ. The maximum cranial width in the new method was measured by multiplying the mean value of both sides of the head by 2, and the maximum cranial length was measured by adding the value of the maximum of forehead to the maximum value of the occiput.

Calculating the intracranial volume

The entire intracranial cavity was considered region of interest (ROI) in order to calculate ICV. CT DICOM images were imported to OsiriX (version 7.0, Fondation OsiriX, Geneva, Switzerland) on Mac OSX. Start slice was considered just above foramen magnum and end slice just beneath vertex of the skull. On each axial slice, the ROI was manually outlined on the inner table of the skull, and defects were manually closed. The total ICV was extrapolated (Fig. 3).

Fig. 3
figure 3

Visualization of calculated and measured ICV

Lichtenberg’s normal value of intracranial volume

As reference for normal intracranial volumes of (healthy) children, matched for age and sex, we used Lichtenberg’s data, which is the most frequent reference for normal intracranial volume [8].

Statistical analysis

To assess the level of agreement between observers, intraclass correlation coefficients (ICC) was calculated. The ability of a test to show interobserver reliability was evaluated using the two-way random effects model assuming an average measurement and absolute agreement. An ICC of 1 means perfect reliability, and an ICC of 0 shows poor reliability. ICC for visual score was calculated.

In each patient for each of the evaluated items of the visual assessment, the average of the scores given by the three observers was calculated and used in further statistical evaluation. Each of the calculated averages of the six evaluated items were summed up and resulted in an overall visual assessment, which could range from 0 to 12.

To assess monotonic relationships, between the scored individual features of the visual assessment or the overall visual assessments and each variable of the curve, the Pearson’s correlation coefficient was used. Additionally, UCSQ was correlated to visual score by using Pearson’s correlation coefficient.

Pearson’s correlation between traditional CI and calculated CI of UCSQ, traditional CI, and overall visual score and between calculated CI of the UCSQ and overall visual score were calculated to determine a preferable method of measurement of the CI.

ICV was correlated to both traditional CI and CI of the UCSQ method, visual score, and UCSQ by using Pearson’s correlation coefficient.

The outcomes of the ICC are characterized by Landis and Koch as poor (0.00 to 0.20), fair (0.21 to 0.40), moderate (0.41 to 0.60), good (0.61 to 0.80), or excellent (0.81 to 1.00) [16]. The accepted guidelines for interpreting the correlation coefficients are as follows: + 1 indicates a perfect positive linear relationship, − 1 indicates a perfect negative linear, and 0 indicates no linear relationship. The size of a correlation coefficient can be interpreted as follows: negligible correlation (0.00 to 0.30), low (0.30 to 0.50), moderate (0.50 to 0.70), high (0.70 to 0.90), and very high (0.90 to 1.00) [17].

Statistical analyses were performed using the Statistical Package for the Social Sciences for Windows (Version 21, SPSS Inc., Chicago, IL, USA).

Results

We included 21 children with nonsyndromic scaphocephaly. Mean age of patients was 7 (2–24) months. There were 18 male and three female patients (85.7% vs. 14.3%). In 14 patients, additional conventional skull X-ray was available.

Analysis of curve

Figure 4a shows an overview of the curves of the included patients. Figure 4b shows both the mean curve of scaphocephaly patients and additionally for illustration the mean curve of 5 control patients used in our previous study [15]. Table 1 shows the measured and calculated variables with the use of the generated curves. Mean calculated UCSQ was 22.00 (2.00–42.00).

Fig. 4
figure 4

Overview resulting sinusoid curves. a Overview graphs of all included scaphocephaly patients and minimal, maximal, and all mean value. b Mean curve of included scaphocephaly patients (N = 21) and mean curve of control patients (N = 5)

Table 1 Measured and calculated variables of the curves

Mean of the calculation for severity of scaphocephaly ((Occiput - 1.1) + (Forehead - 1.15) + (1.70 - Width of skull)) was 0.30 (0.01–0.60). Mean (Occiput - 1.1) was 0.08 (0.00–0.15), mean (Forehead - 1.15) was 0.09 (0.01–0.18), and mean (1.70 - Width of skull) was 0.12 (0.00–0.27).

Visual score

ICC for visual score was calculated. The level of agreement between the three observers for the values of the width and length of the skull represents perfect reliability (both 0.81; p < 0.000), for vertex height and occipital bulging represent good reliability (0.63; p < 0.000 and 0.75; p < 0.000), and for presence of frontal bossing and temporal narrowing moderate reliability (0.53; p = 0.019 and 0.43; p = 0.019). For illustration, Fig. 5 shows a patient with the maximum visual score of 12, with presence of a clear frontal bossing and occipital bossing and lower vertex and increased length and reduced width at parietal and temporal level.

Fig. 5
figure 5

Conventional skull X-ray for visual score. This patient had a visual score of 12. a Lateral view. b Anterior-posterior view

Mean visual score was 9.1 (min. 7–max. 12).

CI

Mean traditional CI was 66.01 (57.36–78.58), and mean calculated CI of UCSQ was 65.18 (55.41–75.32).

ICV

The pre-operative intracranial volumes ranged from 671 mL in a 2-month-old child to an intracranial volume of 1303 mL in a 24-month-old child. Mean intracranial volume of all included patients was 910 mL.

Lichtenberg normative data was used for comparison of normal intracranial volume with pre-operative intracranial volumes of the scaphocephaly patients. Figure 6a and b show measurements of ICV of each included patients plotted in these ranges.

Fig. 6
figure 6

Lichtenberg normative intracranial volume curves are gender- and age-specific curves; ICV measurement of each included patient is plotted on the Lichtenberg normative curves. Orange rhombi indicate patients; dotted lines indicate SD lines (± 1 SD and ± 2 SD) of Lichtenberg mean; continuous line indicates Lichtenberg mean. a Boys (n = 18). b Girls (n = 3)

Table 2 shows the ICV of the included patients and the ICV compared to the Lichtenberg’s data. Ten of 18 (56%) boys and 3 of 3 (100%) girls had volumes at or greater than the Lichtenberg mean. In total, 2 boys and 2 girls had an ICV larger than +2 SD. Six of 18 (33%) boys had an ICV lower than the Lichtenberg mean. None of the patients had an ICV lower than – 2 SD.

Table 2 Calculated ICV per patient and comparison to Lichtenberg’s data

Correlations

Overall scores (added value of the evaluated features of conventional skull X-rays) of the visual score show high correlations with O (r = 0.72; p < 0.000), mean width of skull ((R/2 + L/2)) (r = − 0.80; p < 0.000), and moderate correlation with F (r = 0.62; p = 0.003). Negligible correlation was found for the difference between the forehead and occiput (r = 0.12; p > 0.05). Overall visual scores show moderate correlation with abruptness of first descent (Do) (r = − 0.53; p = 0.013), abruptness of second descent (Df) (r = − 0.63; p = 0.002), abruptness of first ascent (Af) (r = 0.61; p = 0.003), and abruptness of the second ascent (Ao) (r = 0.63; p = 0.002).

Additionally, when looking at different items of visual score separately, length of forehead according to visual score was correlated to the extracted value from the curve, showing negligible correlation (r = 0.29; p > 0.05). Length of occiput according to visual score moderately correlated to extracted value from the curve (r = 0.56; p = 0.008), and width of skull according to visual score showed moderate correlation to extracted value from the curve (r = − 0.66; p = 0.001).

Correlation between traditional CI and calculated CI of UCSQ was high (r = 0.88; p < 0.000).

Correlation between traditional CI and overall visual score show a moderate relationship (r = − 0.59; p = 0.005). Correlation between calculated CI of UCSQ and overall visual score was high (r = − 0.81; p < 0.000).

UCSQ was used to quantify the severity based on the selected variables and given ranking numbers. High correlation was found between UCSQ and visual score (r = 0.88; p < 0.000).

Low and negligible correlation was found between ICV and both traditional CI and CI of UCSQ (r = − 0.30; p > 0.05 and r = − 0.23; p > 0.05, respectively). Correlation between ICV and visual score was negligible (r = 0.17; p > 0.05). Negligible correlation was found between ICV and UCSQ (r = 0.26; p > 0.05).

Discussion

The aim of the current study was to evaluate and quantify a group of pre-operative patients diagnosed with isolated scaphocephaly using our UCSQ [15]. Our method is able to differentiate between the different levels of severity in patients with isolated sagittal synostosis.

In addition to CI, several new techniques are introduced in order to classify scaphocephaly; however, these methods are difficult to reproduce and complex [10, 14, 18,19,20,21,22]. No consent has been reached about which method to use in patients with scaphocephaly. The lack of agreement of superiority of any method is partially explained by the insufficiency of available evidence and complexity of the methods.

The first reported and most commonly used method of quantification is the CI, which only takes the length and width of the skull into account, while any other clinical findings or dysmorphology is not. For instance, a cranium with relatively minimal AP length but increased frontal height can be a severe case of scaphocephaly without an appropriately low CI. An updated index of the CI was introduced to differentiate the severity of scaphocephaly, the scaphocephaly severity index (SSI; distance-based approach). In comparison with the CI, the SSI uses calculations on different, perpendicular, transverse planes of CT scans, and additionally, internal or surface landmarks are used [21]. Neither the CI nor the SSI used the actual geometric variation in shape of the skull. A solution for capturing geometric variations of the skull is an outline-based approach [23].

Our UCSQ is a distance- and outline-based approach, which uses recognizable and reproducible surface landmarks and captures the actual geometric variations and cranial width and length of the skull [15]. A major advantage of the outline-based quantification method, instead of the distance-based quantification method, is the ability to represent more aspects of cranial deformation.

In the current study, ICVs were calculated; however, no consistency in pre-operative volume measurements was found. ICV ranged from smaller to bigger than Lichtenberg mean. Four of 21 patients had an ICV larger than + 2 SD, and other ICVs were in the acceptable range. These findings are comparable with earlier reports, noting these differences in volumes [9,10,11,12]. Negligible correlation between severity of scaphocephaly according to UCSQ and ICV was found. ICV did not correlate to CI and visual score. These results indicate that the more severe cases of scaphocephaly do not result in more deviant skull volumes, neither smaller nor larger volumes.

The scaphocephalic skull shape is long and narrow, resulting in high values of the maximum occiput and forehead and low values of the width of the skull. Moderate and high correlation is found between overall visual score and forehead and occiput, respectively, indicating a higher score correlates with a longer forehead and occiput. In addition, high correlation was found for width of head, which shows correlation with a higher score and a more narrow side of the head. This indicates that the overall visual score correlates with the individual variables, length, and width; these variables are therefore suited for quantification of severity.

We found that a high correlation was found between UCSQ and visual score, meaning that the deformity visible with the bare eye can be expressed in number by UCSQ. UCSQ can objectively quantify severity of scaphocephaly. Additionally, correlation between UCSQ and visual score was stronger than the correlation between visual score and CI.

Based on our calculation for severity of scaphocephaly ((Occiput - 1.1) + (Forehead - 1.15) + (1.70 - Width of skull)), we propose the following cut-off values in order to classify severity: mild < 0.20, moderate ≥ 0.20 and < 0.35, and severe ≥ 0.35. However, these cut-off values are only based on 21 patients, and further validation of these values is needed in future research.

When looking into separate items of the visual score, length of forehead and occiput according to visual score show less strong correlation to the extracted values from the curve. Correlation between width of skull according to visual score showed moderate correlation to extracted value from the curve. The measurements were higher than visual scores, indicating that the visual scores underestimate the real deformity. The first descent and second ascent in the curves are linked to the occiput; first ascent and second descent are linked to the forehead. However, these measures were not consistently related to the corresponding specific features of the visual score, possibly due to underestimation of the visual score.

We compared traditional CI to the calculated CI using UCSQ. Hereby, a limitation occurs, a possible underestimation of CI by using UCSQ, since it takes the length at 4 cm above the basal plane and not at the level of the largest AP length. The new method (UCSQ) then results in a lower CI compared to traditional CI. However, the outcomes of the traditional CI and the calculated CI of our method showed high correlation. Traditional CI showed moderate correlation with the visual score, and calculated CI of UCSQ showed high correlation; negative correlation was measured in both methods, consistent with our other findings. This implicates that our method outperforms traditional CI. This can be explained by the wide variation in the methodology in determining maximum cranial length and width in practice by using the traditional CI, making CI measurements across different studies difficult to compare. This is improved by UCSQ. However, traditional CI is easy, fast, and applicable in clinical setting and is therefore the method of preference. In the future, the UCSQ will be implemented on 3D photogrammetry to perform curvature analysis. This may result into the advantages of minimal invasive follow-up of the skull growth and gives a good visualization of the morphology of the skull shape; furthermore, this method can provide insight in changes in skull shape due to (varying) surgical techniques in comparison to nonsurgical management.

In this study, we used an objective method for quantification of scaphocephaly, which provides a satisfactory representation of the actual characteristic morphology and can be used for analysis of the outcome of a given treatment. Using a scoring system that reflects the degree of severity would allow researchers to investigate other issues, such as the correlation between the severity of the skull malformation and cognitive outcome for patients with scaphocephaly. Our current advice is to use traditional CI in clinical practice, since it is easy to use and minimally invasive for the patients. However, UCSQ is a promising method of quantification and classification in craniosynostosis patients. Therefore, further research is needed to implement UCSQ in surgical decision-making and skull growth follow-up. Additionally, no strong correlation was found between severity of the skull deformity (visual and according to UCSQ) and ICV; more severe scaphocephaly does not result in more deviant skull volumes.