Introduction

Vascular birthmarks can result in physical and psychological problems and may lead to developmental and behavioral problems [1]. Modern treatment of vascular birthmarks is with 585-nm pulsed dye laser therapy, which can lead to the restoration of normal skin but, in the majority of patients, results in only a partial improvement [24].

The evaluation of the treatment outcomes of birthmarks has been derived from subjective and qualitative judgments and does not provide a validated method for clinicians [5]. These subjective observations have relied primarily on visual inspection, which makes treatment effects difficult to evaluate with any precision [6]. The color perception of human eyes varies among individuals; moreover, the problems of illuminant metamerism, different lighting conditions, sensitivity spectrum, and adjustment of the eye to color may be involved in color mismatch and inaccurate comparisons [7].

Apart from lacking standardized objective methods of assessing the results after laser treatment, the grading system of the outcomes differs among institutions, influencing even more the professional communication of results between clinicians. Three-grade classification uses the following grading: excellent for lesion improvement of 75 to 100 %, good for 50 to 75 %, and fair for 25 to 50 % [8]. Four-grade system consists of the following steps: excellent (complete clearance of pigmentary lesion), good (subtotal clearance), fair (moderate clearance), and poor (minimal to no response after treatment) [9]. Perez et al. described the scoring scale 0 to 100 where value 100 stands for total response to treatment [6]. A universally accepted system would improve the communication among clinicians and scientists.

Previous reports, concerning objective evaluation, included the following: laser Doppler, reflectance spectrophotometry, tristimulus colorimetry, and videomicroscopy [912]. The above methods have not been widely incorporated into clinical practice either due to high cost of equipment or time-consuming data processing. Moreover, objective methods of assessment with use of photographs could not standardize colors of the differently shadowed areas of the body existing in the same picture, making the analysis incomprehensive [13, 14].

Therefore, there is a need for a comprehensive, automated, cheap, and clinically applicable objective method to evaluate the outcome and prognosis after treatment. Digital photography with computer analysis appears to be a promising technology for the evaluation of birthmarks. Digital photography and computers are affordable and widely available in clinical practice. Photographs are routine documentation of results after laser treatment. Taking photographs avoids contact with the skin, thereby avoiding the problem of birthmark blanching seen with some techniques. However, most cameras use color presentation, called red green blue. This format is dependent on lightning conditions, device, and focusing; thus, it does not allow accurate comparison of the pre- and postoperative photographs.

This problem has been previously overcome by converting pixels in the picture into Commission International d’Eclairage (CIE) L*a*b* color scale [15]. CIELab [L*, luminance; a*, balance between green (−) and red (+); and b*, balance between blue (−) and yellow (+)] was intended to provide a standard, approximately uniform color scale which could easily compare color values [16]. CIELab is the most complete color presentation, and it describes the whole variety of colors visible to the human eye. It is claimed to be the reference as the light- and device-independent model. However, the results are still affected by the differences in shadowing between the healthy skin and the birthmark in one picture. Therefore, the studies assessing usefulness of digital two-dimensional color photographs only with CIELab were only partially successful [13, 14].

Innovative additional manipulation of skin shadowing with use of hue saturation value (HSV) color presentation can provide a validated method for objective assessment [17]. HSV is highly compatible with visual perception. Differentiation in color characteristics is useful in image processing. Therefore, HSV can be used to highlight homologously shadowed areas of the lesion and healthy skin. The present study aims to present a reliable method of objective evaluation of results after laser treatment for birthmarks, which can be clinically applicable for everyday surgical practice.

Patients and methods

For the study, we included 48 adults treated for birthmarks on the face or neck using pulsed dye laser in the Plastic and Reconstructive Surgery Department, St John’s Hospital, Livingston, UK. The inclusion criteria of the retrospective analysis were the following: birthmarks on the face or neck, patients with follow-up period at least 1 year after final stage of treatment, and pre- and posttreatment digital color photographs taken only by the professional medical photographer at the Medical Photography Department in order to reduce bias to minimum. Laser treatments applied for birthmarks cause posttreatment bruising/contusion that may last for several days; thus, the outcome should be assessed at least 1 month after the treatment. In our study, we used posttreatment photographs taken during the follow-up appointment at least 1 year after the final stage of laser application.

Each pre- and postoperative photograph was subjected to evaluation by three core physicians (consultants in plastic surgery working with lasers on everyday basis) and three laypeople as well as to computerized, triple objective comparative assessment of color shading with use of the program Skin Lesion Color Change (SLCC). In all three instances, the treatment outcome was presented as percentage of success (%), where 0 % corresponded to total failure, and 100 % described total success [6]. Moreover, we performed detailed analysis in relation to location as divided by Leonardo da Vinci: forehead, location 1; middle face, location 2; lower face, location 3; and neck, location 4.

Objective computer assessment

The computer program SLCC has been created by one of the authors (PS) and is copyright protected (Fig. 1). It uses combined color presentation of CIELab with HSV. Algorithm for calculation of treatment results in SLCC consists of two main steps. The first one is conversion of color scale from red green blue (RGB) to the combined scale of CIELab and HSV through transitional standardized RGB (sRGB) and CIE XYZ color presentations (Fig. 2). The above conversion is performed in both, pre- and postoperative photographs. In the second step, the program calculates the degree of color shading between the lesion and healthy skin from the standardized, modified pre- (ΔE) and postoperative (ΔE′) photographs (\( \varDelta \mathrm{E}={{\left[ {{{{\left( {\varDelta {{\mathrm{L}}^{*}}} \right)}}^2}+{{{\left( {\varDelta {{\mathrm{a}}^{*}}} \right)}}^2}+{{{\left( {\varDelta {{\mathrm{b}}^{*}}} \right)}}^2}} \right]}^{{{1 \left/ {2} \right.}}}}\to \mathrm{Result}=\left( {{{{1-\varDelta \mathrm{E}\prime }} \left/ {{\varDelta \mathrm{E}}} \right.}} \right)\times 100\,\% \)). To eliminate any outside variances such as skin tanning/lightening due to seasonal weather changes, as well as to further compensate for any ambient variances from one image to another, the patient’s normal skin served as a control/reference when calculating PWS color, as previously described [14].

Fig. 1
figure 1

Main screen of the SLCC program, presenting the process of analyzing the outcome of the laser treatment for birthmarks; a preoperative period, b state after last stage of treatment

Fig. 2
figure 2

Algorithm for conversion of color space from RGB to combined coordinates of CIELab and HSV, RGB, sRGB, CIE XYZ, CIELab, and HSV; different color presentations, detailed description in text

The clinical usefulness of the method used in the program was assessed with intra- and interexaminer reliability of the results (in other words, repeatability and accuracy, respectively) [18]. Repeatability was analyzed by comparing triple evaluations of the same lesion with the specific method. Accuracy of subjective methods of assessment, by core physicians and by laypeople, was compared to results obtained with SLCC. Both accuracy and repeatability were presented as technical error measurement (TEM) and showed the potential variations in measuring the specific parameter (in units; here, in percentage). Reliability (R) was described using TEM and standard deviation of all single measurements, with scale of values from 0 weakest to 1 strongest.

Statistical tests were used to analyze the results. The normality of the groups was assessed with Shapiro–Wilk test. The parameters were compared with t test for dependent samples, when assessing two groups for accuracy, and with MANOVA, when assessing three groups for repeatability. The difference was claimed as statistically significant when p < 0.05.

Results and discussion

In the study, we compared objective assessment with the computer program of vascular birthmarks treated with laser, with the subjective assessment done by core physicians and by laypeople. Our results confirmed the usefulness of the proposed SLCC method. The detailed layout of the results is presented in Table 1.

Table 1 The detailed layout of analyses of the treatment results, performed by core physicians, laypeople, and with SLCC

The general part of the analysis revealed that the median outcome of the treatment was 61 % when assessed with SLCC; 76.67 % was evaluated by core physicians, and 83.33 %, by laypeople. The variability of the results was similar in analysis done by core physicians and with SLCC (30.29 and 30.38 %, respectively), and was higher than that by laypeople (26.08 %). When assessed with SLCC method, most patients benefited from treatment at the level from 60 to 70 %, whereas in core physicians’ opinion, the average level of treatment success was between 70 and 80 %. Distribution of the treatment outcomes assessed with SLCC and by core physicians is presented in Fig. 3. The results were in line with the previous reports, where the most commonly seen outcome after laser treatment of birthmarks was from 60 to 80 % [1, 12, 13]. However, the results of assessment with each of three methods were different with one another; thus, a more detailed analysis of reliability was undertaken.

Fig. 3
figure 3

Distribution of treatment outcomes among patients, assessed with SLCC and by core physicians

Validation of usefulness of the three methods was performed in the study with use of parameters TEM and R, characterizing repeatability and accuracy. The repeatability of results was higher with use of computer program in SLCC method (R = 0.99) than among core physicians (R = 0.81) or among laypeople (R = 0.63) (Fig. 4). In another words, the results obtained after multiple objective assessments were almost identical to one another, when assessed with SLCC (TEM = 1.49 %, p > 0.05). Additionally, they were less similar among core physicians (TEM = 9.91 %, p < 0.01) and differed even more among laypeople (TEM = 13.84 %, p < 0.0001). It is difficult to compare our results to previous studies because other authors used only correlation coefficients to describe the interexaminer reliability, which narrowed the possibilities to validate their analysis [13, 14].

Fig. 4
figure 4

Repeatability [presented as reliability (R)] of the results obtained by core physicians, laypeople, and with SLCC

The accuracy of the results was also taken into consideration in our study. There was a stronger correlation of the results between objective assessment with SLCC and subjective evaluation performed by core physicians (R = 0.54) than between objective assessment with SLCC and subjective evaluation done by laypeople (R = 0.29) (Fig. 5). It means that the results calculated with the computer program and SLCC were less different to the assessment done by core physicians (TEM = 13.81 %, p < 0.0001) than by laypeople (TEM = 17.29 %, p < 0.0001). Both results obtained with SLCC and by core physicians were characterized by similar distribution of treatment outcomes (Fig. 3). However, it should be noted that the core physicians gave slightly more favorable opinion concerning the treatment results than SLCC. Similarly, the previously described studies concerning usefulness of CIELab in assessing laser treatment could not be taken into comparison because they used only correlation coefficient [13, 14].

Fig. 5
figure 5

Accuracy (presented as reliability) of the results obtained by core physicians, laypeople, and with SLCC

The detailed analysis in relation to the location revealed that core physicians, laypeople, and SLCC differed in results in location 2 (71.67, 80.00, and 57.00 %, respectively; p < 0.05), location 3 (82.50, 85.00, and 64.00 %, respectively; p < 0.05), and location 4 (78.33, 83.33, and 69.50 %, respectively; p < 0.05), whereas the outcomes were assessed similarly in the case of lesions in location 1 (70.00, 78.33, and 73.00 %, respectively; p > 0.05) (Table 1). Generally, core physicians gave higher scores in most areas, and laypeople were even more optimistic. This corresponds well to the previous reports [13, 14].

In relation to lesion location, analysis of accuracy of results obtained with SLCC objective method in comparison to the subjective methods revealed that birthmarks in location 1 were analyzed by core physicians (R = 0.79, TEM = 7.39 %, p > 0.05) and by laypeople (R = 0.91, TEM = 3.51 %, p > 0.05) similarly to SLCC method; the results did not differ significantly from analysis with the computer method. In contrast, in the case of lesions present in locations 2, 3, and 4, the accuracy of subjective methods was significantly lower than SLCC (p < 0.05) (Table 1). It is noticeable that the results of treatment of lesions localized on the forehead were almost identical in three groups (Fig. 6). We suggest that the accurate subjective assessment of lesions localized on the forehead was possible because of the usually good light exposure of this area of the face with almost absent skin shadowing. In conclusion, the SLCC method may be considered for objective assessment of the results after laser treatment for vascular and pigmented birthmarks.

Fig. 6
figure 6

Comparative analysis of the treatment outcomes in relation to the location of the birthmark