Editorial Type:
Article Category: Research Article
 | 
Online Publication Date: 01 Mar 2008

Validation and Reliability of Visual Assessment with a Shade Guide for Tooth-Color Classification

,
,
,
, and
Page Range: 121 – 126
DOI: 10.2341/07-71
Save
Download PDF

SUMMARY

This study evaluated the validity and reliability of the visual assessment of tooth color using a commercial shade guide. Ninety-two individuals were randomly selected from subjects enrolled in a randomized controlled trial comparing two formulations of carbamide peroxide. Initially, each individual had the color of his or her six maxillary anterior teeth (n=552) determined by one examiner using a digital spectrophotometer (Vita Easyshade). Then, a visual assessment was made by two calibrated examiners using a shade guide (Vitapan Classical). Reliability was evaluated either by kappa coefficient when all the colors were grouped into two categories (dark and light) or by weighted kappa when all 16 colors of the shade guide were considered. The digital spectrophotometer (Vita Easyshade) was used as the gold standard to calculate sensitivity and specificity of the visual assessment. The reliability of the visual assessment using two color categories was “substantial” (k=0.69). When all colors of the shade guide were considered, the reliability was “mild” (kw=0.33). Sensitivity and specificity of the visual assessment with respect to the gold standard was 86.9% and 81.9%, respectively. Despite its subjectivity, visual assessment of tooth color using the Vitapan Classical shade guide is a valid method, with good reliability for differentiating between dark and light colors.

INTRODUCTION

Concerns about dental aesthetics have increased substantially in recent years, including concerns about tooth whiteness.1–3 Such aesthetic concerns, along with a range of options to treat discolored teeth (such as bleaching, metal free ceramic restorations, direct or indirect composite restorations), highlight the need for correct color classification to compare the effectiveness of these various treatments.

Tooth color is determined by a combination of extrinsic and intrinsic factors.4–5 Extrinsic factors are related to the deposition of either food or beverage stains over the enamel and the acquired pellicle.6–7 Intrinsic factors are related to the enamel and dentin properties of reflection and light dispersion. Tooth color is strongly influenced by these dentin properties. Different methodological approaches have been used to determine the tooth's color and changes in tooth color, which occur during bleaching treatments.8 Color classification, using a standard shade guide (for example, Vitapan Classical shade guide, Vita-Zahnfabrik, Bad Säckingen, Germany) with the buccal aspects of teeth, is the most frequent method used to compare the restored color with natural tooth color.29 Subjectivity and other factors (the clinical experience of the examiner, fatigue of the human eye and room decoration)8 can affect tooth color classification using this standard method. Control and standardization of these factors, together with good training by the examiner, can improve the ability to accurately classify tooth color in a clinical context.68

Recently, digital systems (spectrophotometers,10 colorimeters8 or digital cameras)11 have been used to measure tooth color. Within these systems, color is expressed in CIEL*a*b* space, which provides its specification in three dimensions and allows for more accurate assessments.912 These digital systems are precise instruments that produce highly reliable, easily evaluated results in terms of visual importance.13 High cost and complex operation, however, restricts use of these digital systems to laboratory or clinical research.1014

Although less accurate than digital systems, visual assessment with a shade guide is still the most common method used for color classification in longitudinal studies involving bleaching agents.1–2 Nonetheless, there is a lack of peer-reviewed literature regarding validation of this method. The objective of this study was to investigate the validity and reliability of tooth color measurement using visual assessment with a shade guide.

METHODS AND MATERIALS

This research protocol was approved by the Ethics Committee of the Federal University of Pelotas. Ninety-two individuals (age 18–55 years) were selected from those enrolled in a randomized controlled trial comparing two different concentrations of carbamide peroxide. The color of the six maxillary anterior teeth was assessed for each participant.

Two senior dental students were selected to perform the visual assessment using the shade guide. These examiners were trained in two phases. In the first training phase, they attended a two-hour training session during which the basic concepts of color, its components and the factors that may influence its determination were presented. Two methods that measure color were presented, the Vitapan Classical shade guide (Vita-Zahnfabrik, Bad Säckingen, Germany) and the Vita Easyshade digital spectrophotometer (Vita-Zahnfabrik). A series of slides showing the buccal face of the six maxillary anterior teeth (with different colors) were exhibited to the visual assessment examiners, both of whom were asked to select the corresponding color in the shade guide. In the second training phase, the visual assessment examiners were asked to determine the tooth color of the maxillary incisors and canines (n=96) of 16 volunteers selected among students and staff of the dental school. The color was classified objectively using the digital spectrophotometer (by the supervisor of the study) and subjectively using the Vitapan Classical shade guide (by the visual assessment examiners). These procedures lasted two days (n=48/day) and were performed in the afternoon with sunlight and room illumination and without any communication between the examiners.

The colors in the shade guide were organized from B1 (lightest color) to C4 (darkest color) (Figure 1). All 16 of the volunteers who participated in the second training phase were evaluated by both examiners. Each examiner had a clinical assistant to record the selected color classification codes into standardized forms.

Figure 1. Color scores by sequence of colors in the Vitapan Classical shade guide (from lighter to darker colors).Figure 1. Color scores by sequence of colors in the Vitapan Classical shade guide (from lighter to darker colors).Figure 1. Color scores by sequence of colors in the Vitapan Classical shade guide (from lighter to darker colors).
Figure 1 Color scores by sequence of colors in the Vitapan Classical shade guide (from lighter to darker colors).

Citation: Operative Dentistry 33, 2; 10.2341/07-71

Using digital spectrophotometric analysis (Vita Easyshade) as the gold standard, three measurements were made, with the active point of the instrument focused on the middle third of each tooth. A mean color of the three measurements was calculated. The digital spectrophotometric system used the CIEL*a*b* system, allowing for the determination of color in three-dimensional space. The L* value is a measure of the lightness of an object. The a* value is a measure of red (positive a*) or green (negative a*). The b* value is a measure of yellow (positive b*) or blue (negative b*). The color difference between the color coordinates is calculated as: Δ E* = [(Δ L*)2 + (Δ a*)2 + (Δ b*)2]½.15 The colors were numbered from 1 (B1) to 16 (C4) for statistical analysis.16

After completing the training process, matrices were prepared to compare the degree of agreement between the examiners and the gold standard. Then, the kappa coefficient (k) was calculated using either all of the colors of the shade guide (weighted kappa) or they were grouped according to chroma (simple kappa), which represents the degree of saturation of color. For kappa calculation purposes, if the gold standard classified the color as A3.5, then the chroma value was recorded as 3. The decision to group the results according to chroma was made due to the difficulty in obtaining a good degree of agreement between the examiners and the gold standard when using all 16 colors of the Vitapan Classical shade guide.

Thirteen clinical sessions in 13 days were necessary to train the examiners. The field study was initiated after both examiners achieved agreement with the gold standard at greater than 70%, according to the grouping by chroma.

Prior to data collection, each patient was examined and submitted to a prophylaxis to remove extrinsic staining in advance of clinical examination. To be included in the randomized controlled trial, the patients needed to have a color of C1 or darker (mean color of the six maxillary anterior teeth) (Figure 1). Other inclusion criteria were the absence of composite restorations that involved more than 1/6 of the buccal surface and the absence of caries or periodontal disease. The study excluded pregnant or lactating women, smokers, patients with a previous history of dental hypersensitivity, individuals with tetracycline-stained teeth or those with teeth presenting fluorosis or endodontic treatment.

The digital spectrophotometric and visual assessment followed the same steps that were applied for procedures used in the training. To calculate reliability, different analyses were performed. First, the teeth were clustered into four groups (1, 2, 3 and 4) by chroma. Then, the teeth were grouped into two categories: A3-C4 (darker colors) and B1-D4 (lighter colors). For these analyses, the agreement was calculated using the simple kappa coefficient, comparing the observed concordance with the expected concordance.17 The weighted kappa (kw) was used when all 16 colors of the Vitapan Classical shade guide were taken into consideration. The weights of kw were established in the following manner. A weight of 1 represented total agreement (when the color selected by the visual assessment matched exactly the color determined by the gold standard). A weight of 0.5 was assigned when the visual assessment color was immediately above or below the one determined by the gold standard. A “zero” weight was assigned to all the other situations. Both examiners were blind to the assessments of the other and the results from digital spectrophotometric analysis.

The interpretation of kappa values was set as follows:18 “almost perfect” (0.8–1.0), “substantial” (0.60–0.79), “moderate” (0.40–0.59), “mild” (0.20–0.39), “poor” (0–0.19) and “no agreement” (<0.0). To evaluate the validity of the visual assessment of color using the shade guide, measurements from the visual assessment were compared to measurements from the digital spectrophotometric analysis. A contingency table was prepared by dividing the colors into two groups: A3–C4 (darker colors) and B1-D4 (lighter colors). The colors evaluated as darker were considered to be a “positive test” result, and those evaluated as lighter were considered a “negative test” result. Then, it was possible to calculate the sensitivity, specificity, positive predictive value (PPV), negative predictive value (PNV) and accuracy.19

For the 16 colors of the Vita scale, the difference between the two methods and the association between them was calculated using the paired Wilcoxon test and the Spearman coefficient (rho). A Bland-Altman plot was constructed to detect measurement bias. The statistical significance level was set at 0.05.

RESULTS

The mean age of the study population (92 individuals) was 25.3 years, with a standard deviation (SD) of 7.9. Two-thirds of the study population consisted of women. Mean (SD) and median values of tooth colors obtained by spectrophotometric analysis were 8.78 (4.4) and 11, respectively. The corresponding values for the visual assessment were 8.60 (4.2) and 10, respectively (Table 1).

Table 1 Means and standard deviations (SD) for the visual assessment and digital spectrophotometric analyses, difference between analyses (visual assessment—digital spectrophotometric) and weighted kappa considering the 16 colors of the Vita scale.
Table 1

The difference (−0.18; 95% CI: −0.44–0.08) between the visual assessment and digital spectrophotometric analysis was not statistically significant (p=0.07) when considering the 16 colors. The Bland-Altman plot (Figure 2) showed that most of the observations were situated between ± 2 SDs of the mean difference around the middle line, which represents an absence of difference between the two methods and indicates that the analysis was not affected by measurement bias. The Spearman's coefficient of correlation (rho) showed a high correlation between the two methods of tooth color measurement (rho=0.72).

Figure 2. Bland-Altman plots using the 16 colors of the Vitapan Classical shade guide, comparing visual assessment and digital spectrophotometric analysis. The middle line is the average of the mean difference between the visual assessment and digital spectrophotometric analysis. The upper and lower lines represent the 95% confidence interval around the mean differences.Figure 2. Bland-Altman plots using the 16 colors of the Vitapan Classical shade guide, comparing visual assessment and digital spectrophotometric analysis. The middle line is the average of the mean difference between the visual assessment and digital spectrophotometric analysis. The upper and lower lines represent the 95% confidence interval around the mean differences.Figure 2. Bland-Altman plots using the 16 colors of the Vitapan Classical shade guide, comparing visual assessment and digital spectrophotometric analysis. The middle line is the average of the mean difference between the visual assessment and digital spectrophotometric analysis. The upper and lower lines represent the 95% confidence interval around the mean differences.
Figure 2 Bland-Altman plots using the 16 colors of the Vitapan Classical shade guide, comparing visual assessment and digital spectrophotometric analysis. The middle line is the average of the mean difference between the visual assessment and digital spectrophotometric analysis. The upper and lower lines represent the 95% confidence interval around the mean differences.

Citation: Operative Dentistry 33, 2; 10.2341/07-71

“Mild” agreement (kw=0.33) between examiners and the gold standard was observed when considering all 16 colors of the Vita scale (Table 1). With color grouping by chroma, reliability still remained “mild” (k=0.39). However, when classified into two categories (light and dark colors), the kappa value increased to 0.69, denoting “substantial” agreement (Table 2).

Table 2 Visual Assessment Using Shade Guide Results Compared to the Gold Standard
Table 2

The sensitivity of the visual assessment method was 86.9% (95% CI: 84.1–89.7), showing that the majority of teeth with darker colors were detected. Similarly, specificity was high—81.9% (95% CI: 78.7–85.1)—identifying a high rate of true negatives (light colors) (Table 2).

Among the 320 teeth classified as positives (dark teeth) by the examiners, 278 were true positives (PPV=86.9%; 95% CI: 84.1–89.7). The number of true negative values was 190 (PNV=81.9%; 95% CI 78.7–85.1). The accuracy of the test was 84.8% (95% CI: 81.8–87.8).

DISCUSSION

The correct interpretation of tooth color plays a fundamental role in decisions about the need for aesthetic treatments. It is also essential to assess changes in color that occur during bleaching procedures. Among the various methods used to classify tooth color, the Vita shade guide is the most frequently used,120 which justifies use of this guide in the current study.

Although visual assessment using the Vitapan Classical shade guide is used widely, visual assessment depends on several variables,621 including the source of illumination, the characteristics of the tooth and variation in observer training and experience.48 Few studies have attempted to validate this method.21

Factors inherent to the Vitapan Classical shade guide itself also can limit the correct determination of the tooth color. For example, the range of colors available in the guide does not cover all possible tooth colors. As a consequence, the results cannot be exactly converted to the CIEL*a*b* space.4 The high spectrum of colors (16) presented in the shade guide increased the chance of an error when compared to the digital spectrophotometric analysis.

When considering 16 colors, the agreement between visual assessment by the examiners and digital spectrophotometer was low (kw=0.33). When assessing agreement for individual teeth separately, the kappa was even lower. Such a finding could be explained by the predominance of shades of color situated in the medium spectrum of the shade guide, impairing detecting small color changes by the examiners. Peres and others22 standardized the training of examiners for assessing the amount of dental caries and demonstrated that using means may lead to an overestimation of reliability. Therefore, individual teeth, instead of the means, was recommended as a unit of analysis.

The findings of the current study showed that the examiners had difficulty identifying exact colors (as determined by the gold standard) or colors immediately adjacent on the Vitapan Classical shade scale. In a previous study, Guan and others23 observed a positive correlation (r=0.94) using the Spearman correlation coefficient between the digital spectrophotometer and visual assessment methods of tooth color using extracted teeth. That study, however, tested this correlation in extracted teeth, while in the current study, the comparison was made using intact teeth. In fact, the environmental conditions that surround extracted teeth cannot be fully compared to the conditions found when examining teeth in the mouth of a patient.

In the current study, when colors were grouped according to chroma, it was possible to detect an important improvement in reliability. A similar finding was reported by Hugo and others.9 Those authors divided the colors of the Vita shade guide into four categories according to hue. Visual assessments were performed by three examiners and compared against three digital methods. Agreement among the three different examiners was 52.9%, which was better than the agreement for each digital method compared to the group as a whole. As found in the current study, a reduction in the number of options probably had helped to improve agreement between examiners, as well as between examiners and the gold standard.

Several clinical trials on bleaching treatment have used two classification methods (visual assessment vs digital assessment) to determine tooth color.1–211 Such trials, however, have not reported the reliability and validity of the methods employed. Based on the results of the current study, it was observed that visual assessment with the shade guide was a valid method and that it was capable of identifying darkened teeth when they were present (sensitivity=86.9%) and identified true lighter teeth (specificity=81.9%). Therefore, it was demonstrated that well-trained examiners attained suitable skill to differentiate individuals who actually need bleaching treatment from those who did not.

The findings of the current study also highlight the requirement of intense training before applying visual assessment using the Vitapan shade guide in laboratory or clinical studies.813 Nevertheless, for routine daily use in dental clinics, the classification of tooth color using shade guides seems to be adequate and sufficient, providing trained dentists with a reliable method to discriminate between light and darkened teeth.

CONCLUSIONS

The results of this study indicate that visual assessment using a shade guide is a valid and reliable method to differentiate light from dark colors of teeth.

Acknowledgments

The authors thank the Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) for its financial support (scholarship). Also, the authors thank Mr Ferdinan Leida, Luís Coimbra and Marcus Conde, as well as Mrs Sani Heckmann and Sílvia Fontes for their contributions to the development of this study. In addition, the authors acknowledge Dr Christine Erdmann, Department of Epidemiology, School of Public Health, University of Michigan, for her assistance with manuscript revision.

References

  • 1
    Kihn, P.
    ,
    D. M.Barnes
    ,
    E.Romberg
    , and
    K.Peterson
    . 2000. A clinical evaluation of 10 percent vs 15 percent carbamide peroxide tooth-whitening agents.Journal of the American Dental Association131
    10
    :14781484.
  • 2
    Matis, B. A.
    ,
    H. N.Mousa
    ,
    M. A.Cochran
    , and
    G. J.Eckert
    . 2000. Clinical evaluation of bleaching agents of different concentrations.Quintessence International31
    5
    :303310.
  • 3
    Haywood, V. B.
    2003. Frequently asked questions about bleaching.Compendium of Continuing Education in Dentistry24
    4A
    :324337.
  • 4
    Joiner, A.
    2004. Tooth colour: A review of the literature.Journal of Dentistry32
    Supplement 1
    :312.
  • 5
    Walsh, T. F.
    ,
    A.Rawlinson
    ,
    D.Wildgoose
    ,
    I.Marlow
    ,
    J.Haywood
    , and
    J. M.Ward
    . 2005. Clinical evaluation of the stain removing ability of a whitening dentifrice and stain controlling system.Journal of Dentistry33
    5
    :413418.
  • 6
    Watts, A.
    and
    M.Addy
    . 2001. Tooth discolouration and staining: A review of the literature.British Dental Journal190
    6
    :309316.
  • 7
    Dahl, J. E.
    and
    U.Pallesen
    . 2003. Tooth bleaching—a critical review of the biological aspects.Critical Reviews in Oral Biology and Medicine14
    4
    :292304.
  • 8
    Joiner, A.
    2006. The bleaching of teeth: A review of the literature.Journal of Dentistry34
    7
    :412419.
  • 9
    Hugo, B.
    ,
    T.Witzel
    , and
    B.Klaiber
    . 2005. Comparison of in vivo visual and computer-aided tooth shade determination.Clinical Oral Investigation9
    4
    :244250.
  • 10
    Braun, A.
    ,
    S.Jepsen
    , and
    F.Krause
    . 2007. Spectrophotometric and visual evaluation of vital tooth bleaching employing different carbamide peroxide concentrations.Dental Materials23
    2
    :165169.
  • 11
    Wee, A. G.
    ,
    D. T.Lindsey
    ,
    S.Kuo
    , and
    W. M.Johnston
    . 2006. Color accuracy of commercial digital cameras for use in dentistry.Dental Materials22
    6
    :553559.
  • 12
    Mokhlis, G. R.
    ,
    B. A.Matis
    ,
    M. A.Cochran
    , and
    G. J.Eckert
    . 2000. A clinical evaluation of carbamide peroxide and hydrogen peroxide whitening agents during daytime use.Journal of the American Dental Association131
    9
    :12691277.
  • 13
    Okubo, S. R.
    ,
    A.Kanawati
    ,
    M. W.Richards
    , and
    S.Childressd
    . 1998. Evaluation of visual and instrument shade matching.The Journal of Prosthetic Dentistry80
    6
    :642648.
  • 14
    Tung, F. F.
    ,
    G. R.Goldstein
    ,
    S.Jang
    , and
    E.Hittelman
    . 2002. The repeatability of an intraoral dental colorimeter.Journal of Prosthetic Dentistry88
    6
    :585590.
  • 15
    Commission Internationale De L'Eclairage 1978. Recommendations on uniform colour spaces, colour difference equations and psychometric colour terms.15
    Supplement 2
    :
    Paris
    Bureau Central de la CIE
    .
  • 16
    Nathoo, S. A.
    ,
    B.Stewart
    ,
    Y. P.Zhang
    ,
    P.Chaknis
    ,
    K. N.Rustogi
    ,
    W.Devizio
    ,
    M.Petrone
    , and
    A. R.Volpe
    . 2002. Efficacy of a novel, non-tray, paint-on 18% carbamide peroxide whitening gel.Compendium of Continuing Education in Dentistry23
    11
    :2631.
  • 17
    Fleiss, J. L.
    1981. Statistical Methods for Rates and Proportions,
    2nd ed
    .
    New York
    .
  • 18
    Landis, J. R.
    and
    G. G.Koch
    . 1977. The measurement of observer agreement for categorical data.Biometrics33
    1
    :159174.
  • 19
    Fletcher, R. H.
    ,
    S. W.Fletcher
    , and
    E. H.Wagner
    . 1996. Epidemiologia Clínica: Elementos Essenciais Artes Médicas.
    Porto Alegre
    . p. 398.
  • 20
    Paul, S.
    ,
    A.Peter
    ,
    N.Pietrobon
    , and
    C. H. F.Hämmerle
    . 2002. Visual and spectrophotometric shade analysis of human teeth.Journal of Dental Research81
    8
    :578582.
  • 21
    Hassel, A. J.
    ,
    U.Koke
    ,
    M.Schmitter
    ,
    J.Beck
    , and
    P.Rammelsberg
    . 2005. Clinical effect of different shade guide systems on the tooth shades of ceramic-veneered restorations.International Journal of Prosthodontics18
    5
    :422426.
  • 22
    Peres, M. A.
    ,
    J. L.Traebert
    , and
    W.Marcenes
    . 2001. [Calibração de examinadores para estudos epidemiológicos de cárie dentária].Caderno de Saúde Pública17
    1
    :153159.
  • 23
    Guan, Y. H.
    ,
    D. L.Lath
    ,
    T. H.Lilley
    ,
    D. R.Willmot
    ,
    I.Marlow
    , and
    A. H.Brook
    . 2005. The measurement of tooth whiteness by image analysis and spectrophotometry: A comparison.Journal of Oral Rehabilitation32
    1
    :715.
Copyright: Copyright: © 2008 This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL. 2008
Figure 1
Figure 1

Color scores by sequence of colors in the Vitapan Classical shade guide (from lighter to darker colors).


Figure 2
Figure 2

Bland-Altman plots using the 16 colors of the Vitapan Classical shade guide, comparing visual assessment and digital spectrophotometric analysis. The middle line is the average of the mean difference between the visual assessment and digital spectrophotometric analysis. The upper and lower lines represent the 95% confidence interval around the mean differences.


Contributor Notes

Sônia Saeger Meireles, DDS, MS, PhD student in Operative Dentistry, Federal University of Pelotas, Pelotas, RS, Brazil

Flávio Fernando Demarco, DDS, PhD, associate professor, Department of Operative Dentistry, Federal University of Pelotas, Pelotas, RS, Brazil

Iná da Silva dos Santos, MD, MS, PhD, professor, Departamento de Medicina Social, Federal University of Pelotas, Pelotas, RS, Brazil

Samuel de Carvalho Dumith, MS, PhD student in Epidemiology, Federal University of Pelotas, Pelotas, RS, Brazil

Álvaro Della Bona, DDS, MS, PhD, professor, School of Dentistry, University of Passo Fundo, Passo Fundo, RS, Brazil

*Reprint request: Programa de Pós-Graduação em Odontologia, FO/UFPel, Rua Gonçalves Chaves, 457 Sala 504 Centro, Pelotas/RS, CEP: 96015-560; e-mail: flavio.demarco@pq.cnpq.br
Received: 10 Apr 2007
  • Download PDF