Background

Understanding cognitive reactions after exposure to graphic health warnings (GHWs), is crucial when considering the design and format of GHWs. This review focuses on perceived harm reactions that are operationalized in the literature as thinking about the risks of smoking [1], perceived likelihood of harm from smoking [2] and identifying that smoking causes tobacco related diseases [3], as well as quitting intentions, that are measured as the intention to quit smoking in a certain period of time (i.e. next week) [4] or the extent to which health warning labels make the person more likely to quit smoking [5].

The literature offers theoretical explanations of health risk communications that explain GHWs constructs (perceived risk, cessation intention, smoking behavior) and their relationships [6]. For instance, the extended parallel process model (EPPM) indicates that if a high level of severity from engaging in certain unhealthy behaviors was perceived, individuals will be motivated to behave in order to avoid such risks [7]. Similarly, if individuals hold the belief that they are capable of changing their behavior (perceived self-efficacy) as well as their risk of negative outcomes (perceived response efficacy), the healthier behavior will be motivated to happen. Thus, the EPPM suggests that GHWs messaging that aims to increase smokers’ motivation to quit should include convey messages that contain both threat and efficacy [6].

The WHO Framework Convention on Tobacco Control (FCTC) specifies the importance of labelling health warnings describing the harmful effects of tobacco use in Article 11 [8] but did not provide detailed instructions on the design and format of GHWs apart from generic descriptions of being “large, clear, visible and legible” (p. 10). Depending on the design and format of GHWs, they can have varying effects on cognitive reactions (including perceived risks and quit intentions). The main GHW variable evaluated is format, which includes text vs. graphic warning (or a combination of both), size of the warning on package (i.e. 50%), location (i.e. front of the package) and color. A recent study indicated that a combined disclosure format (text with low/high emotion images) increased risk perceptions and quit intentions relative to text-only [9]. Other evidence suggests that emotionally evocative images depicting hazards (i.e. GHWs proposed by the US FDA) are not effective in communicating wider smoking risks when compared to text featuring irrelevant pictures (e.g. depictions of a car accident), or text-only [10].

Other contradictions are evident for message framing. Positive versus negative framing of tobacco warning messages have been recently examined, contrasting the effects of GHWs communicating the harmful effects of smoking (negative framing) or the benefits of quitting (positive framing). Studies that investigated text message framing effects of tobacco health warnings have found greater efficacy when messages are framed in the negative [11, 12]. However, others identify support for the use of positive message framing when targeting illness prevention behaviors including smoking cessation [13] and yet others have identified reactance to negative messages (fear appeals) for smoker groups [14]. Examination of a wider evidence base for message framing indicates support for the effectiveness of positively framed messages to invoke behavioral change. Examination of 14 direct tests of positive versus negative appeals indicate that 11 studies offer evidence that positive appeals are stronger when compared to negative approaches [14,15,16,17,18,19,20,21,22,23,24]. Hence there remains some uncertainty regarding the role of alternate message framing formats that may further enhance the effectiveness of GHWs. A systematic and comprehensive review of message framing and GHW format is required.

Recent literature reviews have reported the effects of strengthened GHW formats (i.e. larger in size and message) on knowledge and attitudes, attention [25], active smoker’s behavior [26], and young adults including adolescents [27, 28]. Noar et al’s review has a similar focus on longitudinal studies and examines several of the outcomes of interest in the current study. Yet, there is no recent systematic review that has examined the impact of different formats of GHWs on communicating smoking health risks and quit intentions. Because of the paucity of evidence from independent sources, it is important to look at evidence from peer-reviewed journals reporting empirical data. Therefore, this review aims to systematically identify and synthesize evidence from longitudinal studies that have examined GHW formats and their effect on perceived risk of tobacco use and quit intentions of both smokers and non-smokers. The research question of this review is:

What warning and/or disclosure formats are optimal for communicating the risk of smoking on health and quit intentions?

Method

The PRISMA guidelines [29] were followed to conduct the systematic search in order to ensure completeness and transparency.

Key search terms

The following search terms were used for this review:

tobacco OR cigar* OR bidis OR beedi OR e-cigar* OR heat-not-burn OR shisha OR “heat not burn” OR “roll your own” OR roll-your-own

AND.

warning* OR pictorial OR graphic OR packag*

AND.

intervention* OR evaluation OR trial OR campaign* OR program* OR experiment* OR effect* OR impact*

The asterisk allowed for the inclusion of term variations (e.g., singular vs plural).

Databases

Ten databases were systematically searched by three researchers (J.K., T.S., P.S.) for relevant records in December 2017 and again in September 2019. Database names and the number of records retrieved from each database are shown in Table 1. Variation in the number of records retrieved from the different databases is explained by the size and the subject specialization of each database.

Table 1 Number of records retrieved by database

The combined total of records downloaded from all databases was 3430. All downloaded records were imported into EndNote X9. From the initial records collected, 2071 duplicate records were removed (first by EndNote, then by reviewers T.S. and P.S.), leaving 1359 unique sources.

All downloaded records were imported into Endnote X9. After duplications were removed, two reviewers (T.S., P.S.) reviewed titles and abstracts of the remaining papers were reviewed for eligibility. Any papers that were

  • not peer-reviewed (e.g., newspapers, theses, or conference proceedings);

  • not written in English;

  • not tobacco focused;

  • not GHW focused;

  • systematic literature reviews;

  • cross-sectional studies with only one time point

were excluded.

Longitudinal studies with empirical data that consisted of the following information were included:

  • Published in peer-review journals;

  • Published between 2008 and 2019, only studies that were published in the past 10 y were included to ensure the recency of this review;

  • Reporting detailed information on the formats of GHWs;

  • Reporting on outcomes pertaining to either: communicating the risk of smoking on health, or the benefits of quitting, or quit intention.

Selected studies were qualified for inclusion considering the operationalization of perceived risk and quit intentions, given the diversity of measurements for each behavioral outcome. Perceived risks were operationalized as thinking about the risks of smoking, perceived likelihood of harm from smoking and identifying that smoking causes tobacco related diseases. Quit intentions, were operationalized as the intention to quit smoking in a certain period of time or the extent to which health warning labels make the person more likely to quit smoking.

Data extraction and coding framework

A coding framework was developed to enable a standardized method for extraction of the following information from qualified records, as shown in Table 2.

  • Bibliographic information, such as authors, title, and publication year;

  • Study characteristics, such as study location, study design, and sample size;

  • Formats of GHWs, such as the size and location of graphics;

  • Outcome evaluation, such as the effectiveness of GHWs on communicating the risk of smoking on health and the benefits of quitting, as well as quit intentions.

Table 2 Coding framework

Trained coders (T.S., P.S.) extracted data from each record according to the published details contained in the publication. All of the records were cross checked by at least two independent coders to ensure consistency of data extraction.

Quality assessment

To assess the risk of bias in included studies we applied either the Risk Of Bias In Non-Randomized Studies - of Interventions tool (ROBINS-I) [30] or Risk of Bias tool 2 (RoB2) [31] to each study selected into the systematic review. The ROBINS-I tool was used to evaluate the risk of bias in the results of non-randomized studies of interventions (NRSI) that compare the health effects of two or more interventions. This tool is highly applicable to the evaluation of GHW label studies because the types of NRSIs that can be evaluated using this tool are quantitative studies estimating the effectiveness (harm or benefit) of an intervention, which did not use randomization to allocate units (individuals or clusters of individuals) to comparison groups. This description applies to the majority of the studies selected into our review. The designs include studies where allocation occurs during the course of usual treatment decisions or peoples’ choices. In our systematic search results, the studies were of NRSI designs and nine papers were RCTs. For those RCTs, a Cochrane developed quality assessment framework – Risk of Bias Ver. 2 (RoB2) was used. RoB2 is a tool used to assess the risk of bias of the intervention effect between two intervention groups (the experimental and comparator group) [23]. This tool is applicable to the evaluation of quantitative GHW label studies ascertaining the effect of adhering to a controlled intervention (i.e. pictorial GHW, text only GHW or no GHW) where individuals were randomly assigned to intervention or comparator groups.

Each study was carefully examined and coded considering all the ways in which it might be put at risk of bias with close reference to the guidelines. Two independent reviewers (T.S., P.S.) coded each study and coding was compared. Any discrepancies were discussed and resolved with the help of a third and fourth reviewer (B.P., K.K.).

Results

After removing duplicates and ineligible articles, 35 studies met the inclusion criteria, resulting in the sample of qualified records. The PRISMA flow diagram summarizing the exclusion and inclusion process is shown in Fig. 1.

Fig. 1
figure 1

PRISMA Flow Diagram

Study Demographics

Overall, 40% of the studies were conducted across multiple countries (n = 14) such as Australia, Canada and Mexico [3, 32, 33], while 60% of interventions were conducted in one country (n = 21). Out of the 35 qualified studies, 63% of interventions were conducted in developed countries (n = 22), 29% in countries in development (n = 10), and 8% in both types of populations (n = 3). Frequently, data across a range of countries were reported in a single study, therefore, the quantity of counties mentioned is higher than the total of 35 studies. Developed countries evaluated include Australia (n = 12) [32, 34], the United States (n = 12) [35, 36], Canada (n = 11) [3, 33], Europe (n = 9) (five in the UK [1, 35], three in Germany [37,38,39], one in France and the Netherlands [38], and one in Italy [40]. Interventions conducted in countries in development (n = 10) included Asia (n = 6) (five in Malaysia [5, 41], three in Thailand [42, 43], one in China [5], one in India [4] and one in Vietnam [44], Latin America (n = 5) (four in Mexico [45], one in Uruguay [46] and one in Africa (in Mauritius [47]).

Regarding types of tobacco product packages that were tested, 86% of interventions were conducted on generic cigarette packs (n = 30), three were conducted on RYO cigarette packs [41, 43, 48], one was conducted on smokeless tobacco [4] and one was conducted on moist snuff, snus, and e-cigarettes [49]. The intervention group sample sizes varied from 44 [39] to 5991 [4]. Most of the studies were focused on smokers.

Over 70% of interventions were non-randomized designs (n = 26) followed by 26% of randomized interventions (n = 9) [39, 50]. Table 3 summaries the study demographics as well as key findings of included papers.

Table 3 Study characteristics

Quality assessment

Two tools were used to assess study quality. These are presented in turn. The ROBINS-I tool was applied to 26 of the 35 included studies due to the NRSI nature of the studies. The majority (96.2%) of the 26 studies were found to be at moderate risk of bias in at least one of the domains. The moderate overall bias was attributed mostly to ‘confounding’ variables. Confounding was predominately due to introduction of enhanced pictorial health warnings [3, 35], other marketing campaigns and smoking policies (e.g. price increases) during the course of the longitudinal study [34, 48], or due to variance in smoking quantity amongst participants [32, 38, 51]. One study was found to be serious in at least one domain [1]. Partos, Borland [1] was rated as serious in the ‘selection of participants’ domain because selection into the study was based on self-report of quitting smoking and the main outcome measures were quit-related with no note on whether selection bias was taken into account. Over half of the studies (18 out of 26 studies) were at low risk of bias due to missing data. Many studies had almost complete outcome data or used suitable data analysis methods to minimise the bias from missing data. Table 4 summarizes risk of bias ratings for individual studies.

Table 4 Quality assessment

The RoB2 tool was applied to accurately address bias in nine studies that were identified as RCTs. Six studies were judged to be at a low risk of bias among all domains [37, 39, 49, 50, 52, 53]. Two studies were judged to have some concerns in at least one domain [36, 54]. Parada, Hall [36] was low in all domains except for bias due to missing outcome data. One paper [36] stated “Proportions of participants who completed the follow-up surveys at each of the follow up weeks included 86% at week 1, 82% at week 2, 81% at week 3 and 84% at week 4” (p 878). No information regarding how missing data was handled or if the missing data biased the final result was provided lowering assessment scores. Fathelrahman, Omar [54] was rated as some concerns in effect of assignment to interventions but was low in all other domains. There was unclear information on whether participants were blinded to the intervention because “In both groups, participants were given the packs all at the same time and instructed to take a few minutes to examine them” (p. 4091). In addition, research staff were aware of the intervention that was assigned to participants as they assigned the cigarette packaging.

One study was judged to be a high risk of bias overall as a result of a high risk from the randomization process [55]. There was no description of the randomization process. Furthermore, there were substantial differences between intervention group sizes, compared with the intended allocation ratio. Group differences were not examined “because of small cell sizes, we did not examine statistical differences in socio-demo- graphics by study condition” (p. 787). For all other domains, McQueen, Kreuter [55] was judged to be at low risk of bias. Table 5 presents details on RoB2 assessment applied to randomized controlled trials included in this review.

Table 5 Quality assessment

GHW formats

GHWs vary in formatting according to the size of image, placement on the package, message framing, congruence, colour, text size, message attribution (source), and the inclusion of Quitline information. The included studies described warning formats for images, with sizes ranging from 30 to 90% coverage of cigarette packaging. Formats in front of the cigarette package varied from 30% (n = 13) [35, 56], 50% (n = 15) [41, 50] to 75% (n = 8) [32, 34], and included other sizes (n = 10) between 40% [5] and 90% [33]. Formats on the back of the cigarette package varied from 30% (n = 1) [37], 50% (n = 14) [41, 50] to 75% (n = 4) [33, 57]. Once again other sizes (n = 7) such as 40% [38], 80% [46], 90% (n = 13) [34, 48] and 100% (n = 3) were described [3, 32, 33].

In regard to the location of the images on the cigarette pack, 77% of the studies (n = 27) evaluated front and back image formats. Twelve studies evaluated other location formats including only the front image (n = 5) [39, 54], front, back & sides (n = 1) [32], front, back & package inserts (n = 4) [3, 33, 47, 57], and no location on the pack (GHW images shown on a screen) (n = 2) [37, 49].

In regard to message framing, 91% of the studies (n = 32) evaluated negatively framed message formats. Negative framing refers to fear-based images, for instance, a graphic photo of a severe disease (e.g. mouth cancer) in Australian GHWs [35]. Nine studies evaluated other types of message framing including symbolic messages (n = 3), for example, an image of a scorpion to communicate danger for Indian GHWs on smokeless tobacco [4], or an empty cradle paired with the message ‘tobacco hurts everyone’ in Canadian GHWs [58], and mixed (n = 6) framing where negative and symbolic GHW formats in the same groups [3, 32, 33, 47, 57, 59] were evident.

GHW formats where the message was presented in images and text communicating a consistent, aligned message is a key factor for GHW effectiveness. One study evaluated an incongruent message format, notably a scorpion GHW that was not congruent with the message ‘tobacco kills’ that was implemented when the GHW was changed in India [4]. Likewise, 94% of the studies (n = 33) evaluated color GHW image formats. Only two studies evaluated black and white formats from RYO cigarette packs in Thailand [41, 43].

In regards to text formats that accompany GHWs, text size varies from approximately 10% of the package, for instance in Malaysia [5] and the UK [1], to 100% of the back of the package in the case of Mexico [3]. Fifty-four percent of studies evaluated formats that displayed a text source of attribution (n = 19). For instance, the ‘Health Authority Warning’ source that appeared in 2006–2012 Australian GHWs (n = 9) [34, 35], Health Canada (n = 8) [32, 47], and US Department of Health and Human Services (US HHS) (n = 5) [55, 60]. 46% of studies evaluated formats that did not display any source of attribution (n = 16) [42, 46]. Forty-three percent of studies tested formats that displayed a quit line number (n = 15), for instance, Australian GHW formats before and after plain packaging policy [34]. Finally, 37% of studies tested formats that did not display any quit line number (n = 13), such as Uruguay [46], and 17% of studies tested both formats (n = 6). Table 6 summarizes the formatting features of included studies.

Table 6 Formatting of GHWs

Perceived harm outcomes

Thirty-four studies measured the impact of changes to GHWs on perceived risk using various measures across studies and countries. To be noted that the measurement of perceived risks is substantially inconsistent across included studies. In eight studies, a single measure of “To what extent, if at all, do the health warning labels make you think about the health risks of smoking” (or slight variation of wording) was used to measure the perceived risk of smoking [1, 5, 41, 45, 46, 54, 55, 58, 59]. Seven studies used combined “cognitive response” measures [4, 34, 35, 38, 42, 43, 61]. See the Table 7 for the detailed measures. The perceived risk measure of “extent to which the warnings made the respondent think about the health risks of smoking” was often combined with the “if the health warnings made them more likely to quit” to cognitive reactions. Other commonly used measures were perceived likelihood of harm which were used in three studies [50, 52, 53] and identifying that smoking causes tobacco related diseases was used in six studies [3, 33, 37, 47, 56, 62]. A brief summary of the findings is presented in Table 7.

Table 7 Perceived harm outcomes

Twenty-four studies showed an increase in perceived harm over time after the implementation of GHWs [3, 5, 34, 50, 59]. Gravely, Fong [46] reported an increase of perceived risk between pre-policy wave 2 (31.5%) and post-policy wave 3 (43.3%). Similarly, Elton-Marshall, Xu [5] and Fathelrahman, Li [42] showed an increase in perceived risk from pre-policy (6.90%) and (M = 3.6, SD = 1.9) and post-policy (11.80%) and (M = 3.8, SD = 2.0) respectively, in Malaysia. However, Li, Fathelrahman [43] showed that whilst there was an increase in perceived harm in Malaysia from pre to post-policy (M = 0.07, SD = 0.06); M = 1.01, SD = 0.06), there was a continued decrease in perceived harm over the years after implementation of GHW (M = 0.47, SD = 0.06). It is noteworthy that all studies that increased perceived risks have GHWs that are 50% and greater, and most of them have credible sources listed next to the warning messages.

Furthermore, the remaining 10 studies found a decrease in perceived harm over the period of the study. As briefly introduced above, a further six studies showed a decrease over a number of years after policy implementation of graphic health or improved graphic [1, 32, 35, 58, 61, 62]. As found in Borland, Wilson [35], Canada in wave 1 had implemented its GHWs and over the four survey waves, the measure of perceived harm decreased from 1.93 (response strength) to 1.84. Hitchman, Driezen [58] confirmed that there was a decrease (OR: − 0.320) in the effectiveness of Canada’s GHWs over the period 2003–2011. Likewise, similar results were found in Australia with Kennedy, Spafford [62], investigating the impact of blindness tobacco warning labels, finding that 69.5% of Australians were aware of the risk that smoking causes blindness, however 3 y later that decreased to 57.5% of Australians. The wear-out effects seem to be evident regardless of GHW formats.

Three studies found that GHWs had a negative impact on perceived risk [36, 37, 52]. All of these studies had short exposure periods with Brewer, Parada [52] and Parada, Hall [36] only testing pre and post exposure to health warnings. Brewer, Parada [52] decreased perceived risk from 0.04 to 0.03 over the period of the study and similarly Parada, Hall [36] showed the same trend of perceived risk decreasing from 3.2 to 3.1 over the course of the four weeks. No clear conclusions can be drawn about GHW formats in relation to perceived risks.

Quit intention outcomes

Twenty-one of the 35 studies measured the impact of GHW on people’s intention to quit smoking. In four of the studies, quit intentions and perceived risk measures were combined into a cognitive response measure [35, 38, 43, 61] therefore it is not possible to separate the effect of warning format variables on quit intentions from the effect on perceived harm variables for the combined measures. Measures used for intention to quit focused on the “extent to which health warning labels make you more likely to quit smoking” in some studies [5, 41, 46, 59]. In other studies, measures of quit intention focused around planning or intending to quit smoking in the next week, month or year [4, 45, 48, 54].

Eleven studies showed an increase in quit intentions over the course of the study [4, 5, 36, 38, 41, 45, 46, 50, 51, 53, 59]. Gravely, Fong [46] found an increase in thinking about quitting from 21.6% of respondents at pre-policy to 31.3% post-policy in India. Likewise, the implementation of policy in Mauritius [59] resulted in quit intentions increase from 13.5% pre-policy to 26.6% post-policy which was 10–12 months later. In comparison to others, Hall, Sheeran [53] was a short study comprising of a four-week trial, though it had similar findings of an increase in quit intentions over this period. Baseline measures of quit intention among participants in the pictorial warning trial were M = 2.3, SD = 0.9 and at week four, after exposure to pictorial warnings on their cigarette package, quit intentions significantly increased to M = 2.57, SD = 1.07. However, Brewer, Parada [52] followed a comparable study design over a four-week period and found a decrease in quit intentions from baseline (Cohen’s D = 0.26) to week four follow-up (Cohen’s D = 0.16).

Six of the studies identified a decrease in quit intentions after completion of the study [35, 43, 48, 52, 58, 61]. Like perceived risk, a few studies highlighted the immediate impact that new GHW have in increasing quit intentions but an overall decline in the impact months or years later [35, 43, 48, 58]. Hitchman, Driezen [58] and Borland, Wilson [35] both confirmed that for Canada there was a decrease in the impact of its GHW from when they were introduced in 2002 to 2011. Borland, Wilson [35] found that the strength of response for quit intentions (Range from 1 to 3.67) decreased from 1.93 in Wave 1 (2002) to 1.84 in Wave 4 (2006). Similarly, Hitchman, Driezen [58] reported that quit intentions decreased by − 0.504 (OR) over an eight-year period. Durkin, Brennan [48] found a related pattern in Australia. Prior to the introduction of plain packaging 36.10% of smokers intended to quit. During the last stage of the study, quit intentions increased to 42%. However, one-year post implementation of the plain packaging quit intentions decreased back to 35.60%. Taken together, studies identify a positive impact of GHW on intentions to quit that can decline over time. Table 8 presents detailed findings from each relevant study.

Table 8 Quit intention outcomes

Discussion

GHWs are an important component of the suite of tobacco control policies [63,64,65]. Literature has identified that a significant wear out becomes present when GHW are left in market over sustained periods [36, 66, 67]. To contribute to the evidence base, this review aimed to systematically identify and synthesize evidence from longitudinal studies which tested different GHW formats and message frames to understand the effects on perceived risk of smoking and benefits of quitting and intentions to quit smoking. Taken together, a total of 35 studies demonstrated that GHWs increase awareness of health risks over time and they have desired outcomes including changing beliefs about smoking, increasing intention to quit and many more. The current review delved deeper into the mechanics of GHWs relationship to perceived health risks and intentions to quit smoking. The contributions of this study are fourfold. Each contribution is detailed in turn.

Firstly, the review identified that pictorial GHWs deliver a superior performance when compared to text only messages both in terms of magnitude and the number of positive outcomes achieved. Pictorial GHWs that are prominent (bigger than 50%) increase perceived health risks and intentions to quit smoking. This review found that GHWs exceeding 50% or more of pack size were most common, and most GHWs were printed on both the front and the back of packs. This review also identified that all studies that increased perceived health risks featured GHWs that exceeded 50% or more of cigarette pack size. The implications are clear that any country that has not mandated that GHWs be 50% of the pack or more should move to do so, given that health risks and intentions to quit smoking can result. Studies have called for more distribution channels of GHWs beyond tobacco packaging [68, 69]. Four studies [3, 33, 47, 57] in this review used package inserts as a complimentary approach to traditional GHWs. Other studies also call for GHWs to be displayed at the point-of-sale [70], via national TV ads [71] and digital channels [68]. Schmidt, Ranney [69] identified that GHWs in public service announcements can improve GHW persuasion.

Secondly, this study illuminates an ongoing reliance on negatively framed messages. Support for the use of positive message framing when targeting illness prevention behaviors including smoking cessation is available [13]. Evidence demonstrates that smokers react to negative messages (fear appeals) harming the quit intention relationship [12]. This review identifies a landscape dominated by negatively framed messages and whilst this approach is effective in increasing understanding of the risks smokers face, other approaches may be needed to induce smokers who are fully aware of health risks to quit smoking. Examination of the wider message framing evidence base indicates that positively framed messages can invoke behavioral change [72, 73]. Accordingly, research examining positive framing is called for to identify messages that can be utilized to increase quit intentions and quitting behavior. This systematic literature review has identified there is no work utilizing a longitudinal research design that has examined the positive benefits of quitting indicating there is a need to extend understanding of the role communication of the benefits of quitting (saving money, having more energy) has on quitting intentions. Further, most of the GHWs are fear-based images with strong visual and direct stimuli and only a few studies have symbolic images. The usage of symbolic images or more subtle cues needs to be studied further to draw conclusive evidence.

Thirdly, this review unveiled that source attribution is not common, although research has indicated credibility strongly contributes to believability [69, 74,75,76]. Only half of the studies specified credible sources for their warning messages (e.g., Health Canada or US HHS). Perceived risks are a cognitive measure that is heavily influenced by trustworthiness of expertise [77] and perceived authoritativeness [78]. Schmidt, Ranney [69] argued that source credibility needs to be further investigated, either topic-specific, or organization-specific, in order to improve long-term behavioral change in response to tobacco control communications.

Lastly, this review also identified that perceived harms and quit intentions were generally increased by GHWs, and confirmed prior literature [32, 43], noting that wear-out was observed across a range of GHW formats. While understanding the role of GHW formats and message framing must remain a priority, research focused on identifying GHW wear out points is needed to ensure GHW efficacy is optimized. Practically, changing sets of GHWs at the beginning of wear-out is the key to maximize the effectiveness.

Limitations and Future Research

This review has three major limitations that also warrant future research to further extend our understanding of the relationship between GHW formatting and the effectiveness of GHWs. Firstly, outcome measures are inconsistent, which needs to be considered when interpreting the findings in this paper. Future research should focus on using widely accepted and validated scales (such as scales from the Measurement Instrument Database for the Social Science, https://www.midss.org/) in order to produce results that can be directly compared across studies. Secondly, most evaluation work uses non-randomized study designs. Due to the nature of GHW policy implementation, confounding issues become inevitable. More experimental studies conducted in highly controlled environments should be conducted to deliver optimal levels of scientific rigor. RCT studies were generally assessed with a strong quality rating; However, RCTs tend to have smaller sample sizes and shorter study timeframes, which should be taken into consideration when interpreting review findings. Future research should consider utilizing RCT’s as the default study design and increase sample sizes if resources permit, in order to achieve the most reliable results. Lastly, this review did not include any grey literature such as government reports, policy statements, and industry data reports. Although grey literature was generally deemed as low-quality, in this review a thorough check was conducted in the attempt to identify any relevant grey literature in recent years, and none was found and included. Future research should consider widening the types and years of grey literature in order to capture any insights to further extend our understanding of the relationship between GHW formatting and the effectiveness of GHWs.

Conclusions

This review focuses on the strongest evidence available in the literature. Longitudinal research designs permit causal conclusions on the format of GHWs and message framing on quit intentions and smoking risks to be drawn. The findings of this review contribute to the literature extending understanding of GHW formats identifying that negatively framed pictorial GHW warnings displayed on 50% or more of packs increase perceived health risks and intentions to quit smoking. An ongoing reliance on negatively framed messages was evident, which is concerning given these can create reactance for some smokers and testing of alternate approaches (e.g. benefits of quitting smoking) is recommended given the capacity for positively framed messaging to achieve behavior changes. This review recommends further testing of alternative imagery warnings that can better communicate benefits of quitting and more use of objective measures to extend examinations beyond self-reporting.