Abstract
Establishing causality has long been an issue in the advertising-to-sales relationship. The ideal approach in establishing causality is randomized assignment of matched respondents to an exposure and non-exposure group. Given the difficulty in implementing such an experimental approach, observational data has been used in establishing a linkage between advertising and sales. The problem here is that the lack of randomized trials requires controlling for endogeneity (i.e., factors other than advertising related to sales), and accounting for heterogeneity (i.e, variations in consumer characteristics affecting responses to advertising). The authors apply two state-of-the-art methods to single-source data so as to control for endogeneity and account for heterogeneity. Based on this analysis, the authors hope to improve on the use of observational data in accounting for advertising effects in a causal fashion.
MANAGEMENT SLANT
Refined methods of statistical matching are an effective substitute for experimentation in accounting for causality in the television advertising-to-sales relationship.
Such matching must account for both demographic characteristics and pre-campaign purchase behavior.
In controlling for endogeneity and accounting for heterogeneity in the television advertising-to-sales relationship, causal results showed that the campaign for the focal brand was most effective in influencing light as opposed to heavy users.
Failure to control for endogeneity results in the opposite effect; that is, the campaign is most effective in influencing heavy users.
INTRODUCTION
Researchers have relied on two types of data to establish causality in relating television advertising to sales: experimentation and observation. This paper focuses first on the limits of experimentation and considers utilizing observational data to establish causality by matching exposed and unexposed groups. A key consideration in estimating the causal effect of advertisements on sales by using observational data rather than experimentation is the need to control for endogeneity. Controlling for endogeneity requires accounting for variables correlated to sales independent of advertising effects. Along with controlling for endogeneity, substantial variation in how particular consumers respond to advertisements—namely, heterogeneous advertising effects—must also be accounted for. Estimating such effects requires accounting for differences in consumer characteristics that may condition advertising exposure independent of the influence of advertising on sales.
In the report with an approach closest to that of the authors, Berkovich and Wood (2016) studied the direct effect of an advertising campaign on sales on the basis of a matched sample of advertising exposure and nonexposure households. They exploited demographics and purchase statistics in a matching process and compared purchase behaviors between those groups to measure sales lift. In the current paper, the authors significantly advance this approach by applying two state-of-the-art methods to control for endogeneity and account for heterogeneity in estimating the causal effects of television advertising on sales lift for a focal brand. Through better matching combined with better controls for endogeneity, the authors hope to improve the use of observational data in accounting for causality in the advertising-to-sales relationship. In so doing, the authors provide detailed procedures so that this paper helps expand the toolbox for practitioners.
METHODOLOGY
Experimentation versus Observational Data
Experimentation is the gold standard in measuring the causal effects of advertising on sales by randomly assigning respondents to an exposure group and a nonexposure group, thus ensuring that the response (sales) is a function of the experimental stimulus (advertising; e.g., Aaker and Carman, 1982; Hu, Lodish, and Krieger, 2007; Lodish, Abraham, Kalmenson, Livelsberger, et al., 1995). Experimentation has been used previously to test pre-campaign advertising effects through test markets (Eskin, 1975; Eskin and Baron, 1977). Experimentation is difficult to implement once a campaign is launched, because of the difficulty in ensuring the integrity of a control (nonexposure) group. Marketers using traditional media such as television and radio would find it difficult to ensure that a control group was not exposed to any advertising during the campaign period. They could utilize split-cable television, but directing resources is more likely to be a function of marketing rather than experimental objectives; namely, targeting the most likely buyers rather than assigning respondents to randomized groups. Marketers may be reluctant to withhold advertising from a randomized control group because of the sacrifice in revenue as a result. If test groups are selected by targeting criteria rather than randomization, this creates the same problems of endogeneity as for observational data; namely, that the relationship between advertising and sales could be artifactual in that sales results are independent of advertising exposure.
The primary alternative to experimentation is to use observational data (e.g., single-source data relating advertising exposure to purchases at the individual level) and match those not exposed with those exposed on a variety of criteria (e.g., Bronnenberg, Dubé, and Mela, 2010; Draganska, Hartmann, and Stanglein, 2014; Kumar, Bezawada, Rishika, and Janakiraman, et al., 2016). The problem here is to ensure that matching treatment and control groups on key criteria is sufficiently randomized to control for endogeneity. This problem has been reported in previous research (Blake, Nosko, and Tadelis, 2015; Lewis, Rao, and Riley, 2011).
Measuring Sales Lift
The authors define sales lift as the difference in sales results before and immediately after an advertising campaign resulting from the campaign. Berkovich and Wood (2016) explained that, to be included in a sales lift study, households must meet pre- and post-campaign periods “static” to ensure that they are actively purchasing the focal product. This static is a function of various shopping behaviors, such as purchase frequency, purchase spending, and interpurchase time. Similarly, the current authors measure sales lift by using different household samples as a function of purchase frequency and purchase spending on the focal brand and category during the pre- and post-campaign periods (e.g., top 10 percent, 25 percent, and 50 percent of total households in terms of purchase frequency and spending for the focal brand). In this study, the pre-campaign period is defined as one year before the start of the television campaign. Although Berkovich and Wood (2016) measured household-level incremental sales considering matched samples between test and control groups, they primarily relied on demographics and purchase statistics in a matching process and then simply compared statistics of purchase behaviors between those groups to measure sales lift. In contrast, this study exploits formal econometric approaches—propensity score matching and difference in differences—for matching samples and measuring sales lift, and will discuss all details of the empirical methods in the subsequent section.
Data and Summary Statistics
This study tests sales lift for a brand in a chocolate-candy category for a television campaign that ran from March 28 to June 16, 2016. Sales data refer to household-based scanner data collected by Nielsen Catalina Solutions for a sample of households in the United States whose purchases are scanned and recorded after each shopping trip. The data are single source also, combining television advertising exposure and purchase data from the same household at the individual level so that the effects of television advertising on purchase behaviors can be studied. There are data for 435,327 households in total, including all shopping information from March 29, 2015, to June 26, 2016. (See Table 1 for the summary statistics for all the details of purchase behavior in the chocolate-candy product category.)
Controlling for Endogeneity
Because this study uses observational data, controlling for endogeneity is central to the analysis. Two sources of endogeneity in the relationship between advertising and sales are targeting-induced endogeneity and exposure-induced endogeneity (Gordon, Zettelmeyer, Bhargava, and Chapsky, 2019; Lewis et al., 2011). Targeting-induced endogeneity occurs when marketers target heavy buyers, resulting in purchases that would have occurred independent of advertising effects. Relating advertising to sales overstates the association. Exposure-induced endogeneity is a function of variations in advertising exposure independent of advertising effects. If demographic characteristics are related to the likelihood of viewing a campaign, exposure is likely to be a function of such characteristics as well as advertising effectiveness. Relating advertising effectiveness to sales must control for demographic covariates related to the level of advertising exposure.
Additionally, previous literature (e.g., Anand and Shachar, 2011; Lovett and Peress, 2015) particularly consider endogeneity of television advertising that is mainly driven by unobserved characteristics such as geography or demographics. The most recent study investigated the problem of individual-level television advertisement targeting by leveraging granular data linking household-level television advertisement viewing with product purchase (Tuchman, Nair, and Gardete, 2018). Although these papers rigorously control for advertising endogeneity using a structural model, it would be difficult for practitioners to implement and replicate this strategy because of a sophisticated analysis process. Therefore, the authors propose another approach to control for television advertising endogeneity in a reduced-form way that is simpler and easier to execute.
The authors combine two state-of-the art econometric approaches, difference in differences and propensity score matching, to measure television advertising effects after controlling for the endogeneity of television advertising exposure. Ever since the work of Ashenfelter and Card (1985), the use of the difference in differences method has become widespread in marketing and economics (e.g., Angrist and Pischke, 2009; Goldfarb and Tucker, 2014; Imbens and Wooldridge, 2009; Meyer, 1995). In this approach, outcomes (i.e., purchases of the focal brand) are observed for two groups (the advertisement-exposed group and the non-advertisement-exposed group) for two time periods (pre- and post-campaign periods).
Specifically, the following regression equation was used: (1)
where i indicates a household and t indicates a shopping occasion. Purchaseit is an indicator equal to 1 if household i purchased the focal brand at time t. Similarly, Ad Exposurei is an indicator equal to 1 if household i was exposed to a television advertisement at least once during the campaign period and 0 if otherwise. Campaign Periodt is also an indicator equal to 1 if time t is after the starting date of the television advertising campaign; otherwise, it is equal to 0. The final variable is Priceit. Last, εit is a stochastic error term.
First, price is one of the most important components affecting consumer purchase in consumer-packaged goods, and it is expected to influence purchase response negatively. Second, to identify the effect of television advertisements on purchase, Ad Exposurei, Campaign Periodt, and the interaction term must be measured. One of the groups is exposed to television advertisements in the campaign period but not in the pre-campaign period. The second group is not exposed to television advertisements during either period. In the case in which the same units within a group are observed in each time period, the average gain in the second (control) group can be subtracted from the average gain in the first (treatment) group. This removes biases in the second period comparisons between the treatment and control groups that could be the results of permanent differences between those groups, as well as biases from comparisons over time in the treatment group that could be the results of trends.
As for the interpretation of the coefficients, β1 captures possible differences between the television-advertising-exposed and non-advertising-exposed groups. The coefficient of the time period, dummy β2, captures aggregate factors that would affect changes in the purchase of our focal brand, even in the absence of the television advertising campaign. The authors aim to determine coefficient β3, which accounts for the interaction term between treatment status and behavior in the pretreatment period. (2)
β3 measures the incremental advertising effects of those exposed to the campaign versus those not exposed while accounting for pre-campaign behavior.
This coefficient measures the causal effects of television advertising exposure during the campaign period on brand purchases. The authors hypothesize that β3 is positive and significant if television advertisements have positive causal effects on brand purchases. The authors run the ordinary least squares regression of the equation, a linear probability model.
As mentioned previously, however, the advertising exposure variable in Equation 1 is endogenous because of the exposure selection issue. To control for the endogeneity of television advertising exposure, this paper uses another state-of-the-art econometric technique: propensity score matching. This approach is used in many academic works in marketing; for example, examining the effectiveness of Internet and television advertisements on brand building (Draganska et al., 2014), understanding the effects of firm-generated social media on customer metrics (Kumar et al., 2016), and investigating the effects of access to digital video recorders on sales of advertised products (Bronnenberg et al., 2010). The propensity score is the probability of treatment assignment (i.e., status of being exposed to television advertisements) conditional on observed baseline characteristics (i.e., demographics and precampaign period purchase behaviors). The propensity score allows the design and analysis of an observational (nonrandomized) study so that it mimics some of the particular characteristics of a randomized controlled trial. In particular, the propensity score is a balancing score so, conditional on the propensity score, the distribution of the observed baseline covariates will be similar for advertising-exposed and non-advertising-exposed households.
To compute the propensity score, the authors used the following equation: (3)
where i indicates a household and Ad Exposurei is an indicator equal to 1 if household i was exposed to television advertisements at least once during the campaign period and 0 otherwise. The important term is Xi, which is a vector of covariates, demographics, and pre-campaign period purchase behaviors (including income, age, household size, race, gender, population of the household’s location, marital status, education, origin, purchase spending, purchase amount, purchase with promotion at both the brand and category levels, and main shopping retailers).
On the basis of the propensity scores, the nearest neighbor is matched, enabling matching advertising-exposed households with non-advertising-exposed households so that the two groups can be compared in a process similar to that of a randomized control trial.
The authors found a close match between households exposed and those not exposed to advertising in demographic characteristics and a lack of statistically significant difference in pre-campaign purchase behavior as covariates, which might affect the probability of being exposed to advertisements (See Table 2). A balanced distribution of the covariates affecting advertising exposure probability between these two groups shows how we control for the endogeneity of television advertising exposure.
This approach is similar to the one used by Berkovich and Wood (2016) to match test and control households on the basis of household demographics and shopping characteristics. They stated that the purpose of matching is to be able to attribute any increase in spending to advertising rather than to an existing preference for the product being studied. The current study goes further by matching the households on the basis of their propensity scores of exposure to television advertisements, which is a function of demographics and pre-campaign period purchase behaviors. Specifically, the current study computes the propensity scores of all households in the data and chooses one of the households in the control group (i.e., a household not exposed to television advertising) that shows the nearest propensity score of the household in the treatment group (i.e., a household exposed to television advertising).
After the matching process, there is now a dummy variable, advertising exposure, which is equal to 1 if household i is exposed to advertisements and 0 otherwise for all households. This newly created advertising exposure variable is used as a covariate in Equation 1. By doing so, the effects of the television advertising campaign on brand purchase after controlling for the endogeneity of advertising exposure can be measured (See Table 4, shown later).
Heterogeneous Advertising Effects
The estimates in previous sections (e.g., β3 in Equation 1) indicate average causal effects of advertising on brand purchase, implying that all the households exposed to the advertising campaign have the same sensitivity to television advertisements. How the effects vary across subhouseholds is also of interest, since different subgroups of the households are likely to react differently to television advertisements. Previous literature in marketing and economics (e.g., Berkovich and Wood, 2016; Rossi and Allenby, 1993; Zenetti and Otter, 2014) emphasizes the importance of such heterogeneous advertising effects on consumer purchase decisions.
To capture heterogeneous advertising effects, this study exploits a more robust analysis that is an extension of the difference in differences analysis described earlier: a difference in difference in differences analysis using a different control group within the treatment state (i.e., advertisement exposure). The causal effects of television advertisements (i.e., β3 in Equation 1) in the difference in differences analysis imply that all households that were exposed to advertisements have the same sensitivity in responding to advertisements. The effectiveness of the advertisement, however, could vary across households, depending on their purchase behaviors. Heavy brand purchasers, for example, may be more (or less) likely to respond to advertisements compared with light brand purchasers. To understand heterogeneous causal advertising effects on brand purchase, more robust indicator variables are needed: the top 50 percent, top 25 percent, and top 10 percent of brand purchasers based on the total purchase frequency before the campaign period. The indicator variable for the top 50 percent is equal to 1 if the household purchased the study’s focal brand at least twice in the pre-campaign period and 0 if otherwise. The authors create the other indicators in the same manner; for example, the top 25 percent of brand purchasers equals 1 if they bought the brand at least three times in the pre-campaign period and 0 if otherwise. (See Table 3 for a summary of the definition, total number, and actual percentage of these household segments.)
Next, several interaction terms are added in Equation 1 as follows, using the top 50 percent as an example: (4)
As for the interpretation of the coefficients, β4 captures the causal effects of the campaign on households that have never purchased or purchased at least once the focal brand in the pre-campaign period (i.e., households that are not in the top 50 percent). β7 denotes the difference in causal effects of the advertising campaign between households that were exposed to the campaign and purchased the focal brand at least twice in the pre-campaign period versus those that did not purchase or purchased at least once (β4) while controlling for prepurchase behavior. Similarly, replacing the Top50% with the other indicator variables, such as Top25% and Top10%, allows for the heterogeneity of advertising effects between corresponding groups to be calculated. This approach is called the difference in differences in differences. Here, β7 is described as follows: (5)
Again, β7 measures the difference in causal effects of television advertisements between the top 50 percent of households (purchased at least twice) and everyone else.
RESULTS
The authors report the causal effects of the television advertising campaign on brand purchase, controlling for endogeneity (See Table 4). The second column of Table 4 shows the results from the difference in differences analysis without a matching process to control for advertising exposure. The third column shows the results from the difference in differences analysis using both demographics and pre-campaign period purchase behaviors in the matching process.
The key measure is β3, the interaction between advertising exposure and the campaign period. In the second column, after matching with consumer demographics and pre-campaign purchase behavior, the marginal effect of the interaction between advertising exposure and campaign period is 0.005 (i.e., the incremental effects of β3 in Equation 1), meaning that the households that were exposed to advertisements during the campaign period more likely will purchase the brand by 0.5 percentage points, compared with the households that were not exposed to advertisements during the campaign period. Given that the average brand purchase probability across all observations is 15.17 percentage points, the 0.5 percentage point increase corresponds to a 3.30 percent increase in brand purchase probability (0.5/15.17).
The interesting observation is that if advertising exposure endogeneity is not controlled, there is little effect of advertising exposure on brand purchase. Without a matching process, the coefficient of the interaction term becomes insignificant (See Table 4, column 1). That means households that were exposed to advertisements during the campaign period are statistically indifferent from the households that were not exposed to advertisements during the campaign period. The findings demonstrate the importance of accounting for pre-campaign purchases and demographic matching to show advertising effects.
The authors report the heterogeneous causal effects of television advertisements from the estimations of the difference in difference in differences analysis (See Table 5). Here, heterogeneity is accounted for from brand purchase frequency in the pre-campaign period. As the purpose of allowing for heterogeneity in this analysis is to distinguish the brand choice between heavier and lighter brand users, it can be observed how television advertisements influence consumers by frequency of brand purchase. Each column shows results of different segments, with brand purchases defined at the top 50, 25, and 10 percentiles (See Table 3). The authors use both demographics and pre-campaign period purchase behaviors in the matching process to control for the endogeneity of television advertisement exposure.
The results show when households are split using the indicator variable for the top 50 percent versus bottom 50 percent of brand purchasers; that is, households that purchased the focal brand at least twice in the pre-campaign period (brand purchasers) versus those that did not or purchased at least once (See Table 5, column 2). β4 measures the causal incremental effect of television advertisements for nonpurchasers or lighter brand purchasers. β7 measures the incremental difference in causal effect of the campaign for heavier brand purchasers versus nonpurchasers or lighter brand purchasers in the pre-campaign period. The results show a significant effect of the campaign on lighter brand purchasers or nonpurchasers, with an incremental increase in brand purchases of 0.8 percentage points; the campaign, however, had a smaller effect on heavier brand purchasers. β7 shows a decrease of an additional 1.8 percentage points in brand purchases for heavier brand purchasers. The results suggest that the advertising campaign increases brand purchase for both purchasers and nonpurchasers but more so for lighter brand purchasers and nonpurchasers.
The results also show when households are split using the top 25 percent indicator variable; that is, households that purchased at least three times in the pre-campaign period versus those that did not purchase or who purchased, at most, twice—the bottom 75 percent (See Table 5, column 3). The results show that the causal effect of television advertisements for those in the bottom 75 percent (β4) is a significant incremental increase of 0.6 percentage points in brand purchases. The difference in the incremental causal effect between households in the top 25 percent and those in the bottom 75 percent (β7) is negative (and significant), and this result suggests that the causal effect of television advertisements for heavier brand purchasers is less effective than for nonbrand and lighter brand purchasers. A similar conclusion can be drawn from the results, where households are split using the top 10 percent indicator variable; those who purchased at least six times versus those who purchased, at most, five times or did not purchase (See Table 5, column 4).
In short, the results in Table 5 suggest that the campaign was effective across the board for both users and nonusers but most effective for nonusers and lighter users. Targeting heavy brand users would yield no advantage in increasing brand purchases.
Next, the authors further investigate how the results of heterogeneous advertisement effect controlling for endogeneity differ from the results of not controlling for it. The authors conduct the same analysis with the sample without matching (See Table 6). Similar to the results in the previous table, each column shows results of different segments, with brand purchases defined at the top 50, 25, and 10 percentiles of all the households in the sample (i.e., 435,327). Interestingly, the coefficients of the three-way interaction terms (i.e., β7) in all the segments are positive and significant, implying that advertising effects are greater for heavier brand purchasers compared with nonpurchasers or lighter brand purchasers. Indeed, this result drives an opposite insight to the findings in Table 5 and thus shows why controlling for advertising endogeneity is important for understanding the correct advertisement effect on brand purchase.
Overall, the findings suggest that, for this chocolate-candy brand at least, a broader based targeting to light brand users and nonusers is more effective than a more concentrated targeting to the heavy quarter or heavy tenth of brand users.
DISCUSSION
Limitations and Future Research
The data are derived from a single brand source. There is no information on switching behavior and, as a result, the authors cannot analyze competitive influences on the advertising-to-sales relationship in a dynamic marketplace. This means that, although it is possible to measure causal advertising effects on sales for the focal brand, brand managers do not know where a sales increment came from as a result of the advertising campaign. Future research could build on this analysis by investigating a multibrand data source that could show brand managers how consumers substitute their brand purchases due to an advertising campaign. Examining the causal effects that advertisements have on sales by considering a substitution pattern between brands would provide better insight into the product category. Moreover, this analysis focused on one category. Future research could focus on causal advertising effects across categories.
Conclusion
Controlling for advertising endogeneity and accounting for heterogeneity in the campaign for the focal brand, this analysis confirms several findings. First and most important, without matching for pre-campaign purchases, advertising effects in the campaign period become insignificant. This demonstrates the importance of matching by purchase predispositions to account for advertising effects in evaluating the advertising-to-sales relationship. Second, the campaign was effective in influencing both brand purchasers and nonbrand purchasers but more effective for nonbrand purchasers and lighter brand purchasers compared with heavy brand purchasers. Third, without a matching process to control for endogeneity, the analysis on heterogeneous advertisement effects shows incorrect insight (i.e., advertising to heavier brand purchasers is more effective). This demonstrates why using the proposed approaches is important.
The analytical approach illustrated for controlling for endogeneity in accounting for the causal effects of advertising and showing its heterogeneous effects is, perhaps, more significant.
ABOUT THE AUTHORS
Henry Assael is Professor Emeritus of Marketing at the Stern School of Business, New York University. He has published twelve articles in this journal over the years in the area of evaluating advertising and media effectiveness, most notably his article titled “From Silos to Synergy” on cross-media effectiveness that won the inaugural JAR Best Paper award in in 2011. Assael has authored the following texts: Consumer Behavior and Marketing Action (seven editions); Marketing (three editions); and Marketing: Principles and Strategy.
Masakazu Ishihara is associate professor of marketing at Stern School of Business, New York University. His research focuses on quantitative marketing and empirical industrial organization with particular interest in the dynamic effects of marketing strategies, forward-looking decision making by consumers and firms, and marketing and economics in the entertainment industry. Ishihara received the inaugural 2010 INFORMS Society for Marketing Science (ISMS) Doctoral Dissertation Competition Award, and Honorable Mention of Dick Wittink Prize Award in 2011. He has also been selected as one of the Marketing Science Institute (MSI) Young Scholars of 2017.
Baek Jung Kim is assistant professor of marketing at Sauder School of Business, University of British Columbia. He coauthored this paper as a doctoral student at NYU Stern School of Business. Kim’s research interests include digital platform adoption and usage.
ACKNOWLEDGMENTS
The authors thank Leslie Wood, chief research officer, and Andrew Faehnle, director of research and development, at Nielsen Catalina Solutions for the data used in the analysis and for their guidance in establishing the analytical approach.
- Received May 1, 2019.
- Received (in revised form) November 20, 2019.
- Accepted January 28, 2020.
- Copyright © 2021 ARF. All rights reserved.