Skip to content

Uncategorized

9 min readJoris van Huët

Structural Causal Models in Marketing Explained

Learn how Judea Pearl's structural causal models (SCMs) apply to marketing attribution, including DAGs, do-calculus, and counterfactuals.

Share
Quick Answer·9 min read

Structural Causal Models in Marketing Explained: Learn how Judea Pearl's structural causal models (SCMs) apply to marketing attribution, including DAGs, do-calculus, and counterfactuals.

Read the full article below for detailed insights and actionable strategies.

The attribution problem

One sale. Four channels. 400% credit claimed.

100
1 sale
Meta
100%
claimed
Google
100%
claimed
TikTok
100%
claimed
Klaviyo
100%
claimed

Reported revenue: 400 · Actual revenue: 100 · Gap: €300

Structural Causal Models in Marketing: Pearl's Framework Explained

A structural causal model (SCM) is a mathematical framework developed by Judea Pearl that represents cause-and-effect relationships between variables using a combination of directed acyclic graphs (DAGs) and structural equations. In marketing, SCMs provide the theoretical foundation for determining whether a campaign actually caused a sale, rather than merely correlating with it.

Pearl's framework is not just academic theory. It underpins the most rigorous approaches to marketing attribution available today, including the causal inference methods that are replacing correlation-based multi-touch attribution at forward-thinking e-commerce brands.

What Is a Structural Causal Model?

An SCM consists of three components:

1. Endogenous Variables (V)

These are the variables within the system that the model explains. In a marketing context, endogenous variables might include:

2. Exogenous Variables (U)

These are external factors that influence the system but are not explained by it. Examples include:

  • Seasonality
  • Competitor activity
  • Economic conditions
  • Weather

3. Structural Equations (F)

These are the functions that define how each endogenous variable is determined by its causes. For example:

Revenue = f(Meta_spend, Google_spend, TikTok_spend, Seasonality, Organic_demand, Noise)

The structural equations are not merely statistical associations. They encode the causal mechanisms. Changing the input on one side of the equation changes the output, which is what makes the model causal rather than correlational.

Pearl's Causal Hierarchy: The Ladder of Causation

Pearl organizes causal reasoning into three levels, often called the "Ladder of Causation." This hierarchy is essential for understanding why most marketing attribution falls short.

Rung 1: Association (Seeing)

Question: "What do I observe?"

This is the domain of standard analytics. You observe that customers who clicked Google Ads have a higher conversion rate than those who did not. Traditional attribution models like last-click and linear attribution operate entirely on this rung.

The problem: association does not imply causation. Customers who click branded search ads were already intending to buy. The high conversion rate reflects their pre-existing intent, not the ad's causal impact.

Rung 2: Intervention (Doing)

Question: "What happens if I do X?"

This is the domain of experiments and Pearl's do-calculus. Instead of observing what happened, you ask what would happen if you intervened, for example, if you increased Meta spend by 20%.

The key notation is P(Y | do(X)), read as "the probability of Y given that we set X to a specific value." This is fundamentally different from P(Y | X), which is merely the probability of Y given that we observed X.

In marketing terms: P(Revenue | do(Meta_spend = $50K)) asks "what would revenue be if we actively set Meta spend to $50K?" This accounts for the fact that spending decisions are not random; they are influenced by past performance, seasonality, and other factors that create confounding.

Rung 3: Counterfactuals (Imagining)

Question: "What would have happened if I had done differently?"

This is the highest level of causal reasoning and the most relevant for attribution. A counterfactual asks: given that we ran this specific campaign and observed this specific outcome, what would the outcome have been if we had not run the campaign?

The difference between the actual outcome and the counterfactual outcome is the campaign's true incremental impact. This is precisely what incrementality testing and causal attribution aim to measure.

Most marketing measurement tools operate on Rung 1. The best operate on Rungs 2 and 3.

Directed Acyclic Graphs (DAGs) in Marketing

A DAG is the visual representation of an SCM. Each node represents a variable, and each arrow represents a direct causal relationship. "Acyclic" means no variable can cause itself through a chain of effects.

A Simple Marketing DAG

Consider a basic e-commerce scenario:

Meta Spend ──→ Website Traffic ──→ Revenue
                                      ↑
Google Spend ──→ Website Traffic ──→──┘
                                      ↑
Seasonality ──────────────────────→──┘

This DAG encodes several causal claims:

  • Meta spend causally affects website traffic, which causally affects revenue
  • Google spend also causally affects website traffic and revenue
  • Seasonality directly affects revenue independent of ad spend

Why DAGs Matter for Attribution

DAGs make explicit the assumptions behind any causal analysis. They force you to articulate which variables cause which, and critically, which variables do not have direct causal relationships. This matters for identifying confounding variables.

For example, suppose both Meta spend and revenue are influenced by seasonality. During Q4, you spend more on Meta ads, and revenue is higher. Without accounting for the seasonality confounder, a naive analysis would overestimate Meta's causal impact because part of the revenue increase was caused by seasonal demand, not by the ads.

The DAG reveals this confounding path: Seasonality causes both Meta Spend and Revenue. To isolate Meta's true causal effect, you must control for seasonality, either through statistical adjustment or experimental design.

Do-Calculus: From Observation to Intervention

Pearl's do-calculus is a set of three mathematical rules that determine when and how you can compute causal effects from observational data. In practical terms, it tells you when you can answer Rung 2 questions using data collected at Rung 1.

For marketing, this is transformative. Running true experiments (Rung 2) requires holding out customers or geographies from campaigns, which costs money and is operationally difficult. Do-calculus identifies conditions under which you can estimate causal effects from the data you already have.

The key insight is the concept of the backdoor criterion. If you can identify a set of variables that blocks all confounding paths between your treatment (ad spend) and your outcome (revenue), then conditioning on those variables allows you to estimate the causal effect without running an experiment.

In practice, this means a well-specified SCM with the right data can give you causal attribution estimates from observational data alone, no geo-lift tests required (though experiments remain valuable for validation).

How SCMs Apply to Marketing Attribution

Replacing Rules with Causal Structure

Traditional rules-based attribution assigns credit using predetermined formulas that have no empirical basis. The U-shaped model gives 40% credit to the first and last touchpoints not because those interactions were measured to be 40% responsible, but because someone chose that number.

SCMs replace these arbitrary rules with a structural model of how marketing actually causes revenue. The credit assigned to each channel emerges from the model's estimated causal effects, not from a formula. If Meta prospecting truly caused 35% of incremental revenue, the model assigns 35%, whether that touchpoint was first, last, or in the middle.

Handling Confounders

Marketing data is riddled with confounders. Brands spend more during periods of high demand. Channels that target high-intent customers appear more effective simply because their audience was more likely to convert anyway. Promotional periods drive both increased spending and increased revenue.

SCMs provide a principled framework for identifying and controlling these confounders, using Bayesian methods and the DAG structure to separate the causal signal from the noise.

Estimating Counterfactuals

The most practical application of SCMs in marketing is counterfactual estimation. For each campaign or channel, the model answers: what would revenue have been without this marketing activity?

This is the foundation of incremental revenue measurement. A channel might touch many customers (high attributed credit under rules-based models) but cause very few incremental conversions. The SCM reveals this by comparing actual outcomes to counterfactual outcomes.

From Theory to Practice: SCMs in Modern Attribution Tools

The theoretical elegance of SCMs is well established. The practical challenge has been making these methods accessible to marketing teams that do not have PhD statisticians on staff.

Several developments have closed this gap:

Automated DAG Discovery

Modern machine learning methods can learn causal graph structures from data, reducing the need for manual DAG specification. While expert knowledge is still valuable for validation, algorithms like PC, GES, and causal forest methods can propose plausible causal structures.

Bayesian Structural Time Series

Bayesian structural time series (BSTS) models operationalize the counterfactual reasoning of SCMs in a time series framework. They are the engine behind tools like Google's CausalImpact and are used extensively in modern causal attribution platforms.

Productized Causal Attribution

Platforms like Causality Engine have built Pearl's framework into a product that e-commerce brands can use without deep statistical expertise. The SCM runs under the hood, processing data from Meta Ads, Google Ads, TikTok Ads, and Shopify to estimate counterfactual revenue for each channel daily. See pricing details for current plans.

This represents a fundamental shift: the same causal reasoning that Pearl formalized for academic research now drives real-time budget decisions for growing e-commerce brands, including beauty brands, fashion brands, and supplements brands.

Why Marketers Should Care About SCMs

You do not need to learn do-calculus to benefit from SCMs. But understanding the framework helps you evaluate attribution tools critically:

  1. Ask whether the tool measures causation or correlation. If it relies on click tracking and credit allocation rules, it operates on Rung 1 of Pearl's ladder. If it estimates counterfactuals, it operates on Rung 3.

  2. Ask about confounding. Does the tool account for seasonality, promotions, and other factors that confound the relationship between spend and revenue? SCM-based tools handle this structurally.

  3. Ask about incrementality. Does the tool tell you what each channel caused, or what each channel touched? Only causal methods rooted in frameworks like Pearl's SCMs answer the first question.

For a deeper comparison of methodologies, see our guide on rules-based attribution versus causal inference and how machine learning is transforming attribution.

Start Using Causal Attribution

Structural causal models have moved from academic papers to production marketing tools. The brands that adopt these methods now gain a measurement advantage that compounds over time, as every budget decision is informed by genuine causal evidence rather than correlational guesswork.

See Pearl's framework in action with a Causality Engine demo or start your free trial to measure the true incremental impact of your marketing.

Get attribution insights in your inbox

One email per week. No spam. Unsubscribe anytime.

Key Terms in This Article

Attribution Platform

Attribution Platform is a software tool that connects marketing activities to customer actions. It tracks touchpoints across channels to measure campaign impact.

Backdoor Criterion

Backdoor Criterion identifies a sufficient set of variables to control for confounding between a treatment and an outcome. It blocks all 'backdoor paths' on a directed acyclic graph (DAG).

Causal Attribution

Causal Attribution uses causal inference to determine which marketing touchpoints genuinely cause conversions, not just correlate with them.

Confounding Variable

Confounding Variable is an unmeasured factor that influences both the marketing input and the desired outcome, distorting the true impact of a campaign.

Incrementality Testing

Incrementality Testing measures the additional impact of a marketing campaign. It compares exposed and control groups to determine causal effect.

Marketing Attribution

Marketing attribution assigns credit to marketing touchpoints that contribute to a conversion or sale. Causal inference enhances attribution models by identifying true cause-effect relationships.

Multi-Touch Attribution

Multi-Touch Attribution assigns credit to multiple marketing touchpoints across the customer journey. It provides a comprehensive view of channel impact on conversions.

Structural Causal Model (SCM)

Structural Causal Model (SCM) is a mathematical framework representing causal relationships between variables. It uses equations and directed acyclic graphs to describe how variables influence each other.

Related Articles

Ready to see your real numbers?

Upload your GA4 data. See which channels drive incremental sales. Confidence-scored results in minutes.

Book a Demo

Full refund if you don't see it.

Stay ahead of the attribution curve

Weekly insights on marketing attribution, incrementality testing, and data-driven growth. Written for marketers who care about real numbers, not vanity metrics.

No spam. Unsubscribe anytime. We respect your data.

Confident clarity.For every channel.

See which channels actually drive your revenue. Confidence-scored results in minutes — not months. Full refund if you don't see the value.