Cohort analysis and retention

How to understand user behaviour over time and reduce churn

9 min

Aggregate metrics hide crucial patterns. Growth in active users can mask declining retention if acquisition compensates for losses. Cohort analysis breaks data down by time-based groups to reveal real engagement and loyalty trends.

This guide explains how to build cohorts, interpret retention curves, calculate churn correctly and turn these analyses into concrete actions that improve the user lifecycle.

What are cohorts and why do they matter?

A cohort is a group of users who share a temporal characteristic: they signed up the same week, made their first purchase the same month, or installed the app in the same period. Grouping by cohort lets you compare how users from different eras behave under similar conditions.

Without cohorts, you mix new users with veterans and metrics lose meaning. A retention rate of "30%" can be excellent if measured at month 6 or disastrous if measured at week 1. Cohorts provide that essential time dimension.

How to build a cohort analysis

The starting point is choosing the anchor event that defines the cohort: sign-up, first purchase, first session. Then you define the time interval (daily, weekly, monthly) and the return metric: what action must the user repeat to be considered "retained"?

In GA4, you can use cohort explorations for basic analysis. For deeper analysis, export to BigQuery and build cohort tables with SQL. Tools like Amplitude or Mixpanel offer advanced cohort analysis without requiring SQL.

  • Define the anchor event that creates the cohort
  • Choose a time interval consistent with your product cycle
  • Select a return metric aligned with business value
  • Compare cohorts against each other to spot improvements or regressions

Interpreting retention curves

A retention curve shows what percentage of a cohort remains active over time. The shape reveals product health: a sharp initial drop followed by a plateau indicates that users who survive the activation phase stick around. A continuous decline signals a systemic value problem.

The ideal curve flattens quickly at a high percentage. Compare your curves against industry benchmarks: a B2B SaaS can expect 90–95% monthly retention, while a consumer app might consider 20–30% at 30 days as good.

Measuring churn correctly

Churn rate is the percentage of users who stop using your product in a given period. It sounds simple, but the calculation is full of pitfalls: do you count from the start or end of the period? Do you exclude new sign-ups? Do you use user churn or revenue churn?

For SaaS, distinguish between gross churn (users lost / total) and net churn (which subtracts expansions). A negative net churn — existing customers generate more revenue than is lost to cancellations — is the hallmark of a thriving business.

  • Gross churn: percentage of users or revenue lost
  • Net churn: subtracts upgrades and expansions from existing customers
  • Revenue churn vs user churn: losing one large customer differs from losing several small ones
  • Involuntary churn: expired cards, payment failures, not user negligence

Turning analysis into action

Cohort analysis becomes valuable when it generates actions. If the January cohort retains better than the February one, investigate what changed: new onboarding? pricing change? a different acquisition campaign that brought less qualified users?

Segment cohorts by acquisition channel, pricing plan or behaviour in the first few days. Users who complete onboarding within the first 24 hours typically retain 2–3 times better. Identifying these patterns lets you optimise activation and reduce early churn.

Tools for cohort analysis

GA4 offers basic cohort explorations within its explorations section. For more sophisticated analysis, Amplitude and Mixpanel let you define complex cohorts with multiple conditions and visualise retention with granular segmentation.

If your data lives in BigQuery or your own warehouse, building cohort tables with SQL gives you maximum flexibility. You can cross-reference product data with CRM, payment or support data to understand retention from multiple angles.

  • GA4 Explorations: basic but accessible and free
  • Amplitude / Mixpanel: product analytics with advanced cohorts
  • BigQuery + SQL: maximum flexibility for complex datasets
  • Looker Studio: visualisation of cohorts built in BigQuery

Key Takeaways

  • Aggregate metrics hide retention problems that cohorts reveal
  • The shape of the retention curve indicates product health
  • Distinguish between gross and net churn to understand real impact
  • Segment cohorts by channel, plan and behaviour to find patterns
  • Users who activate quickly retain significantly better

Want to understand why your users leave?

We analyse your retention data with cohorts, identify drop-off points and design strategies to reduce churn.