User retention analysis framework | data science product sense

Sophia Yang · Intermediate ·📋 Product Management ·4y ago

Key Takeaways

This video covers a user retention analysis framework from a data science perspective, including measuring retention across three dimensions and using analytical frameworks to identify the 'aha moment', with tools such as retention curves and cohort analysis.

Full Transcript

hello everyone my name is sophia i am a data scientist welcome to my video in this video i plan to talk about a retention analysis framework from a data science perspective i will talk about the following topics three dimensions of measuring retention time user status and action analytical frameworks of retention analysis how to find the aha moment and a happy moment through analysis and investigate why users stay and leave so if you're interested in learning about retention analysis or interested in learning how to answer the business business growth frameworks aarrr stands for acquisition activation retention revenue and referral but when you think about it it doesn't make sense to put acquisition first it doesn't matter how many new users you get it doesn't mean anything if you can't keep them stay and also of course acquisition strategies like ads are very expensive it is a lot cheaper to retain a user than to get a new user and for many other reasons some people later reprioritized the aarr framework to the rar framework with retention as the number one priority the rar framework stands for retention activation referral revenue and acquisition so the first question that comes to mind might be how do we define and measure retention what is retention retention measures how many users return to your product over some specified time if you look at this definition you might see some problems with this definition what do we what do we mean by users what do we mean by coming back and what do we mean by time those are really good questions and those are the exact three dimensions i'm going to talk about understanding those measures and dimensions will help you choose the right one for your product the first dimension is time the most popular retention measure that everyone uses is the end day retention it measures that among users who first use the product at day zero or proportion of them are still active at day in here day could also be week or month for a one-week retention measure or a n month retention measure whether to use daily weekly or monthly attention totally depends on your product and how often users use your product for example for gaming products with high stickiness it is typical to measure and day retention on a daily basis unbounded retention also called rolling retention measures among users who first use the products at day zero or proportion of them are still active on and after they end bracket retention is more flexible you can define whichever time frame you're interested in for example pinterest measures the percentage of new signups that are still doing key actions during doing a one-week time window of 28 to 35 days after sign up we'll talk about what is the key actions later the second dimension i want to talk about is user status in addition to time user status is often another important dimension to consider so how do we define user status there are many ways to define user statuses you might see different definitions from different companies and different products people have their own way to define user statuses so here is just one way and one example that user status can be defined new user churn user inactive user reactive user and active user it is often important to calculate retention for different user statuses let's think about it new user retention measures the proportion of new users who stay active active user retention measures the proportion of active users who stays active in the example of feature changes we might see that active user retention goes down because users have already gotten used to the product however in the meantime the new user retention could go up suggesting that this feature change could be an improvement improvement in the long run the third dimension i want to talk about is action when we say users use a product we didn't define what we mean by use should we define use as visiting the product page staying for a certain amount of time conducting certain actions or purchasing a product we call the actions we use to calculate retention key actions with so many measures and all those dimensions which one do we use and where do we start for the first dimension time i will start with the classic end day week or month retention and investigate the rest later to determine the time interval i would plot how often users use the product plot the percentage percentage of users who use a product x amount of times with various intervals and plot retention with those various intervals to see which one makes sense for your product it's okay to include more than one time interval in your analysis for the second dimension user status i will start with the new user retention and investigate the active retention active user retention later especially if you are developing our new product third for the third dimension uh user actions it actually depends on the product goal are you more investigated in monetization values or are you at a stage to grow and retain your users through certain actions um yeah so really depends on your product and your business case next let's talk about the analytical frameworks of retention analysis the first thing is retention curve which is widely used everyone probably knows a retention curve plots the retention over time with time on the x-axis and retention rate on the y-axis ideally we would like to see a smiling curve when users come back more and more over time a declining curve signifies danger and a flight flattening curve signifies a healthy product the goal for a product is to shift the curve up and flatten or up till the curve it is often common to also do a retention cohort analysis cohort analysis tells us where do we see products do well and not well we can define cohorts from many different categories such as demographics acquisitions and behavior in terms of the graphical representations we can represent the retention cohorts in a grouped retention curve or a triangle retention chart here is an example with the acquisition time as the cohort and finally we have statistical analysis like any other types of analysis we can start by calculating descriptive statistics and correlations we can also conduct survival analysis to determine which attributes matter in addition the rfm retention frequency monitoring analysis framework is often popular for user segmentations so where should we start with analysis a starting point would be to see a dashboard showing the retention curve and triangle retention charts for all the interesting cohorts and groups then we can generate generate insights from the dashboard and starting from there well the important value of retention analysis is that we can do some really concrete analysis to generate insights for example for new users we can try to find the aha moment throughout our analysis for new users the most important thing is the onboarding experience that happens within the first few days or weeks almost all products wants to help users to achieve the aha moment as quickly as possible the question is how do how do we define a user's aha moment what is the user's behavior that contribute to the aha moment for example facebook focuses on getting seven friends in 10 days the behavior getting friends and the magic number seven friends in 10 days indicates users future success i don't think that we all should find a behavior with a magic number for all of our products but i do think it is important to figure out what behavior contribute to the aha moment from our data then the product team can use this information to improve our onboarding experience a starting point of this analysis is to find the user event and the onboarding behaviors that are related to retention in future weeks and month so simple correlation analysis or descriptive statistics will show you this kind of insight for long-term users we want to find the habit moment how do the most successful long-term users form a habit to use a product who are those users what features do they keep using what attributes do they have and what is the user journey for them the goal is to find successful feature features and the user journeys investing those successful features and encouraging new users to follow successful user journeys again we can do some descriptive analysis on long-term users to identify those user behaviors and features and then conduct cohort analysis to find user journeys for those people okay so you might wonder for a given product why do people choose to leave and stay there might be many many reasons why people leave and stay there might be many reasons why people choose to leave and churn for example there might not be they might not understand the product the product might not be might be hard to use people prefer a competitor's product the product might have some issues like bugs or being slow um the new user is not the target user there could be a mismatch between the users and the core features and also people might only need a product for a very short amount a short period of time and they don't need our product anymore and then there could be many reasons why people stay for example people may love the product the personalized notifications work it has become a habit for people to use the product um [Music] and many more those reasons behind why users stay and leave might be hard to derive directly from the product data instead doing user research and experiments can help for example conducting experiments on a better onboarding experience might help people understand the product the product better doing during onboarding experiments on push notifications might help remind people on the values of the product and experiments on that on various product features might help identify and promote the key features people love understanding the reason is hard collaboration here is the key to understand the users we can collaborate with the ux research team to help design the surveys and interviews and generate insights to run experiments us data scientists can collaborate with the engineers to to design experiments and analyze results hopefully this video provides you some perspectives on how to get started on your retention analysis and how to deep dive for future analysis please let me know if you have any thoughts or comments i would love to hear them thank you

Original Description

Hi there, my name is Sophia Yang. I'm a data scientist. This video will walk you through a user retention analysis framework from a data science perspective. This video will cover three dimensions of measuring retention (time, user status, action), analytical frameworks, how to find the “aha moment” and the “habit moment” through analysis, and investigate why users leave and stay. ⭐ Stay in touch: Medium: https://sophiamyang.medium.com/ Twitter: https://twitter.com/sophiamyang Linkedin: https://www.linkedin.com/in/sophiamyang/ #retention #analysis #datascience #datascientist
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from Sophia Yang · Sophia Yang · 10 of 60

1 Customer lifetime value in a discrete-time contractual setting (math and Python implementation)
Customer lifetime value in a discrete-time contractual setting (math and Python implementation)
Sophia Yang
2 Time series analysis using Prophet in Python — Math explained
Time series analysis using Prophet in Python — Math explained
Sophia Yang
3 Multiclass logistic/softmax regression from scratch
Multiclass logistic/softmax regression from scratch
Sophia Yang
4 Deploy a Python Visualization Panel App to Google Cloud App Engine
Deploy a Python Visualization Panel App to Google Cloud App Engine
Sophia Yang
5 Deploy a Python Visualization Panel App to Google Cloud Run
Deploy a Python Visualization Panel App to Google Cloud Run
Sophia Yang
6 [Read a paper (with code)] Beyond Accuracy: Behavioral Testing of NLP models with CheckList
[Read a paper (with code)] Beyond Accuracy: Behavioral Testing of NLP models with CheckList
Sophia Yang
7 5-step data science workflow
5-step data science workflow
Sophia Yang
8 Multi-armed bandit algorithms - ETC Explore then Commit
Multi-armed bandit algorithms - ETC Explore then Commit
Sophia Yang
9 Multi-armed bandit algorithms - Epsilon greedy algorithm
Multi-armed bandit algorithms - Epsilon greedy algorithm
Sophia Yang
User retention analysis framework | data science product sense
User retention analysis framework | data science product sense
Sophia Yang
11 Visualization and Interactive Dashboard in Python: My favorite Python Viz tools — HoloViz
Visualization and Interactive Dashboard in Python: My favorite Python Viz tools — HoloViz
Sophia Yang
12 Multi-armed bandit algorithms: Thompson Sampling
Multi-armed bandit algorithms: Thompson Sampling
Sophia Yang
13 The Easiest Way to Create an Interactive Dashboard in Python
The Easiest Way to Create an Interactive Dashboard in Python
Sophia Yang
14 Big Data Visualization Using Datashader in Python | How does Datashader work and why is it so fast?
Big Data Visualization Using Datashader in Python | How does Datashader work and why is it so fast?
Sophia Yang
15 Why do you want to be a data scientist? Don't be a data scientist if ...
Why do you want to be a data scientist? Don't be a data scientist if ...
Sophia Yang
16 Johnny Depp v Amber Heard Twitter Sentiment Analysis | Is Camille Vasquez the real winner | 🤗 NLP
Johnny Depp v Amber Heard Twitter Sentiment Analysis | Is Camille Vasquez the real winner | 🤗 NLP
Sophia Yang
17 How to build a product that sells itself | Product-led Growth | Book Summary | Read a book with me
How to build a product that sells itself | Product-led Growth | Book Summary | Read a book with me
Sophia Yang
18 Designing Machine Learning Systems | book summary | Read a book with me
Designing Machine Learning Systems | book summary | Read a book with me
Sophia Yang
19 Where do data scientists/analysts go next? Love and hate in data analytics (ft. Shashank Kalanithi)
Where do data scientists/analysts go next? Love and hate in data analytics (ft. Shashank Kalanithi)
Sophia Yang
20 Meet the Author: Fundamentals of Data Engineering | DS/ML book club
Meet the Author: Fundamentals of Data Engineering | DS/ML book club
Sophia Yang
21 What's new in hvPlot releases 0.8.0 & 0.8.1?
What's new in hvPlot releases 0.8.0 & 0.8.1?
Sophia Yang
22 Meet the Author: Machine Learning Design Patterns | What do ML/Research Engineers do at Google?
Meet the Author: Machine Learning Design Patterns | What do ML/Research Engineers do at Google?
Sophia Yang
23 Machine Learning Design Patterns | Google Executive | Investor | Meet the Author
Machine Learning Design Patterns | Google Executive | Investor | Meet the Author
Sophia Yang
24 How to solve data quality issues | Data Reliability | Meet the Author
How to solve data quality issues | Data Reliability | Meet the Author
Sophia Yang
25 Reliable Machine Learning author interview | DS/ML book club
Reliable Machine Learning author interview | DS/ML book club
Sophia Yang
26 Toronto VLOG | First vlog | Meet my favorite author | Toronto ML Summit conference
Toronto VLOG | First vlog | Meet my favorite author | Toronto ML Summit conference
Sophia Yang
27 TOP 6 tech news in 2022 #shorts
TOP 6 tech news in 2022 #shorts
Sophia Yang
28 How to deploy a Panel app to Hugging Face using Docker?
How to deploy a Panel app to Hugging Face using Docker?
Sophia Yang
29 Tech news this week | ChatGPT, Hacks, Snowflake, CES #shorts
Tech news this week | ChatGPT, Hacks, Snowflake, CES #shorts
Sophia Yang
30 🗞️ Tech news this week: ChatGPT, DreamerV3, Muse, VALL-E, Mineral, DoNotPay, Tesla, SBF... #shorts
🗞️ Tech news this week: ChatGPT, DreamerV3, Muse, VALL-E, Mineral, DoNotPay, Tesla, SBF... #shorts
Sophia Yang
31 Tech news this week | Boston Dynamics, Microsoft, Snowflake, Google, and more #shorts
Tech news this week | Boston Dynamics, Microsoft, Snowflake, Google, and more #shorts
Sophia Yang
32 The story of Metaflow | Effective Data Science Infrastructure | Book author interview
The story of Metaflow | Effective Data Science Infrastructure | Book author interview
Sophia Yang
33 Tech news this week #shorts
Tech news this week #shorts
Sophia Yang
34 A day in life of a data scientist | Data Day Texas | Interview 12 authors/speakers
A day in life of a data scientist | Data Day Texas | Interview 12 authors/speakers
Sophia Yang
35 Tech news this week #shorts
Tech news this week #shorts
Sophia Yang
36 Explainable AI with Shapley Values (Part 1: Game Theory)
Explainable AI with Shapley Values (Part 1: Game Theory)
Sophia Yang
37 Explainable AI with Shapley Values (Part 2: Estimate Shapley Values)
Explainable AI with Shapley Values (Part 2: Estimate Shapley Values)
Sophia Yang
38 Explainable AI with Shapley Values (Part 3: KernelSHAP)
Explainable AI with Shapley Values (Part 3: KernelSHAP)
Sophia Yang
39 Tech news this week | AI search war between Microsoft and Google #shorts
Tech news this week | AI search war between Microsoft and Google #shorts
Sophia Yang
40 The Story of ChatGPT's creator OpenAI | From Riches to Fame
The Story of ChatGPT's creator OpenAI | From Riches to Fame
Sophia Yang
41 Explainable AI for Practitioners | Must-read for XAI | author interview
Explainable AI for Practitioners | Must-read for XAI | author interview
Sophia Yang
42 Train your own language model with nanoGPT | Let’s build a songwriter
Train your own language model with nanoGPT | Let’s build a songwriter
Sophia Yang
43 The easiest way to work with large language models | Learn LangChain in 10min
The easiest way to work with large language models | Learn LangChain in 10min
Sophia Yang
44 The BEST browser? AI article summary, image generation, website insights. Microsoft Edge Copilot!
The BEST browser? AI article summary, image generation, website insights. Microsoft Edge Copilot!
Sophia Yang
45 startup scene in data | insights from 50+ data startups from Data Council
startup scene in data | insights from 50+ data startups from Data Council
Sophia Yang
46 NLP with Transformers author interview with Lewis Tunstall from Hugging Face
NLP with Transformers author interview with Lewis Tunstall from Hugging Face
Sophia Yang
47 4 ways to do question answering in LangChain | chat with long PDF docs | BEST method
4 ways to do question answering in LangChain | chat with long PDF docs | BEST method
Sophia Yang
48 5 Steps to Build a Question Answering PDF Chatbot: LangChain + OpenAI + Panel + HuggingFace.
5 Steps to Build a Question Answering PDF Chatbot: LangChain + OpenAI + Panel + HuggingFace.
Sophia Yang
49 4 Autonomous AI Agents: “Westworld” simulation, Camel, BabyAGI, AutoGPT, Camel ⭐ LangChain ⭐
4 Autonomous AI Agents: “Westworld” simulation, Camel, BabyAGI, AutoGPT, Camel ⭐ LangChain ⭐
Sophia Yang
50 MiniGPT4: image understanding & open-source!
MiniGPT4: image understanding & open-source!
Sophia Yang
51 BEST Practices in Prompt Engineering: Learnings and Thoughts from Andrew Ng's New Course
BEST Practices in Prompt Engineering: Learnings and Thoughts from Andrew Ng's New Course
Sophia Yang
52 Designing Machine Learning Systems author interview with Chip Huyen
Designing Machine Learning Systems author interview with Chip Huyen
Sophia Yang
53 Tech news this week: code interpreter, Mojo, Redpajama, MPT7b, StarCoder #shorts
Tech news this week: code interpreter, Mojo, Redpajama, MPT7b, StarCoder #shorts
Sophia Yang
54 🤗 Hugging Face Transformers Agent | LangChain comparisons
🤗 Hugging Face Transformers Agent | LangChain comparisons
Sophia Yang
55 📢 Tech news this week #shorts
📢 Tech news this week #shorts
Sophia Yang
56 📢 Tech news this week #shorts
📢 Tech news this week #shorts
Sophia Yang
57 The BEST ChatGPT Plugins | Brand NEW Bing Search | Web browsing, CODING, summarizing, and more
The BEST ChatGPT Plugins | Brand NEW Bing Search | Web browsing, CODING, summarizing, and more
Sophia Yang
58 Tech news this week #shorts #short
Tech news this week #shorts #short
Sophia Yang
59 📢 Tech news this week #shorts
📢 Tech news this week #shorts
Sophia Yang
60 Deep Learning with PyTorch Author Interview with Eli Stevens, Luca Antiga, and Thomas Viehmann
Deep Learning with PyTorch Author Interview with Eli Stevens, Luca Antiga, and Thomas Viehmann
Sophia Yang

This video teaches a user retention analysis framework from a data science perspective, covering key concepts such as retention curves and cohort analysis, and providing practical steps for applying these tools to inform product strategy and improve user retention.

Key Takeaways
  1. Define user status and retention metrics
  2. Plot retention curves to visualize user retention over time
  3. Conduct cohort analysis to identify trends and patterns
  4. Collaborate with data scientists, engineers, and UX researchers to inform product development
  5. Apply statistical analysis to calculate descriptive statistics and correlations
  6. Use survival analysis to understand user behavior
💡 The 'aha moment' can be identified through cohort analysis and user research, and is critical to informing product strategy and improving user retention.

Related Reads

📰
Communications: The Forgotten Ingredient in Innovation
Effective communication is crucial for successful innovation, not just programming skills
Medium · AI
📰
Taking a break helped me get back to my side project
Taking breaks can help you recharge and come back to your side projects with renewed energy and focus
Dev.to · Harsh
📰
I Built 25 Free Historical Life Simulators. Here's What the Process Taught Me About Decision-Making.
Building 25 historical life simulators taught the author valuable lessons about decision-making, highlighting the importance of experimentation, risk-taking, and learning from failure.
Dev.to · Văn Tuấn Lê
📰
Adding one field to Notion cost me 2.5 hours. The same change in Tana took 30 seconds.
Compare the efficiency of Notion and Tana for workflow management by implementing a simple change, and learn why Tana outperformed Notion in this test
Dev.to · 강해수
Up next
One Wing Won't Fly | Gurmukh Singh Bawa | TEDxDWPS Ludhiana Youth
TEDx Talks
Watch →