Python Tutorial : Skewness and Kurtosis

DataCamp · Beginner ·🔢 Mathematical Foundations ·6y ago

Skills: ML Maths Basics60%

Key Takeaways

This video tutorial covers the concepts of skewness and kurtosis in Python, using libraries such as SciPy, to analyze and understand risk in financial returns. It demonstrates how to calculate skewness and kurtosis, interpret their values, and use them to identify non-normal distributions.

Full Transcript

daily volatility and mean give you a good indication of daily risk in return but skewness kurtosis and scaled volatility will help you build a more holistic view of risk the third moment skewness can be thought of as a measure of how much a distribution leans to the left or the right negative skew is a right leaning curve while positive skew is a left leaning curve in finance you would tend to want positive skewness with a higher probability of significantly good returns on the right-hand side of the distribution and the compressed predictable left-hand distribution of negative returns to calculate the skewness of a return distribution simply call the skew function after importing skew from scipy dot stats be sure to drop any n/a values using the drop and a method skewness above zero indicates possible non normality which once again you can expect to find in financial returns kurtosis is a measure of the thickness of the tails of a distribution which can be used as a proxy for the probability of outliers if you recall normal distributions tend to have a kurtosis near 3 most financial returns are leptokurtic which simply means that they tend to have a positive excess kurtosis or kurtosis greater than 3 since kurtosis is so often compared to a normal distribution many functions in Python will automatically return excess kurtosis which is essentially the sample kurtosis minus 3 which helps demonstrate whether the probability of outliers is higher or lower than a normal distribution if excess kurtosis is higher than 0 the kurtosis is higher than a normal distribution inside pi for example the kurtosis function is actually computing excess kurtosis if you wanted to calculate the true sample kurtosis you would actually need to add 3 to the result but for most cases you're going to be interested in excess kurtosis anyways so this functionality is fine as long as you are aware of it in hi excess kurtosis is an indication of high risk when large movements in returns happen often this can be a very bad thing for your portfolio if it moves in the wrong direction high kurtosis distributions are said to have thick tails which means that outliers such as extreme negative and positive returns are more common before we move on let me briefly add one final tool to your arsenal if the kurtosis of a distribution is greater than 3 and the skewness is nonzero the data is most likely non normal but what if the values are close to but not quite normal you can use the Shapiro well Casta tist achill test to estimate the probability that the data is normally distributed the null hypothesis of the Shapiro will test is that the data are normally distributed and when the p-value returned is less than or equal to 0.05 that means you can safely reject the null hypothesis and assume that the data are non normal now it's your turn

Original Description

Want to learn more? Take the full course at https://learn.datacamp.com/courses/introduction-to-portfolio-risk-management-in-python at your own pace. More than a video, you'll learn hands-on coding & quickly apply skills to your daily work. --- Daily volatility and mean give you a good indication of daily risk and return, but skewness, kurtosis, and scaled volatility will help you build a more wholistic view of risk. The third moment, skewness, can be thought of as a measure of how much a distribution leans to the left or right. Negative skew is a right-leaning curve, while positive skew is a left leaning curve. In finance, you would tend to want positive skewness, with a higher probability of significantly good returns on the right hand side of the distribution, and a compressed, predictable left-hand distribution of negative returns. To calculate the skewness of a return distribution, simply call the skew() function after importing skew from scipy dot stats. Be sure to drop any NA values using the dropna() method. Skewness above 0 indicates possible non-normality, which once again you can expect to find in financial returns. Kurtosis is a measure of the thickness of the tails of a distribution, which can be used a proxy for the probability of outliers. If you recall, normal distributions tend to have a kurtosis near 3. Most financial returns are leptokurtic, which simply means that they tend to have positive excess kurtosis, or kurtosis greater than 3. Since kurtosis is so often compared to a normal distribution, many functions in Python will automatically return excess kurtosis, which is essentially the sample kurtosis minus 3, which helps demonstrate whether the probability of outliers is higher or lower than a normal distribution. If excess kurtosis is higher than 0, the kurtosis is higher than a normal distribution. In SciPy, for example, the kurtosis function is actually computing excess kurtosis. If you wanted to calculate the true sample kurtosis,

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from DataCamp · DataCamp · 0 of 60

← Previous Next →

SQL Server Tutorial: Date manipulation

SQL Server Tutorial: Date manipulation

R Tutorial: Intermediate Interactive Data Visualization with plotly in R

R Tutorial: Intermediate Interactive Data Visualization with plotly in R

R Tutorial: Adding aesthetics to represent a variable

R Tutorial: Adding aesthetics to represent a variable

R Tutorial: Moving Beyond Simple Interactivity

R Tutorial: Moving Beyond Simple Interactivity

Python Tutorial: Why use ML for marketing? Strategies and use cases

Python Tutorial: Why use ML for marketing? Strategies and use cases

Python Tutorial: Preparation for modeling

Python Tutorial: Preparation for modeling

Python Tutorial: Machine Learning modeling steps

Python Tutorial: Machine Learning modeling steps

R Tutorial: The prior model

R Tutorial: The prior model

R Tutorial: Data & the likelihood

R Tutorial: Data & the likelihood

R Tutorial: The posterior model

R Tutorial: The posterior model

R Tutorial: An Introduction to plotly

R Tutorial: An Introduction to plotly

R Tutorial: Plotting a single variable

R Tutorial: Plotting a single variable

R Tutorial: Bivariate graphics

R Tutorial: Bivariate graphics

Python Tutorial: Customer Segmentation in Python

Python Tutorial: Customer Segmentation in Python

Python Tutorial: Time cohorts

Python Tutorial: Time cohorts

Python Tutorial: Calculate cohort metrics

Python Tutorial: Calculate cohort metrics

Python Tutorial: Cohort analysis visualization

Python Tutorial: Cohort analysis visualization

R Tutorial: Building Dashboards with flexdashboard

R Tutorial: Building Dashboards with flexdashboard

R Tutorial: Anatomy of a flexdashboard

R Tutorial: Anatomy of a flexdashboard

R Tutorial: Layout basics

R Tutorial: Layout basics

R Tutorial: Advanced layouts

R Tutorial: Advanced layouts

Python Tutorial: Time Series Analysis in Python

Python Tutorial: Time Series Analysis in Python

Python Tutorial: Correlation of Two Time Series

Python Tutorial: Correlation of Two Time Series

Python Tutorial: Simple Linear Regressions

Python Tutorial: Simple Linear Regressions

Python Tutorial: Autocorrelation

Python Tutorial: Autocorrelation

R Tutorial: The gapminder dataset

R Tutorial: The gapminder dataset

R Tutorial: The filter verb

R Tutorial: The filter verb

R Tutorial: The arrange verb

R Tutorial: The arrange verb

R Tutorial: The mutate verb

R Tutorial: The mutate verb

R Tutorial: What is cluster analysis?

R Tutorial: What is cluster analysis?

R Tutorial: Distance between two observations

R Tutorial: Distance between two observations

R Tutorial: The importance of scale

R Tutorial: The importance of scale

R Tutorial: Measuring distance for categorical data

R Tutorial: Measuring distance for categorical data

Python Tutorial: Plotting multiple graphs

Python Tutorial: Plotting multiple graphs

Python Tutorial: Customizing axes

Python Tutorial: Customizing axes

Python Tutorial: Legends, annotations, & styles

Python Tutorial: Legends, annotations, & styles

Python Tutorial: Introduction to iterators

Python Tutorial: Introduction to iterators

Python Tutorial: Playing with iterators

Python Tutorial: Playing with iterators

Python Tutorial: Using iterators to load large files into memory

Python Tutorial: Using iterators to load large files into memory

SQL Tutorial: Introduction to Relational Databases in SQL

SQL Tutorial: Introduction to Relational Databases in SQL

SQL Tutorial: Tables: At the core of every database

SQL Tutorial: Tables: At the core of every database

SQL Tutorial: Update your database as the structure changes

SQL Tutorial: Update your database as the structure changes

Python Tutorial: Classification-Tree Learning

Python Tutorial: Classification-Tree Learning

Python Tutorial: Decision-Tree for Classification

Python Tutorial: Decision-Tree for Classification

Python Tutorial: Decision-Tree for Regression

Python Tutorial: Decision-Tree for Regression

Python Tutorial: Census Subject Tables

Python Tutorial: Census Subject Tables

Python Tutorial: Census Geography

Python Tutorial: Census Geography

Python Tutorial: Using the Census API

Python Tutorial: Using the Census API

R Tutorial: A/B Testing in R

R Tutorial: A/B Testing in R

R Tutorial: Baseline Conversion Rates

R Tutorial: Baseline Conversion Rates

R Tutorial: Designing an Experiment - Power Analysis

R Tutorial: Designing an Experiment - Power Analysis

R Tutorial: Introduction to qualitative data

R Tutorial: Introduction to qualitative data

R Tutorial: Understanding your qualitative variables

R Tutorial: Understanding your qualitative variables

R Tutorial: Making Better Plots

R Tutorial: Making Better Plots

SQL Tutorial: OLTP and OLAP

SQL Tutorial: OLTP and OLAP

SQL Tutorial: Storing data

SQL Tutorial: Storing data

SQL Tutorial: Database design

SQL Tutorial: Database design

Python Tutorial: Introduction to spaCy

Python Tutorial: Introduction to spaCy

Python Tutorial: Statistical Models

Python Tutorial: Statistical Models

Python Tutorial: Rule-based Matching

Python Tutorial: Rule-based Matching

This video tutorial teaches you how to calculate and interpret skewness and kurtosis in Python, and how to use these metrics to analyze financial returns and identify non-normal distributions. By the end of this lesson, you'll be able to apply these concepts to real-world data analysis tasks.

Key Takeaways

Import the necessary libraries, including SciPy
Calculate the skewness of a return distribution using the skew function
Calculate the kurtosis of a return distribution using the kurtosis function
Interpret the values of skewness and kurtosis
Use the Shapiro-Wilk test to estimate the probability that the data is normally distributed

💡 Skewness and kurtosis are important metrics for understanding risk in financial returns, and can be used to identify non-normal distributions.

🔒 Pro feature: Ask AI to explain this lesson →

More on: ML Maths Basics

View skill →

Important Steps I Have Followed To Improve My Data Science Skills- Sharing My Experience

Important Steps I Have Followed To Improve My Data Science Skills- Sharing My Experience

Learn Python FAST for Beginners 🚀#coding #conditionals #loops #functions

Learn Python FAST for Beginners 🚀#coding #conditionals #loops #functions

ChethanAIChronicles

“Hello, world” from scratch on a 6502 — Part 1

“Hello, world” from scratch on a 6502 — Part 1

PCA (Principal Component Analysis) in Python - Machine Learning From Scratch 11 - Python Tutorial

PCA (Principal Component Analysis) in Python - Machine Learning From Scratch 11 - Python Tutorial

ROC and AUC in R

ROC and AUC in R

StatQuest with Josh Starmer

Data Science Fundamentals: Data Cleaning in Python

Data Science Fundamentals: Data Cleaning in Python

Related AI Lessons

Super Mario is mathier than you think

Super Mario's world is full of mathematical concepts, making it a great example of how math is used in real-world problem-solving

MIT Technology Review

A Geometry Puzzle With 3 Circles

Solve a geometry puzzle involving 3 circles using mathematical reasoning and visualization techniques

Medium · Data Science

The Consecutive Integers Divisibility Trick

Learn the Consecutive Integers Divisibility Trick to simplify difficult proofs in mathematics and programming

Medium · Programming

The Mayans Invented Zero Before Most of the World — Here Is Their Number System in Python

Learn about the Mayan number system and its implementation in Python, highlighting the importance of zero in their base-20 system

Medium · Python

How to Open OSM Files (OpenStreetMap Data)

File Extension Geeks