AutoPlotter - A GUI based Exploratory Data Analysis in Python

Data Professor · Beginner ·🎨 Image & Video AI ·5y ago

Key Takeaways

The video demonstrates how to use the AutoPlotter library in Python for exploratory data analysis, covering its three main functionalities: Data Preview, Plots, and Analysis. It showcases the library's capabilities, including data filtering, sorting, and visualization, using the NBA player stats dataset as an example.

Full Transcript

exploratory data analysis is one of the most important part of the data science process because it allows you to gain an understanding of your data set and such preliminary understanding will allow you to plan your own data science projects as well as shaping the direction at which the project is heading for and so in this video we're going to be taking a look at the auto plotter python library which will allow you to perform a point-and-click or gui based exploratory data analysis of your own data set and without further ado we're starting right now so this illustration was drawn by myself in collaboration with kenji and so you could check out this particular article and also the video described in this article as well and so i'll provide that in the video description and so let's head over to the auto plotter on the pi pi as you can see here it will allow you to do a do i based exploratory data analysis and this report that will be generated by auto plotter library is built on top of dash and dash is a python library from the same creator of plotly which allows you to build an intuitive and interactive web application to explore your data sets so let me know in the comments if you're interested in a tutorial video of dash so let's head over to the google code lab that i've created and the first thing that you want to do is you want to install the auto plotter library and in order to do that you could type in pip install auto plotter and let's do that all right and it's finished and now let's import the libraries and we're gonna import the run app from auto plotter and then we're also going to import pandas as pd and now we're going to run it all right and we're going to import the nba player stats data set that was mentioned in one of the prior videos on my youtube channel and you could get the data set from the github of data professor and so let's have a look at it as you can see it is the data set of the national basketball association or nba and it has all of the player stats data and it was obtained from the basketballreference.com website and i'll provide you the link to the video describing the web scraping of this particular data set let's head over back to the notebook and we're going to run the cell so that we will assign it to the df variable and we're using the pandas read csv function and let's have a look at the data frame that we have read in and as you can see here it is comprised of 708 rows 29 columns and you can see that we could simply run the app by using the run app function which we have imported prior to this and the input argument here will be df which is the data set and then you could decide on whether you want to run it in either the mode of inline or external mode and so inline will allow you to run it directly from within the google code lab or a jupyter notebook and if you and if you run it in an external mode you could run it in a separate window of its own so let me show you both of them let's start with the inline and it's loading and now you can see the data frame and notice that you could navigate through each of the page and each page will be comprised of approximately let me count 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 15 rows and you could scroll the mouse to the left and right and you could click on the sorting icon here in order to sort the data and you could also filter the data by typing in the operators like for example if you would like data greater than 0.5 just type greater than 0.5 hit on enter and now the data will only be those having values of greater than 0.5 and now you see that the number of pages reduced to 23. let's try less than 0.5 let's try less than 0.1 okay so you see that the total pages has decreased over here have okay this one works shooting guard center or how about the team t-o-t l-a-l lakers okay so the team works the position works and how about age let's say that we want players greater than 30 years old we could type in greater than 30 and we see that there is approximately eight pages here and how about less than 30. so we can see that most of the players in the nba were aged less than 30. so there's 38 pages in comparison to eight pages here all right so this is for quickly exploring the data sets in the data frame and this is right here in the data preview icon and let's head over to the plots icon here click on it and you can see that you could use either gc plot 2 seaborn plotly plotly dark or also the presentation version and please note that the presentation version is essentially in white background and let's give heat map a try and you could select x-axis y-axis data let's go with the team and then y-axis let's go with the position and the value let's select the age and then here you're going to see that the age ranges from 20 to approximately 35. you can see according to the color for each of the positions here against the team here you can see the relative age of the players so if it's yellow then the player is older and if it's purple then the player is younger so you can see the relative age between the positions and the team and let's choose another parameter here let's try the three point percentage and here you're going to see that atlanta has a shooting guard which has pretty high three point percentage and another team would be milwaukee and indiana for power forward and shooting guards choose another one about two point percentage all right and now you're gonna see that evenly distributed throughout teams in the nba and so you could also try out other types of graphs as well and let's head over to the analysis part click on the icon here and the first tab here is data distribution and so here it is selecting player and let me deselect it and let's select the team and then you're gonna see the histogram for each team you're gonna see the count of the occurrence here and let's try another let's select another one let's say age and then scroll down you're gonna see the histogram of the age distribution let's try another let's try three point percent and then you're gonna see three point percent here so there are zero percent and then in the range of about 0.1 until approximately 0.524 where you could see that there is a greater portion at approximately 0.275 until 3 7 all right and on top here you can see the box plot and let's see let's click on the distribution and then you're going to see let's wait for it all right for the distribution of h you're going to see it as a line here same for the distribution of three points okay because this is categorical data let's click on the statistical analysis and so you're gonna see the descriptive statistics for the count mean standard deviation minimum value maximum value 25 percent 50 and 75 quartile for each of the columns here let's click on the association tab and then you're gonna be able to compute the correlation between let's say age and the games played and there's a pretty low correlation so there's no correlation there let's try age and about three point percent no correlation there about free throw is there any correlation not significant but slightly higher how about free throw and three point somewhat higher correlation okay so let's try the external version run it and then you're going to notice that it creates this url click on it and from within collab it will launch this new window and then you could also perform the similar analysis that i've mentioned directly in here okay and so if you're finding value in this video please give it a thumbs up subscribe if you haven't yet already make sure to hit on the notification bell so that you will be notified of the next video and as always the best way to learn data science is to do data science please enjoy the journey

Original Description

In this video, I will be providing a high-level overview showing you how to perform exploratory data analysis (EDA) in Python using the AutoPlotter library. In a nutshell, there are 3 main functionalities in AutoPlotter: (1) Data Preview, (2) Plots and (3) Analysis. 🌟 Join as a Member to support this Channel: https://www.youtube.com/channel/UCV8e2g4IWQqK71bbzGDEI4Q/join 🌟 Download Kite for FREE https://www.kite.com/get-kite/?utm_medium=referral&utm_source=youtube&utm_campaign=dataprofessor&utm_content=description-only ⭕ Links for this video: - GitHub https://github.com/dataprofessor/autoplotter - EDA in Python using Pandas https://youtu.be/9m4n2xVzk9o - AutoPlotter https://pypi.org/project/autoplotter/ ⭕ Support my work: 🌟 Subscribe to the Coding Professor channel https://www.youtube.com/channel/UCJzlfIoF8nmWqJIv_iWQVRw?sub_confirmation=1 🌟 Subscribe to the Data Professor https://www.youtube.com/dataprofessor?sub_confirmation=1 🌟 Join the Newsletter of Data Professor http://newsletter.dataprofessor.org 🌟 Buy me a coffee https://www.buymeacoffee.com/dataprofessor ⭕ Recommended Books: 🌟https://kit.co/dataprofessor ✅ Python Basics: A Practical Introduction to Python 3 https://amzn.to/3awdWgm ✅ Learn Python Programming (The no-nonsense, beginner's guide) https://amzn.to/2RFpSpn ✅ Learn to Program with Minecraft https://amzn.to/3x2MujZ ✅ Automate the Boring Stuff with Python, 2nd Edition: Practical Programming for Total Beginners https://amzn.to/2QzkyDs ⭕ Disclaimer: Recommended books and tools are affiliate links that gives me a portion of sales at no cost to you, which will contribute to the improvement of this channel's contents. ⭕ Stock photos, graphics and videos used on this channel: ✅ https://1.envato.market/c/2346717/628379/4662 ⭕ Credit: - Icon of ring in video thumbnail image is created by Freepik (https://www.freepik.com) via Flaticon (http://www.flaticon.com) - Icon of Python logo is created by Iconfinder https://www.iconfinder.com/icons/451
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from Data Professor · Data Professor · 0 of 60

← Previous Next →
1 How a Biologist became a Data Scientist
How a Biologist became a Data Scientist
Data Professor
2 WEKA Tutorial #1.1 - How to Build a Data Mining Model from Scratch
WEKA Tutorial #1.1 - How to Build a Data Mining Model from Scratch
Data Professor
3 WEKA Tutorial #1.2 - How to Build a Data Mining Model from Scratch
WEKA Tutorial #1.2 - How to Build a Data Mining Model from Scratch
Data Professor
4 WEKA Tutorial #1.3 - How to Build a Data Mining Model from Scratch
WEKA Tutorial #1.3 - How to Build a Data Mining Model from Scratch
Data Professor
5 Computational Drug Discovery: Machine Learning for Making Sense of Big Data in Drug Discovery
Computational Drug Discovery: Machine Learning for Making Sense of Big Data in Drug Discovery
Data Professor
6 Quotes #1 on Big Data and Data Science
Quotes #1 on Big Data and Data Science
Data Professor
7 Quotes #2 on Big Data and Data Science
Quotes #2 on Big Data and Data Science
Data Professor
8 Quotes #3 on Big Data and Data Science
Quotes #3 on Big Data and Data Science
Data Professor
9 Quotes #4 on Big Data and Data Science
Quotes #4 on Big Data and Data Science
Data Professor
10 Quotes #5 on Big Data and Data Science
Quotes #5 on Big Data and Data Science
Data Professor
11 Data Science 101: Starting a Data Science / Data Mining Project
Data Science 101: Starting a Data Science / Data Mining Project
Data Professor
12 Data Science 101: CRISP-DM - Data Mining / Data Science in 6 Steps
Data Science 101: CRISP-DM - Data Mining / Data Science in 6 Steps
Data Professor
13 R Programming 101: How to Define Variables
R Programming 101: How to Define Variables
Data Professor
14 R Programming 101: Read and Write CSV files
R Programming 101: Read and Write CSV files
Data Professor
15 Data Science 101: Basic Command-Line for Data Science
Data Science 101: Basic Command-Line for Data Science
Data Professor
16 Strategies for Learning Data Science in 2020 (Data Science 101)
Strategies for Learning Data Science in 2020 (Data Science 101)
Data Professor
17 Building your Data Science Portfolio with GitHub (Data Science 101)
Building your Data Science Portfolio with GitHub (Data Science 101)
Data Professor
18 R Programming 101: Setting up R programming environment (R, RStudio and RStudio.cloud)
R Programming 101: Setting up R programming environment (R, RStudio and RStudio.cloud)
Data Professor
19 Exploratory Data Analysis in R: Towards Data Understanding
Exploratory Data Analysis in R: Towards Data Understanding
Data Professor
20 Exploratory Data Analysis in R: Quick Dive into Data Visualization
Exploratory Data Analysis in R: Quick Dive into Data Visualization
Data Professor
21 Machine Learning in R: Building a Classification Model
Machine Learning in R: Building a Classification Model
Data Professor
22 Machine Learning in R: Repurpose Machine Learning Code for New Data
Machine Learning in R: Repurpose Machine Learning Code for New Data
Data Professor
23 Data Science 101: Deploying your Machine Learning Model
Data Science 101: Deploying your Machine Learning Model
Data Professor
24 Machine Learning in R: Deploy Machine Learning Model using RDS
Machine Learning in R: Deploy Machine Learning Model using RDS
Data Professor
25 Data Pre-processing in R: Handling Missing Data
Data Pre-processing in R: Handling Missing Data
Data Professor
26 Machine Learning in R: Speed up Model Building with Parallel Computing
Machine Learning in R: Speed up Model Building with Parallel Computing
Data Professor
27 Data Science 101: Overview of Machine Learning Model Building Process
Data Science 101: Overview of Machine Learning Model Building Process
Data Professor
28 Web Apps in R: Building your First Web Application in R | Shiny Tutorial Ep 1
Web Apps in R: Building your First Web Application in R | Shiny Tutorial Ep 1
Data Professor
29 Web Apps in R: Build Interactive Histogram Web Application in R | Shiny Tutorial Ep 2
Web Apps in R: Build Interactive Histogram Web Application in R | Shiny Tutorial Ep 2
Data Professor
30 Web Apps in R: Building Data-Driven Web Application in R | Shiny Tutorial Ep 3
Web Apps in R: Building Data-Driven Web Application in R | Shiny Tutorial Ep 3
Data Professor
31 Web Apps in R: Building the Machine Learning Web Application in R | Shiny Tutorial Ep 4
Web Apps in R: Building the Machine Learning Web Application in R | Shiny Tutorial Ep 4
Data Professor
32 Web Apps in R: Build BMI Calculator web application in R for health monitoring | Shiny Tutorial Ep 5
Web Apps in R: Build BMI Calculator web application in R for health monitoring | Shiny Tutorial Ep 5
Data Professor
33 Machine Learning in R: Building a Linear Regression Model
Machine Learning in R: Building a Linear Regression Model
Data Professor
34 What programming language to learn for Data Science? R versus Python
What programming language to learn for Data Science? R versus Python
Data Professor
35 How to Become a Data Scientist (Learning Path and Skill Sets Needed)
How to Become a Data Scientist (Learning Path and Skill Sets Needed)
Data Professor
36 Using Python in R
Using Python in R
Data Professor
37 Interpretable Machine Learning Models
Interpretable Machine Learning Models
Data Professor
38 Making Scatter Plots in R [Data Visualisation in R series]
Making Scatter Plots in R [Data Visualisation in R series]
Data Professor
39 Machine Learning in Python: Building a Classification Model
Machine Learning in Python: Building a Classification Model
Data Professor
40 Compare Machine Learning Classifiers in Python
Compare Machine Learning Classifiers in Python
Data Professor
41 Hyperparameter Tuning of Machine Learning Model in Python
Hyperparameter Tuning of Machine Learning Model in Python
Data Professor
42 Practical Introduction to Google Colab for Data Science
Practical Introduction to Google Colab for Data Science
Data Professor
43 File Handling in Google Colab for Data Science
File Handling in Google Colab for Data Science
Data Professor
44 Pandas for Data Science: Create and Combine DataFrames / Rename Columns
Pandas for Data Science: Create and Combine DataFrames / Rename Columns
Data Professor
45 Machine Learning in Python: Building a Linear Regression Model
Machine Learning in Python: Building a Linear Regression Model
Data Professor
46 Machine Learning in Python: Principal Component Analysis (PCA) for Handling High-Dimensional Data
Machine Learning in Python: Principal Component Analysis (PCA) for Handling High-Dimensional Data
Data Professor
47 How to Plot an ROC Curve in Python | Machine Learning in Python
How to Plot an ROC Curve in Python | Machine Learning in Python
Data Professor
48 Installing conda on Google Colab for Data Science
Installing conda on Google Colab for Data Science
Data Professor
49 Use native R on Google Colab for Data Science
Use native R on Google Colab for Data Science
Data Professor
50 How to Save and Download files from Google Colab
How to Save and Download files from Google Colab
Data Professor
51 Easy Web Scraping in Python using Pandas for Data Science
Easy Web Scraping in Python using Pandas for Data Science
Data Professor
52 Data Science for Computational Drug Discovery using Python (Part 1)
Data Science for Computational Drug Discovery using Python (Part 1)
Data Professor
53 Pandas Profiling for Data Science (Quick and Easy Exploratory Data Analysis)
Pandas Profiling for Data Science (Quick and Easy Exploratory Data Analysis)
Data Professor
54 Exploratory Data Analysis in Python using pandas
Exploratory Data Analysis in Python using pandas
Data Professor
55 Quick tour of PyCaret (a low-code machine learning library in Python)
Quick tour of PyCaret (a low-code machine learning library in Python)
Data Professor
56 How to Upload Files to Google Colab
How to Upload Files to Google Colab
Data Professor
57 How to Install and Use Pandas Profiling on Google Colab
How to Install and Use Pandas Profiling on Google Colab
Data Professor
58 How to Adjust the Style of Pandas DataFrame
How to Adjust the Style of Pandas DataFrame
Data Professor
59 How to use Bamboolib for Data Wrangling in Data Science
How to use Bamboolib for Data Wrangling in Data Science
Data Professor
60 How to use Pandas Profiling on Kaggle
How to use Pandas Profiling on Kaggle
Data Professor

This video teaches how to use AutoPlotter for exploratory data analysis in Python, covering data preview, plots, and analysis. It provides a hands-on example using the NBA player stats dataset.

Key Takeaways
  1. Install AutoPlotter using pip
  2. Import necessary libraries
  3. Load the NBA player stats dataset
  4. Run the AutoPlotter app
  5. Explore data using Data Preview, Plots, and Analysis tabs
💡 AutoPlotter provides an intuitive and interactive way to perform exploratory data analysis in Python, making it easier to understand and visualize datasets.

Related AI Lessons

FREE AI Sin City Photo Generator — Turn Any Photo Into High-Contrast Noir Art (2026)
Transform any photo into a Sin City-inspired high-contrast noir art using a free AI generator
Dev.to AI
Google makes Gemini’s personalized image generation free for all US users
Google's Gemini personalized image generation is now free for all US users, allowing them to generate images informed by their Google data
The Next Web AI
Gemini’s personalized AI image generation is now free for U.S. users
Gemini's AI image generation is now free for U.S. users, allowing for personalized images based on user interests and data
TechCrunch AI
WebP's Compression Secret: How a 1MB PNG Becomes a 200KB WebP
Learn how WebP compresses images more efficiently than PNG and JPEG, and why it matters for web development
Dev.to · swift king
Up next
OpenAI Kills Sora then Descends into Chaos
ColdFusion
Watch →