WEKA Tutorial #1.1 - How to Build a Data Mining Model from Scratch

Data Professor · Beginner ·📐 ML Fundamentals ·6y ago

Key Takeaways

This video tutorial covers the basics of data science and demonstrates how to build a data mining model from scratch using the WEKA data mining software, including downloading and installing the software.

Full Transcript

welcome back to the data professor I'm Tennant I said a mad and in this episode I'm going to give you a quick introduction about what is data science and how you can go about building your very first prediction model so without further ado let's get started data is ubiquitous and in this day and age we have an ever-increasing amount of data infamously known as big data which we can use to analyze to get insights and to drive the decision-making process so what exactly is data data pertains to information about entities of interest for example health parameters of a human being such as the red and white blood cell count the blood profile lipid profile and other parameters that describes the health status of an individual characteristics of cars such as the top speed that it can go and fuel consumption rates properties of drugs such as the molecular size solubility electronic and hydrophobic properties of the drug simply put data science is a very big field that encompasses several smaller disciplines such as statistics mathematics data visualization programming data mining machine learning so as you can see data mining is a subset of data science and it refers to the specific process of making use of the data in order to build a prediction model and extracting knowledge from the data while machine learning refers to the learning algorithms that are used to create the prediction models inside the data mining process so there you have it a very brief introduction to data science now comes the fun part let's get started in building our very first prediction model Vica is a program for performing data mining it has an intuitive graphical user interface that allows you to pre-process transform the data as well as construct the prediction model using a variety of machine learning algorithms and it was created by two developers in Witten and EEP Frank from the University of Waikato so let's begin by first installing weeka onto your computer so what you need to do is go to a go to Google and then search for weeka and then click on the first link so notice that the URL is coming from the University of Waikato so click on the link so it's the pace that was open a couple of seconds ago so let's get started by downloading the program so click on the download button and then scroll down you'll notice that they they're going to have several versions here snapshot is when they have a like a beta version which is not stable yet but what you want is the stable version right here or they also have the developer version where they also provide new features which are not yet stable but are included for your usage here if you're into the latest feature you might want to try this one but if you're starting out I would recommend using the stable version so it has for many platform on the windows platform for the Mac platform and also for Linux platform as well so before you begin you will have to select one of the four links right here so what are they well the first link is the week program right here version 3 point 8 point three and it also comes with a java virtual environment as you can see from the final name for the 64-bit version however the second file is the weaker program alone as you can see here by the name of weeka and the version number three eight three and then x64 would mean it is built for the 64-bit version of Windows but it does not come with the Java Virtual Machine so therefore you don't see that JRE in the file name and the third file are is similar to the first file in which it has the weaker program along with the java virtual machine but it is built for the 32-bit version of your windows and the fourth file is the weaker program built for the 32-bit version so if you are wondering which version should you go with well let's check out what is the version of your computer's but it is 64 or 32 bit oh it's right here properties and then notice the 64-bit version right here so this computer has 64-bit so I'm going to go for it with the 64-bit version however I will have to identify whether I want to have Java or without Java so in order to do that let's check whether my computer has Java or not and you can do the same by going to the search icon type in CMD and click on the command prompt and then then and then you will see this command prompt window coming up type in Java and if it says that Java is not recognize that it means that your computer does not have Java installed so let's go with the first file which has Java prepackaged along with the weakest software so let's click on here and that will take you to the download link wait a bit ok and then your download have started so it's a hundred and fifteen megabytes so that should take you a little while ok so the internet speed is going up and we are a couple of seconds away from downloading the program ok so it's finished and let's install so click on the installation file and it will ask whether you want to allow this program to make changes to your device so I'll click on yes and then the next step is pretty easy and straightforward so click on the next button and we are close to completion and now it's going to install the java virtual machine you so click on the install button click on ok wait some more ok so we're almost there ok so Java has successfully been installed and I will click on the close button and then weeka say it's ok it is completed so once it's a complete head or click on the next button and then it has to take for us to take on start Riga and click on finished so until next time I'm telling nontox in amman on the data professor channel and if you haven't subscribed to add please consider subscribing and clicking on the notification bell so that you will be notified on the next video so I'll see you in the next one [Music]

Original Description

In this Part 1 video (of a 3 part series), you will learn about (1) the big concept of data science in 2 minutes and (2) how to build your first data mining model from scratch using the WEKA data mining software (Part 1 covers the downloading and installation of the WEKA software). This 3 part video series is made for the absolute beginner as we guide you step-by-step on building a data mining model from scratch. 🌟 Buy me a coffee: https://www.buymeacoffee.com/dataprofessor ⭕ Playlist: Check out our other videos in the following playlists. ✅ Data Science 101: https://bit.ly/dataprofessor-ds101 ✅ Data Science YouTuber Podcast: https://bit.ly/datascience-youtuber-podcast ✅ Data Science Virtual Internship: https://bit.ly/dataprofessor-internship ✅ Bioinformatics: http://bit.ly/dataprofessor-bioinformatics ✅ Data Science Toolbox: https://bit.ly/dataprofessor-datasciencetoolbox ✅ Streamlit (Web App in Python): https://bit.ly/dataprofessor-streamlit ✅ Shiny (Web App in R): https://bit.ly/dataprofessor-shiny ✅ Google Colab Tips and Tricks: https://bit.ly/dataprofessor-google-colab ✅ Pandas Tips and Tricks: https://bit.ly/dataprofessor-pandas ✅ Python Data Science Project: https://bit.ly/dataprofessor-python-ds ✅ R Data Science Project: https://bit.ly/dataprofessor-r-ds ⭕ Subscribe: If you're new here, it would mean the world to me if you would consider subscribing to this channel. ✅ Subscribe: https://www.youtube.com/dataprofessor?sub_confirmation=1 ⭕ Recommended Tools: Kite is a FREE AI-powered coding assistant that will help you code faster and smarter. The Kite plugin integrates with all the top editors and IDEs to give you smart completions and documentation while you’re typing. I've been using Kite and I love it! ✅ Check out Kite: https://www.kite.com/get-kite/?utm_medium=referral&utm_source=youtube&utm_campaign=dataprofessor&utm_content=description-only ⭕ Recommended Books: ✅ Hands-On Machine Learning with Scikit-Learn : https://amzn.to/3hTKuTt ✅ Data Science
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from Data Professor · Data Professor · 2 of 60

1 How a Biologist became a Data Scientist
How a Biologist became a Data Scientist
Data Professor
WEKA Tutorial #1.1 - How to Build a Data Mining Model from Scratch
WEKA Tutorial #1.1 - How to Build a Data Mining Model from Scratch
Data Professor
3 WEKA Tutorial #1.2 - How to Build a Data Mining Model from Scratch
WEKA Tutorial #1.2 - How to Build a Data Mining Model from Scratch
Data Professor
4 WEKA Tutorial #1.3 - How to Build a Data Mining Model from Scratch
WEKA Tutorial #1.3 - How to Build a Data Mining Model from Scratch
Data Professor
5 Computational Drug Discovery: Machine Learning for Making Sense of Big Data in Drug Discovery
Computational Drug Discovery: Machine Learning for Making Sense of Big Data in Drug Discovery
Data Professor
6 Quotes #1 on Big Data and Data Science
Quotes #1 on Big Data and Data Science
Data Professor
7 Quotes #2 on Big Data and Data Science
Quotes #2 on Big Data and Data Science
Data Professor
8 Quotes #3 on Big Data and Data Science
Quotes #3 on Big Data and Data Science
Data Professor
9 Quotes #4 on Big Data and Data Science
Quotes #4 on Big Data and Data Science
Data Professor
10 Quotes #5 on Big Data and Data Science
Quotes #5 on Big Data and Data Science
Data Professor
11 Data Science 101: Starting a Data Science / Data Mining Project
Data Science 101: Starting a Data Science / Data Mining Project
Data Professor
12 Data Science 101: CRISP-DM - Data Mining / Data Science in 6 Steps
Data Science 101: CRISP-DM - Data Mining / Data Science in 6 Steps
Data Professor
13 R Programming 101: How to Define Variables
R Programming 101: How to Define Variables
Data Professor
14 R Programming 101: Read and Write CSV files
R Programming 101: Read and Write CSV files
Data Professor
15 Data Science 101: Basic Command-Line for Data Science
Data Science 101: Basic Command-Line for Data Science
Data Professor
16 Strategies for Learning Data Science in 2020 (Data Science 101)
Strategies for Learning Data Science in 2020 (Data Science 101)
Data Professor
17 Building your Data Science Portfolio with GitHub (Data Science 101)
Building your Data Science Portfolio with GitHub (Data Science 101)
Data Professor
18 R Programming 101: Setting up R programming environment (R, RStudio and RStudio.cloud)
R Programming 101: Setting up R programming environment (R, RStudio and RStudio.cloud)
Data Professor
19 Exploratory Data Analysis in R: Towards Data Understanding
Exploratory Data Analysis in R: Towards Data Understanding
Data Professor
20 Exploratory Data Analysis in R: Quick Dive into Data Visualization
Exploratory Data Analysis in R: Quick Dive into Data Visualization
Data Professor
21 Machine Learning in R: Building a Classification Model
Machine Learning in R: Building a Classification Model
Data Professor
22 Machine Learning in R: Repurpose Machine Learning Code for New Data
Machine Learning in R: Repurpose Machine Learning Code for New Data
Data Professor
23 Data Science 101: Deploying your Machine Learning Model
Data Science 101: Deploying your Machine Learning Model
Data Professor
24 Machine Learning in R: Deploy Machine Learning Model using RDS
Machine Learning in R: Deploy Machine Learning Model using RDS
Data Professor
25 Data Pre-processing in R: Handling Missing Data
Data Pre-processing in R: Handling Missing Data
Data Professor
26 Machine Learning in R: Speed up Model Building with Parallel Computing
Machine Learning in R: Speed up Model Building with Parallel Computing
Data Professor
27 Data Science 101: Overview of Machine Learning Model Building Process
Data Science 101: Overview of Machine Learning Model Building Process
Data Professor
28 Web Apps in R: Building your First Web Application in R | Shiny Tutorial Ep 1
Web Apps in R: Building your First Web Application in R | Shiny Tutorial Ep 1
Data Professor
29 Web Apps in R: Build Interactive Histogram Web Application in R | Shiny Tutorial Ep 2
Web Apps in R: Build Interactive Histogram Web Application in R | Shiny Tutorial Ep 2
Data Professor
30 Web Apps in R: Building Data-Driven Web Application in R | Shiny Tutorial Ep 3
Web Apps in R: Building Data-Driven Web Application in R | Shiny Tutorial Ep 3
Data Professor
31 Web Apps in R: Building the Machine Learning Web Application in R | Shiny Tutorial Ep 4
Web Apps in R: Building the Machine Learning Web Application in R | Shiny Tutorial Ep 4
Data Professor
32 Web Apps in R: Build BMI Calculator web application in R for health monitoring | Shiny Tutorial Ep 5
Web Apps in R: Build BMI Calculator web application in R for health monitoring | Shiny Tutorial Ep 5
Data Professor
33 Machine Learning in R: Building a Linear Regression Model
Machine Learning in R: Building a Linear Regression Model
Data Professor
34 What programming language to learn for Data Science? R versus Python
What programming language to learn for Data Science? R versus Python
Data Professor
35 How to Become a Data Scientist (Learning Path and Skill Sets Needed)
How to Become a Data Scientist (Learning Path and Skill Sets Needed)
Data Professor
36 Using Python in R
Using Python in R
Data Professor
37 Interpretable Machine Learning Models
Interpretable Machine Learning Models
Data Professor
38 Making Scatter Plots in R [Data Visualisation in R series]
Making Scatter Plots in R [Data Visualisation in R series]
Data Professor
39 Machine Learning in Python: Building a Classification Model
Machine Learning in Python: Building a Classification Model
Data Professor
40 Compare Machine Learning Classifiers in Python
Compare Machine Learning Classifiers in Python
Data Professor
41 Hyperparameter Tuning of Machine Learning Model in Python
Hyperparameter Tuning of Machine Learning Model in Python
Data Professor
42 Practical Introduction to Google Colab for Data Science
Practical Introduction to Google Colab for Data Science
Data Professor
43 File Handling in Google Colab for Data Science
File Handling in Google Colab for Data Science
Data Professor
44 Pandas for Data Science: Create and Combine DataFrames / Rename Columns
Pandas for Data Science: Create and Combine DataFrames / Rename Columns
Data Professor
45 Machine Learning in Python: Building a Linear Regression Model
Machine Learning in Python: Building a Linear Regression Model
Data Professor
46 Machine Learning in Python: Principal Component Analysis (PCA) for Handling High-Dimensional Data
Machine Learning in Python: Principal Component Analysis (PCA) for Handling High-Dimensional Data
Data Professor
47 How to Plot an ROC Curve in Python | Machine Learning in Python
How to Plot an ROC Curve in Python | Machine Learning in Python
Data Professor
48 Installing conda on Google Colab for Data Science
Installing conda on Google Colab for Data Science
Data Professor
49 Use native R on Google Colab for Data Science
Use native R on Google Colab for Data Science
Data Professor
50 How to Save and Download files from Google Colab
How to Save and Download files from Google Colab
Data Professor
51 Easy Web Scraping in Python using Pandas for Data Science
Easy Web Scraping in Python using Pandas for Data Science
Data Professor
52 Data Science for Computational Drug Discovery using Python (Part 1)
Data Science for Computational Drug Discovery using Python (Part 1)
Data Professor
53 Pandas Profiling for Data Science (Quick and Easy Exploratory Data Analysis)
Pandas Profiling for Data Science (Quick and Easy Exploratory Data Analysis)
Data Professor
54 Exploratory Data Analysis in Python using pandas
Exploratory Data Analysis in Python using pandas
Data Professor
55 Quick tour of PyCaret (a low-code machine learning library in Python)
Quick tour of PyCaret (a low-code machine learning library in Python)
Data Professor
56 How to Upload Files to Google Colab
How to Upload Files to Google Colab
Data Professor
57 How to Install and Use Pandas Profiling on Google Colab
How to Install and Use Pandas Profiling on Google Colab
Data Professor
58 How to Adjust the Style of Pandas DataFrame
How to Adjust the Style of Pandas DataFrame
Data Professor
59 How to use Bamboolib for Data Wrangling in Data Science
How to use Bamboolib for Data Wrangling in Data Science
Data Professor
60 How to use Pandas Profiling on Kaggle
How to use Pandas Profiling on Kaggle
Data Professor

This video tutorial introduces data science and data mining concepts, and guides the viewer through the process of downloading and installing the WEKA data mining software to build a prediction model.

Key Takeaways
  1. Go to the University of Waikato website
  2. Download the WEKA software
  3. Select the correct version of WEKA for your computer
  4. Check if Java is installed on your computer
  5. Install WEKA with or without Java
  6. Launch WEKA and start building a data mining model
💡 Understanding the basics of data science and data mining is crucial for building effective prediction models, and WEKA is a useful software for this purpose.

Related Reads

📰
Is This Market Lying to Me? Part 1: The Problem and the Pipeline
Learn to build a real-time surveillance pipeline for prediction markets to detect potential lies, a crucial skill for data scientists and entrepreneurs
Medium · Python
📰
Share your experience
Hear from experienced programmers and learn from their journeys to improve your own learning path
Reddit r/learnprogramming
📰
Stop Guessing, Start Profiling: Mastering Edge AI Performance and Power on Android
Master edge AI performance and power on Android by profiling and optimizing your machine learning models, reducing guesswork and improving efficiency
Dev.to · Programming Central
📰
LeetCode Solution #1 — Two Sum
Solve the Two Sum problem on LeetCode using Python and improve your coding skills for interviews
Medium · Python
Up next
1. Overview of Artificial Intelligence | What is AI? Fundamental Concepts & Complete History of AI
Professor Rahul Jain
Watch →