Basic Data Types | Introduction to Data Mining | Part 3
Key Takeaways
Introduction to basic data types in data mining, including Record, Ordered, and Graph data sets, and understanding their structures and applications.
Full Transcript
All right, we can move on to data set classification. There are a lot of different types of data sets, and they require different approaches to analysis. The pre-processing steps, the modeling steps, pretty much everything you do with these different types of data sets is going to be different: the kinds of models you use, the kinds of visualizations you construct, the kind of cleaning that is proper for that kind of data. Understanding the structure of your data at the beginning is very important to not wasting time and not producing incorrect results. It's in this step, understanding the structure of your data, that things like domain knowledge tend to be very important. But there are still certainly categories that tend to be similar no matter what domain they're in, so we'll talk about these three kinds of data sets, records, graphs, and ordered data sets, in a little more detail coming up here.

Record data is data that consists of a collection of records, each of which consists of a fixed set of attributes. This particular tax data set, which I use in several places, is record data. Every data object has one tax ID, a value for whether they asked for a refund, a marital status (single, married, or divorced), a taxable income field, and whether they cheated on their taxes or not. That's the structure of this data set. Any data that consists of this kind of collection of records with a fixed set of attributes you almost always represent as a table, whether a database table, a spreadsheet, or something like that. It's the most common kind of data, so when you talk about data or data sets, record data is what a lot of people visualize. It's your most common and most fundamental kind of data set.

Within record data there are a few useful subsets. This tax data has some categorical values and then one ordinal variable. So tax ID is ordinal, right? Or is it? It's really more of a nominal variable when you think about it, because ordering doesn't necessarily matter. Sure, it takes numbers, but 10 is not meaningfully greater than 5; there's no ordering implied here. So tax ID is a nominal categorical field. Tax refund is a categorical field, and so is marital status. Taxable income is a continuous field. Most data that you encounter has mixed data types like this, some categorical and some numeric, and that's your traditional type of record data.

If, on the other hand, your record data consists entirely of numeric attributes, so entirely continuous, entirely interval or ratio variables, then we can think of it as a mathematical matrix rather than just a table. We would have an m-by-n matrix: there are m rows, one for each data object, and n columns, one for each attribute. This is nice because we can think of these data objects as points in a multi-dimensional space where each attribute is represented along one dimension. That allows us to use a number of numeric techniques, specifically ones involving distance, which not only make some algorithms easier but which some algorithms require. There are a number of algorithms that require you to have data matrix data, meaning all-numeric data.
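To make the two structures from the transcript concrete, here is a minimal Python sketch using only the standard library. The `TaxRecord` class, its field names, and every value in it are illustrative assumptions, not the data from the lecture's slide. The all-numeric `data_matrix` is likewise made up, and Euclidean distance is just one example of a distance-based technique.

```python
from dataclasses import dataclass, fields
import math


@dataclass
class TaxRecord:
    """One record with a fixed set of attributes (hypothetical field names)."""
    tax_id: int            # nominal: an identifier, its order carries no meaning
    refund: bool           # categorical (yes/no)
    marital_status: str    # categorical: "Single", "Married", or "Divorced"
    taxable_income: float  # continuous, in thousands (assumed unit)
    cheat: bool            # categorical (yes/no)


# Record data: a collection of records sharing the same attributes.
# Values are invented for illustration only.
records = [
    TaxRecord(1, True, "Single", 125.0, False),
    TaxRecord(2, False, "Married", 100.0, False),
    TaxRecord(3, False, "Divorced", 95.0, True),
]

# Because every record has the same fields, the collection maps onto a table:
# one column per attribute, one row per record.
column_names = [f.name for f in fields(TaxRecord)]

# Data matrix: when every attribute is numeric, the table can be treated as
# an m-by-n matrix (m data objects, n attributes). Invented values again.
data_matrix = [
    [1.0, 2.0],
    [4.0, 6.0],
    [1.0, 2.5],
]
m, n = len(data_matrix), len(data_matrix[0])


def euclidean(p, q):
    """Distance between two data objects viewed as points in n-dimensional space."""
    return math.dist(p, q)


print(column_names)
print(f"{m} objects x {n} attributes")
print(euclidean(data_matrix[0], data_matrix[1]))
```

The mixed-type `records` list cannot be passed to `euclidean` as it stands, since marital status and the yes/no fields are not numbers. That is the practical difference the transcript draws between general record data and a data matrix.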
Original Description
In this talk, we will introduce you to basic data types including Record, Ordered, and Graph and give you examples of when you would want to use each dataset.
Table of Contents:
0:00 Introduction
0:11 Types of dataset
1:10 Record data
3:32 Data matrix