Data Frames in R - Subsetting a data frame
Key Takeaways
The video demonstrates how to subset a data frame in R using square brackets, the dollar sign, and other methods, with the Star Wars data set from the dplyr package as an example.
Full Transcript
so far we've talked about creating and naming data frames importing them and the tiny verse package now let's get more close and personal with them and learn about sub setting data frames we've already talked about some of these methods with mover covering matrices and lists so Volvo try to be super quick when glossing over familiar stuff first I will make use of the tiny version get a data set from the deployer package that's called Star Wars it starts as a table which as we know is just another take on the data frame there are benefits to using a table instead of a data frame but that usually works best for the more advanced our user alright I will save the data as a data frame and I will remove the last three columns because all of them contain lists and that will make exploring our data difficult excellent now that's a pretty large data frame and it isn't very efficient to print all of it when we do an operation so from now on whenever our data frame is hefty I will use the head function it is part of a set of functions that act like a sanity check and only show you the top six or the bottom six rows of your data like this this is a great way for keeping tabs on whatever you're doing without flooding your konso with data all right so subsetting in terms of notation all the usual suspects are here we can subset data frames like we would subset a matrix or like we would index a list to index a specific element we use square brackets row index comma column index so if I want to know where r2d2 is from I need to code this or I could use the column name instead of its number because it's easier to ask about all the data on Princess Leia Organa I must reference a row this happens by passing her row name in the square brackets and leaving a space for the column indicator remember alternatively if I want to see only the character names but nothing else I can call the first column leaving a space for the rows I will use the head function here because 87 Rose is too long or I can borrow from the lists of setting notation and use double brackets to Co the name column notice that in the first instance our returned data frame and in the second a vector why does that happen well remember how lists behaved when we were sub setting them in the previous section double brackets drilled down to the individual elements and extract them without keeping the larger structure the same things happens in their frames why because there are two dimensional lists excellent now lists could also be subset it using the dollar sign and that stands true in this situation as well and just like with lists we don't need to use quotation marks around column names again our returns a vector one last way to reference a column but get a data frame in return is by using single brackets and column name inside quotation marks like this okay and how about if I want to extract multiple columns for a subset of characters we can use the combined function to specify what exactly we're interested in easy and that returns a dataframe - this is sweet alright now we know how to subset a dataframe to operate on whatever part of it we need that is fantastic but what if we got some new observations and wanted to add them to the structure or if we isolated a new variable and wanted to attach it to the data frame you guessed it we will cover that in the next lesson thanks everyone see you in the next video and of course may the force be with you for more videos like this one please subscribe
Original Description
👉🏻 Download Our Free Data Science Career Guide: https://bit.ly/3kUHmIy
👉🏻 FREE MONTH! Get full access to our newly redesigned platform and all our courses (18th October - 18th November): https://bit.ly/3hfTtO8
How to subset a data frame using square brackets and the dollar sign.
We can subset data frames like we would subset a matrix, or like we would index a list.
To index a specific element in a data frame, we use square brackets, row index, coma, column index.
To reference a row, pass the row number in the square brackets and leave a space on the column indicator.
To reference a variable in a data frame, call the column number, leaving a space for the rows. Or use double brackets to call the column by name.
► Consider hitting the SUBSCRIBE button if you LIKE the content: https://www.youtube.com/c/365DataScience?sub_confirmation=1
► VISIT our website: https://bit.ly/365ds
🤝 Connect with us LinkedIn: https://www.linkedin.com/company/365datascience/
365 Data Science is an online educational career website that offers the incredible opportunity to find your way into the data science world no matter your previous knowledge and experience. We have prepared numerous courses that suit the needs of aspiring BI analysts, Data analysts and Data scientists.
We at 365 Data Science are committed educators who believe that curiosity should not be hindered by inability to access good learning resources. This is why we focus all our efforts on creating high-quality educational content which anyone can access online.
Check out our Data Science Career guides: https://www.youtube.com/playlist?list=PLaFfQroTgZnyQFq4nUfb-w2vEopN3ULMb
#Statistics #Datascience #RProgramming
Watch on YouTube ↗
(saves to browser)
Sign in to unlock AI tutor explanation · ⚡30
Playlist
Uploads from 365 Data Science · 365 Data Science · 32 of 60
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
▶
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
Population vs Sample
365 Data Science
Data Science & Statistics: Levels of measurement
365 Data Science
Statistics Tutorials: Mean, median and mode
365 Data Science
Skewness
365 Data Science
What is a distribution?
365 Data Science
The Normal Distribution
365 Data Science
Central limit theorem
365 Data Science
Student's T Distribution
365 Data Science
Type I error vs Type II error
365 Data Science
Hypothesis testing. Null vs alternative
365 Data Science
The linear regression model
365 Data Science
Simple linear regression model. Geometrical representation
365 Data Science
INDEX and MATCH application of the two functions separately and combined [Advanced Excel]
365 Data Science
INDIRECT Excel Function: How it works and when to use it [Advanced Excel]
365 Data Science
VLOOKUP and MATCH another useful functions combination [Advanced Excel]
365 Data Science
VLOOKUP COLUMN and ROW - Handle large data tables with ease [Advanced Excel]
365 Data Science
The ELIF keyword [Python Fundamentals]
365 Data Science
Working with Tuples in Python
365 Data Science
Database Terminology - A Beginners Guide
365 Data Science
Relational Database Essentials
365 Data Science
Database vs Spreadsheet - Advantages and Disadvantages
365 Data Science
Conditional Statements and Loops
365 Data Science
Backpropagation – The Math Behind Optimization
365 Data Science
Monte Carlo: Forecasting Stock Prices Part I
365 Data Science
Monte Carlo: Forecasting Stock Prices Part II
365 Data Science
Monte Carlo: Forecasting Stock Prices Part III
365 Data Science
365 Data Science Online Program
365 Data Science
Data frames - Creating a data frame
365 Data Science
Data Science & Statistics: Slicing a matrix in R
365 Data Science
Data frames in R - Exporting data in R
365 Data Science
Data frames in R - Transforming data PART II
365 Data Science
Data Frames in R - Subsetting a data frame
365 Data Science
Data Science & Statistics: Matrix arithmetic in R
365 Data Science
Data Science & Statistics: Indexing an element from a matrix
365 Data Science
Data Frames in R - Extending a data frame
365 Data Science
Data Science & Statistics: Creating a matrix in R FASTER
365 Data Science
Data Science & Statistics: Creating a Matrix in R
365 Data Science
Data frames - Importing data in R
365 Data Science
Data frames in R - Getting a sense of your data
365 Data Science
Data frames in R - Transforming data PART I
365 Data Science
Data frames in R - Import a CSV in R
365 Data Science
Data Science & Statistics: Matrix operations in R
365 Data Science
Data Science & Statistics: Matrix recycling in R
365 Data Science
Tableau vs Excel: When to use Tableau and when to use Excel
365 Data Science
Download Tableau: Learn how to download Tableau Public
365 Data Science
Connecting data sources: Useful tips when connecting data sources to Tableau
365 Data Science
The Tableau interface: See how to navigate through the Tableau interface
365 Data Science
Tableau data visualization: Create your first Tableau visualization!
365 Data Science
Duplicating sheets: This is how to duplicate a sheet in Tableau
365 Data Science
Build a table in Tableau: The steps needed to create a simple table in Tableau
365 Data Science
Custom fields in Tableau: Using Tableau operators to create custom fields
365 Data Science
Custom fields in Tableau: Add calculations to tables through custom fields
365 Data Science
Totals in Tableau: Learn how to display subtotals and totals in Tableau
365 Data Science
Gross Margin calculation in Tableau
365 Data Science
What is a filter in Tableau: Set up a filter in Tableau to specify the data you want to show
365 Data Science
Joins in Tableau: Inner, outer, left, or a right join in Tableau
365 Data Science
Building a Tableau dashboard: Three types of charts you want to have in a Tableau dashboard
365 Data Science
Creating great looking charts in Tableau: Real life Exercise on charts in Tableau
365 Data Science
Joins in Tableau: Choose the correct join type
365 Data Science
How to make a data check in Tableau: A quick data check is better than no data check
365 Data Science
More on: Data Literacy
View skill →Related AI Lessons
🎓
Tutor Explanation
DeepCamp AI