Five hard truths about building a career in Data Science

Harshit Tyagi · Intermediate ·🛡️ AI Safety & Ethics ·5y ago

Key Takeaways

The video discusses five hard truths about building a career in Data Science, including the need to become data literate, specialize in a few skills, and have domain knowledge. It highlights the importance of programming, creating a solid work portfolio, and understanding the intersection of data science and business problems.

Full Transcript

[Music] hello so a simple google search on how to build a career in data science will throw a long and exhaustive list of skills in your face you know from learning how to program in python then learning statistics mathematics and you know finding out how to make reports dashboards using tableau or excel or any other tool then being proficient in communications and yeah there are n number of lists out there that you need to master in order to become a good data scientist now to add to this chaos you have a very large spectrum of job profiles for example uh you know a fintech company might require a phd in data science or phd in finance or mathematics in order to build their algorithmic trading strategies whereas a product-based company might just need a student statistician who can run and design their a b testing experiments to check which features are working which features are liked by the users which features would work well so as a result of this whole list and you know this large landscape of data science and the skills that are required to master as parents who are still trying to figure out whether they want to build a career in data science it becomes very overwhelming for those now in this video i am going to break down these hard truths that you should be aware of before you get down to building a career out of data science so five hard truths that you should be aware of before you start making a career in data science so truth number one nobody can do everything you should look to become data literate and then narrow down there's not even a single data scientist or data analyst who knows it all who possesses all the skills that we have just mentioned so what you should do is you should look to become data literate which is basically you should acquire enough mathematical and statistical knowledge so that you can understand your data understand the problem understand the whole frame of your reference your context and everything and then you should look to specialize in two to three skills that's it data scientists who do good they specialize in two to three domains or and two to three skills on top of their basic data literacy which make them irreplaceable and what you should do in order to become data literate is basically expand your knowledge of available methods and techniques so if you do not know something if you do not know a technique how would you apply it first of all so that is expanding your knowledge second choose the right method so you should be able to assess which method or which technique you should apply and you should be able to assess that for your particular scenario or for solving the problem that you are trying to solve then lastly you should be able to apply those methods you should be able to you know configure and optimize those techniques so once you have chosen a particular technique you should first of all the data literacy would help you understand the problem and the method that you have chosen to apply and then you will be able to find out or you know you'll be able to figure out the interplay between the two truth number two the paradox of experience now a big or major challenge with getting a job in data science is that most companies are looking for experienced data scientists who can actually drive decision making you know and refine their business models and help them you know provide more values to their users so how do newbies break into data science so you must have come across those memes that talk about needing the experience in order to get experience that kind of holds true in this particular case so how do we break into this field so the only solution is that you need to work or you need to produce a solid work portfolio that can actually showcase that you can do the job uh one very important quote that i came across the other day was you know in order to get hired you should start doing the job before you get the job truth number four programming is inevitable you need to learn how to program in order to become a good data scientist now up to a certain extent you can use tools like excel or business intelligence tools like tableau or power bi for your analysis or even modeling for some cases but it doesn't provide you the flexibility of you know running multiple analysis multiple models with different data sets and reproducibility has been a major concern so now we are looking at tools that can actually help you you know version data sets as well as um ml experiments and this is where uh programming comes into play and becomes required becomes mandatory in order to you know learn or build complete machine learning pipelines truth number five no matter where you come from domain knowledge is key so the last company that i worked at they used to send research papers to their data science candidates and by doing that they used to check the candidates ability to wrap their head around biomedical problems data sets and the theory which included a lot of biology and chemistry so this exercise used to inform them about the ability of the candidate to make decisions and you know drive growth of the data science team now business understanding here or domain understanding is basically the intersection of your data science paradigms and the practicalities of real world problems and having a good understanding of the domain basically helps you you know with a number of things first of all you will learn how to define the right questions you know choose the right data sources uh measure the right metrics prioritize the right product line and in turn it helps in business uh it helps in growing the business faster so a new uh you know a big chunk of newcomers basically find it really hard to specialize in one field and that's kind of the hard truth so it goes back to point number one which is you know become data literate and then try to narrow down try to specialize and find out which particular domain you want to apply your skills in so those were the five hard truths and if you like the content then you should consider subscribing to my channel as well as subscribing to my newsletter if you want all of this content everything interesting that's happening over a week delivered right to your inbox then you should definitely look at my newsletter i am sharing authentic data science content every week it could be illustrated tutorials it could be interesting reads that i come across or it could be some announcement around workshop webinars or any interesting course that i am conducting or i came across so yeah with that we have come to the end of this video and yeah i'll catch you guys in the next [Music] one

Original Description

My newsletter: https://dswharshit.substack.com/ Connect with me: LinkedIn: https://www.linkedin.com/in/tyagiharshit/ Twitter: https://twitter.com/dswharshit Instagram: https://www.instagram.com/upgradewithharshit Medium: https://dswharshit.medium.com/
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from Harshit Tyagi · Harshit Tyagi · 36 of 60

1 Your PATH to learning Data Science
Your PATH to learning Data Science
Harshit Tyagi
2 Ideal Python environment setup for Data Science projects - Unix shell, Anaconda and Git.
Ideal Python environment setup for Data Science projects - Unix shell, Anaconda and Git.
Harshit Tyagi
3 Building COVID-19 interactive dashboard from Jupyter Notebook | No frontend/backend coding required.
Building COVID-19 interactive dashboard from Jupyter Notebook | No frontend/backend coding required.
Harshit Tyagi
4 Introduction to Jupyter Notebooks - Interface | Ipython Kernel | Sharing | GitHub
Introduction to Jupyter Notebooks - Interface | Ipython Kernel | Sharing | GitHub
Harshit Tyagi
5 Python fundamentals for Data Science - Part  1 | Data types | Strings | Lists
Python fundamentals for Data Science - Part 1 | Data types | Strings | Lists
Harshit Tyagi
6 Python fundamentals for Data Science - Part 2 Dictionaries | Conditionals | Loops | Functions
Python fundamentals for Data Science - Part 2 Dictionaries | Conditionals | Loops | Functions
Harshit Tyagi
7 Python fundamentals for Data Science - Part 3 OOPS | Working with External Libraries & Modules
Python fundamentals for Data Science - Part 3 OOPS | Working with External Libraries & Modules
Harshit Tyagi
8 NumPy Essentials for Data Science - part-1 | One Dimensional Array
NumPy Essentials for Data Science - part-1 | One Dimensional Array
Harshit Tyagi
9 NumPy Essentials for Data Science - part-2 | Multi-Dimensional Array
NumPy Essentials for Data Science - part-2 | Multi-Dimensional Array
Harshit Tyagi
10 Math For Data Science | Practical reasons to learn math for Machine/Deep Learning
Math For Data Science | Practical reasons to learn math for Machine/Deep Learning
Harshit Tyagi
11 Linear Algebra Ep 1 | Introduction to Vectors, Matrices and Tensors using NumPy
Linear Algebra Ep 1 | Introduction to Vectors, Matrices and Tensors using NumPy
Harshit Tyagi
12 Linear Algebra Ep 2 | Dot Product in Linear Algebra for Data Science
Linear Algebra Ep 2 | Dot Product in Linear Algebra for Data Science
Harshit Tyagi
13 Python vs R | The BEST programming language for your Data Science Project
Python vs R | The BEST programming language for your Data Science Project
Harshit Tyagi
14 Linear Algebra for Data Science Ep3 | Identity and Inverse Matrices | NumPy
Linear Algebra for Data Science Ep3 | Identity and Inverse Matrices | NumPy
Harshit Tyagi
15 The Data Show Ep1 | Elucidating Data Science in Drug Discovery - A CTO's Account
The Data Show Ep1 | Elucidating Data Science in Drug Discovery - A CTO's Account
Harshit Tyagi
16 Google Certified TensorFlow Developer | Learning Plan, Tips, FAQs & my Journey
Google Certified TensorFlow Developer | Learning Plan, Tips, FAQs & my Journey
Harshit Tyagi
17 Speeding up your Data Analysis | Hacks & Libraries
Speeding up your Data Analysis | Hacks & Libraries
Harshit Tyagi
18 How to build an Effective Data Science Portfolio
How to build an Effective Data Science Portfolio
Harshit Tyagi
19 End-to-End Machine Learning Project Tutorial - Part 1
End-to-End Machine Learning Project Tutorial - Part 1
Harshit Tyagi
20 Data Preparation with Sci-kit learn and Pandas | End-to-End ML Project Tutorial - Part 2
Data Preparation with Sci-kit learn and Pandas | End-to-End ML Project Tutorial - Part 2
Harshit Tyagi
21 Training and Fine-Tuning ML Models with Sklearn | End-to-End ML Project Tutorial - Part 3
Training and Fine-Tuning ML Models with Sklearn | End-to-End ML Project Tutorial - Part 3
Harshit Tyagi
22 Deploying a Trained ML model via Flask on Heroku | End-to-End ML Project Tutorial - Part 4
Deploying a Trained ML model via Flask on Heroku | End-to-End ML Project Tutorial - Part 4
Harshit Tyagi
23 Three Decades of Practising Data Science | Interview with Dean Abbott
Three Decades of Practising Data Science | Interview with Dean Abbott
Harshit Tyagi
24 Calculating Vector Norms - Linear Algebra for Data Science - IV
Calculating Vector Norms - Linear Algebra for Data Science - IV
Harshit Tyagi
25 Ep1 - Getting Started | Zero to Hero in Computer Vision with TensorFlow
Ep1 - Getting Started | Zero to Hero in Computer Vision with TensorFlow
Harshit Tyagi
26 Ep3 - Designing Data Experiments to enhance your Product | Rapido's Data Science Lead, Pramod N
Ep3 - Designing Data Experiments to enhance your Product | Rapido's Data Science Lead, Pramod N
Harshit Tyagi
27 Building projects with fastai - From Model Training to Deployment
Building projects with fastai - From Model Training to Deployment
Harshit Tyagi
28 October AI - Video Calling with One-Tenth of Internet Bandwidth
October AI - Video Calling with One-Tenth of Internet Bandwidth
Harshit Tyagi
29 November AI - Breakthrough in biology after 50 years | Datasets, books, research papers and more...
November AI - Breakthrough in biology after 50 years | Datasets, books, research papers and more...
Harshit Tyagi
30 Data Science learning roadmap for 2021
Data Science learning roadmap for 2021
Harshit Tyagi
31 Talk is cheap, BUILD - Microsoft Software Engineer | Interview with Abhirath Batra
Talk is cheap, BUILD - Microsoft Software Engineer | Interview with Abhirath Batra
Harshit Tyagi
32 Building a Habit of Reading Research Papers | Ft. Anurag Ghosh(Microsoft Researcher)
Building a Habit of Reading Research Papers | Ft. Anurag Ghosh(Microsoft Researcher)
Harshit Tyagi
33 Tableau vs Python - Building a COVID tracker dashboard
Tableau vs Python - Building a COVID tracker dashboard
Harshit Tyagi
34 [Explained] What is MLOps | Getting started with ML Engineering
[Explained] What is MLOps | Getting started with ML Engineering
Harshit Tyagi
35 Dmitry Petrov - Creator of DVC | ML Systems, Teams, Scaling challenges, and Learning Data Science
Dmitry Petrov - Creator of DVC | ML Systems, Teams, Scaling challenges, and Learning Data Science
Harshit Tyagi
Five hard truths about building a career in Data Science
Five hard truths about building a career in Data Science
Harshit Tyagi
37 Computing gradients using TensorFlow | Training a Linear Regression model from scratch.
Computing gradients using TensorFlow | Training a Linear Regression model from scratch.
Harshit Tyagi
38 Foundations for Data Science & ML - First steps for every beginner!
Foundations for Data Science & ML - First steps for every beginner!
Harshit Tyagi
39 Course Outline - Foundations for Data Science & ML
Course Outline - Foundations for Data Science & ML
Harshit Tyagi
40 How Machine Learning uses Linear Algebra to solve data problems
How Machine Learning uses Linear Algebra to solve data problems
Harshit Tyagi
41 Calculus for ML - How much you should know to get started
Calculus for ML - How much you should know to get started
Harshit Tyagi
42 Building a buzzing stocks news feed using NLP and Streamlit | Named Entity Recognition & Linking
Building a buzzing stocks news feed using NLP and Streamlit | Named Entity Recognition & Linking
Harshit Tyagi
43 AI Engineer - The next big tech role!
AI Engineer - The next big tech role!
Harshit Tyagi
44 AI researcher vs AI engineer | The next big tech role!
AI researcher vs AI engineer | The next big tech role!
Harshit Tyagi
45 Reviewing LLMs for content creation
Reviewing LLMs for content creation
Harshit Tyagi
46 Building a chatGPT-like bot on WhatsApp #coding  #chatgpt #engineering
Building a chatGPT-like bot on WhatsApp #coding #chatgpt #engineering
Harshit Tyagi
47 High Signal AI - the most action-oriented newsletter on the web! #ai
High Signal AI - the most action-oriented newsletter on the web! #ai
Harshit Tyagi
48 Building an AI-powered Discord Chatbot Locally for FREE using Ollama
Building an AI-powered Discord Chatbot Locally for FREE using Ollama
Harshit Tyagi
49 Build a second brain with Khoj 🧠  #ai #obsidian #plugins #productivity #engineering #notes
Build a second brain with Khoj 🧠 #ai #obsidian #plugins #productivity #engineering #notes
Harshit Tyagi
50 Summarising YouTube Videos using Ollama on Discord | Becoming an AI Engineer - Ep 2
Summarising YouTube Videos using Ollama on Discord | Becoming an AI Engineer - Ep 2
Harshit Tyagi
51 Watch the full video on my channel - Roadmap to become an AI Engineer.
Watch the full video on my channel - Roadmap to become an AI Engineer.
Harshit Tyagi
52 Mesop - Python-based UI framework from Google!
Mesop - Python-based UI framework from Google!
Harshit Tyagi
53 How I automated my YouTube | Gumloop tutorial | No Code
How I automated my YouTube | Gumloop tutorial | No Code
Harshit Tyagi
54 ARC PRIZE - Win $1Million to Beat the ARC-AGI benchmark
ARC PRIZE - Win $1Million to Beat the ARC-AGI benchmark
Harshit Tyagi
55 Microsoft's Autogen vs CrewAI - tested on a diverse range of use cases
Microsoft's Autogen vs CrewAI - tested on a diverse range of use cases
Harshit Tyagi
56 Claude #AI artifacts are just amazing!
Claude #AI artifacts are just amazing!
Harshit Tyagi
57 OpenAI releases CriticGPT to correct GPT-4's mistakes | Read the paper with me
OpenAI releases CriticGPT to correct GPT-4's mistakes | Read the paper with me
Harshit Tyagi
58 Day in my life | Vlog #1
Day in my life | Vlog #1
Harshit Tyagi
59 How to add AI Copilot to your application using CopilotKit | Tutorial
How to add AI Copilot to your application using CopilotKit | Tutorial
Harshit Tyagi
60 Quick Questions with an AI Founder - Anudeep Yegireddi
Quick Questions with an AI Founder - Anudeep Yegireddi
Harshit Tyagi

The video teaches viewers about the importance of becoming data literate, specializing in a few skills, and having domain knowledge to succeed in a Data Science career. It highlights the need to create a solid work portfolio and understand the intersection of data science and business problems.

Key Takeaways
  1. Become data literate by acquiring mathematical and statistical knowledge
  2. Specialize in two to three skills
  3. Create a solid work portfolio to showcase skills
  4. Learn programming to build machine learning pipelines
  5. Develop domain knowledge to understand business problems
💡 Domain knowledge is key to succeeding in Data Science, as it helps in defining the right questions, choosing the right data sources, and measuring the right metrics.

Related AI Lessons

AI Security Isn't a Product. It's an Engineering Discipline.
Learn why AI security requires a continuous engineering discipline rather than a one-time product implementation, and how to apply this mindset to your AI development workflow
Dev.to AI
Why Solving Legal AI's Context Problem Is Harder Than You Think
Solving legal AI's context problem requires understanding decision-making processes, not just having large models
Forbes Innovation
How Can We Truly Protect Information Privacy in the Age of Artificial Intelligence?
Learn how to prioritize information privacy in the age of AI and make it a competitive advantage
Medium · Machine Learning
The AI Validation Gap: The $2.5 Trillion Blind Spot In Enterprise AI
The AI validation gap poses a strategic risk to enterprises, costing $2.5 trillion, and requires immediate attention
Forbes Innovation
Up next
Containers Don't Make Your AI Agent Safe
Web Dev Simplified
Watch →