3rd wave of data scientists

MLOps.community · Beginner ·💰 FinTech & AI for Finance Professionals ·6y ago

Key Takeaways

The video discusses the concept of the 3rd wave of data scientists, highlighting the evolution of data science from pioneers like Google and Facebook to the current need for full-stack data scientists who can extract business value from machine learning and data science.

Full Transcript

so pivoting back now into this hiring and what was going on there I I thought it was really interesting in your blog how you talked about the third wave of data scientists can you go into that a bit more yeah yeah I I think I'm in a position where I can answer this as well because I feel I've been part of like oh every kind of wave if you want to call it with that so I think this all starts to make sense after there around 2011 or something especially like okay we always had like the pioneers that you know Google Facebook Twitter whoever we were like earlier doctors in early like you know the the adopted machine learning first and were like on the forefront but I guess the rest of the world was a little bit using various proprietary kind of tools doing very similar work but it was kind of all locked in and and pretty like not really it was opaque kind of kind of world and I guess that's the first wave in a nutshell and then this big data like revolution happened and particularly where I think the second wave kicked in was juked and in other books so that was just a profound moment in the dots I the data science like the science like community and it just opened up like python and like data science to to the whole world it was like this this is this brilliant moment and everyone suddenly became I would say including myself everyone suddenly became like a data scientist and there was massive like excitement and massive hype of the like possibilities that machine learning and data science can bring to to enterprise the business and I think we were rolling in that kind of wave for maybe three or four years but in the end I think businesses didn't really know how to extract business value a lot of business value out of out of kind of you know data science joke the notebook data science as it was back then so you ended up with situations where yo you know it was just like shelfware powerpoints things that were created in a vacuum and were never put in production and I think there was there's a lot of that in in business during the second wave so it was kind of a high time but a time where business value wasn't optimally extracted and I think the lesson learned there was like the data scientists cannot just be people who know build you know right Jupiter notebooks in a vacuum they have to be involved in the end-to-end process of actually prototyping something and then ensuring there gets pushed into production and I think there's a recognition amongst like businesses nowadays that this is kind of you know this is kind of a given this is kind of a requirement nowadays for the data scientist or you know she letting engineer whatever kind of kind of jobs so yeah working in a vacuum in Jupiter notebooks is not an option I think anymore and the whole industry is coming to that realization and I think that's the third wave of like the full stack kind of data scientist

Original Description

What does the 3rd wave of data scientists mean and how does it relate to your company? London Based Fintech start-up TrueLayer decided to use Machine Learning instead of a rule-based system in mid-2019 and in our 7th meetup we spoke to their lead data scientist Alex Spanos about everything that entailed. During the meetup, we dove into how TrueLayer architected their MLOps pipeline for their Open Banking API: more specifically which tools they use and why, what prompted them to use machine learning, and how Alex sees the role of a Machine Learning Engineer. Alex has led the hiring process of Machine Learning Engineers and shared learnings on candidates and businesses alike. Alex is the Lead Data Scientist at TrueLayer, focussing on building Open Banking API products powered by data. Prior to TrueLayer, he built predictive models in Financial Services, used social data to predict the “next-big-thing” in Fast Moving Consumer Goods and introduced Machine Learning techniques in subsurface imaging. His academic background is in Applied Mathematics & Statistics. Check out his blog entries for more info: https://alexiospanos.com/hiring-machine-learning-engineers-part-1/ https://alexiospanos.com/hiring-machine-learning-engineers-part-2/ https://blog.truelayer.com/improving-the-classification-of-your-transaction-data-with-machine-learning-c36d811e4257 Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/ Connect with Alex on LinkedIn: https://www.linkedin.com/in/alexspanos/
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Playlist

Uploads from MLOps.community · MLOps.community · 51 of 60

1 Our 1st MLOps Meetup // Luke Marsden // MLOps Meetup #1
Our 1st MLOps Meetup // Luke Marsden // MLOps Meetup #1
MLOps.community
2 Remote Collaboration as a Data Scientist
Remote Collaboration as a Data Scientist
MLOps.community
3 MLOps Manifesto with Luke Marsden from Dotscience
MLOps Manifesto with Luke Marsden from Dotscience
MLOps.community
4 MLOps lifecycle description
MLOps lifecycle description
MLOps.community
5 What Does Best in Class AI/ML Governance Look Like in Fin Services? // Charles Radclyffe // MLOps #2
What Does Best in Class AI/ML Governance Look Like in Fin Services? // Charles Radclyffe // MLOps #2
MLOps.community
6 Life purpose and too many spreadsheets
Life purpose and too many spreadsheets
MLOps.community
7 Explainability, Black boxes and EU white paper on reproducibility
Explainability, Black boxes and EU white paper on reproducibility
MLOps.community
8 Hierarchy of Machine Learning Needs // Phil Winder // MLOps Meetup #3
Hierarchy of Machine Learning Needs // Phil Winder // MLOps Meetup #3
MLOps.community
9 Automatically Retrain Machine Learning Models? Are best practices worth it?
Automatically Retrain Machine Learning Models? Are best practices worth it?
MLOps.community
10 Building an MLOps Team? Key ideas to keep in mind
Building an MLOps Team? Key ideas to keep in mind
MLOps.community
11 Hierarchy of MLOps Needs
Hierarchy of MLOps Needs
MLOps.community
12 Bare necessities for getting an ML model into production
Bare necessities for getting an ML model into production
MLOps.community
13 MLOps and Monitoring
MLOps and Monitoring
MLOps.community
14 How Phil Winder got into Data Science and Software Engineering
How Phil Winder got into Data Science and Software Engineering
MLOps.community
15 Provenance and Reproducibility in Machine Learning; what is it and why you need it?
Provenance and Reproducibility in Machine Learning; what is it and why you need it?
MLOps.community
16 Friction Between Data Scientists and Software Engineers
Friction Between Data Scientists and Software Engineers
MLOps.community
17 MLOps Problems in different size companies
MLOps Problems in different size companies
MLOps.community
18 ML tooling in large companies
ML tooling in large companies
MLOps.community
19 ML Platforms - The build vs buy question
ML Platforms - The build vs buy question
MLOps.community
20 ML Services Gateway at SurveyMonkey
ML Services Gateway at SurveyMonkey
MLOps.community
21 Message buses, Async and sync architecture
Message buses, Async and sync architecture
MLOps.community
22 MLOps #4: Shubhi Jain - Building an ML Platform @SurveyMonkey
MLOps #4: Shubhi Jain - Building an ML Platform @SurveyMonkey
MLOps.community
23 Hybrid Data Science Teams @SurveyMonkey
Hybrid Data Science Teams @SurveyMonkey
MLOps.community
24 How do you handle ML version control at SurveyMonkey
How do you handle ML version control at SurveyMonkey
MLOps.community
25 Doing ML with Personal Information
Doing ML with Personal Information
MLOps.community
26 Evolution of the ML feature store @SurveyMonkey
Evolution of the ML feature store @SurveyMonkey
MLOps.community
27 Developing a Machine Learning Feature Store
Developing a Machine Learning Feature Store
MLOps.community
28 Auto retrain ML models is not the question
Auto retrain ML models is not the question
MLOps.community
29 3 key parts to Machine Learning monitoring
3 key parts to Machine Learning monitoring
MLOps.community
30 MLOps Meetup #6: Mid-Scale Production Feature Engineering with Dr. Venkata Pingali
MLOps Meetup #6: Mid-Scale Production Feature Engineering with Dr. Venkata Pingali
MLOps.community
31 MLOps meetup #5 High Stakes ML: Active Failures, Latent Factors with Flavio Clesio
MLOps meetup #5 High Stakes ML: Active Failures, Latent Factors with Flavio Clesio
MLOps.community
32 MLOps: Airflow Pros and Cons
MLOps: Airflow Pros and Cons
MLOps.community
33 Specific challenges in Machine Learning
Specific challenges in Machine Learning
MLOps.community
34 Current State Of Machine Learning
Current State Of Machine Learning
MLOps.community
35 Humans in the Loop are a defining factor in Machine Learning
Humans in the Loop are a defining factor in Machine Learning
MLOps.community
36 Learning from real life Machine Learning failures
Learning from real life Machine Learning failures
MLOps.community
37 Survivorship Bias in machine learning tutorials
Survivorship Bias in machine learning tutorials
MLOps.community
38 Swiss Cheese model in Machine Learning
Swiss Cheese model in Machine Learning
MLOps.community
39 Resume driven development in Machine learning & software engineering
Resume driven development in Machine learning & software engineering
MLOps.community
40 Who has the highest standards in ML?
Who has the highest standards in ML?
MLOps.community
41 Venkata Pingali of Scribble Data Thoughts on the Current State of Machine Learning
Venkata Pingali of Scribble Data Thoughts on the Current State of Machine Learning
MLOps.community
42 Dependable data and being able to Trust in your Data with Venkata Pengali of Scribble Data
Dependable data and being able to Trust in your Data with Venkata Pengali of Scribble Data
MLOps.community
43 Speed, Trust, Evolution and Scale in MLOps
Speed, Trust, Evolution and Scale in MLOps
MLOps.community
44 More difficult transition for data scientists to become ML engineers
More difficult transition for data scientists to become ML engineers
MLOps.community
45 How many models in prod til I need a dedicated ML platform?
How many models in prod til I need a dedicated ML platform?
MLOps.community
46 Deeper thinking from data scientists around platform blackholes
Deeper thinking from data scientists around platform blackholes
MLOps.community
47 Checkpointing, metadata, and confidence in your data
Checkpointing, metadata, and confidence in your data
MLOps.community
48 Adjacent usecases and multistep feature engineering
Adjacent usecases and multistep feature engineering
MLOps.community
49 Standardization of Machine Learning tools like in Software Engineering with Venkata Pingali
Standardization of Machine Learning tools like in Software Engineering with Venkata Pingali
MLOps.community
50 Reproducability flaws in end to end Machine Learning debugging
Reproducability flaws in end to end Machine Learning debugging
MLOps.community
3rd wave of data scientists
3rd wave of data scientists
MLOps.community
52 MLOps meetup #7 Alex Spanos // TrueLayer 's MLOps Pipeline
MLOps meetup #7 Alex Spanos // TrueLayer 's MLOps Pipeline
MLOps.community
53 MLOps Meetup #8 Optimizing Your ML Workflow with Kubeflow 1.0
MLOps Meetup #8 Optimizing Your ML Workflow with Kubeflow 1.0
MLOps.community
54 Are Kubeflow and Airflow complementary?
Are Kubeflow and Airflow complementary?
MLOps.community
55 Why Kubeflow gained so much traction=open community
Why Kubeflow gained so much traction=open community
MLOps.community
56 Who decides the dirrection of Kubeflow
Who decides the dirrection of Kubeflow
MLOps.community
57 What do Kubeflow and Arrikto do and how do they work together?
What do Kubeflow and Arrikto do and how do they work together?
MLOps.community
58 Versioning your ML steps with Kubeflow
Versioning your ML steps with Kubeflow
MLOps.community
59 Machine Learning Lifecycles//Perception vs Reality
Machine Learning Lifecycles//Perception vs Reality
MLOps.community
60 Kubeflow vs SageMaker in Machine Learning
Kubeflow vs SageMaker in Machine Learning
MLOps.community

The video discusses the 3rd wave of data scientists, emphasizing the need for full-stack data scientists who can build production-ready models and extract business value from machine learning and data science. This requires a shift from working in isolation to collaborating with businesses and ensuring that data science projects are production-ready.

Key Takeaways
  1. Understand the evolution of data science from pioneers to the current need for full-stack data scientists
  2. Recognize the limitations of working in isolation and the importance of collaboration
  3. Learn to build production-ready machine learning pipelines
  4. Extract business value from data science projects
  5. Communicate data insights effectively to stakeholders
💡 The 3rd wave of data scientists requires a full-stack approach, where data scientists are involved in the end-to-end process of building and deploying production-ready models, and extracting business value from data science projects.

Related AI Lessons

Build a Remittance App with the Afriex Business API
Learn to build a remittance app using the Afriex Business API for seamless cross-border transactions
Dev.to · Victory Lucky
Why FinTech Is Growing So Fast — And What the Next 10 Years Could Look Like
FinTech is growing rapidly, enabling fast and convenient financial transactions, and is expected to continue shaping the financial landscape over the next decade
Medium · Startup
$100,000 in Support. Four Weeks. One Goal — Turn African Fintech Founders Into Operators. Bankiffy Fintech Accelerator BFA 2026 Cohort 1 Is Now Open
Learn how the Bankiffy Fintech Accelerator supports African fintech founders in turning their ideas into operational businesses
Techpoint Africa
Nigeria’s fintechs built payments. Now they’re becoming banks.
Nigeria's fintechs are transitioning into banks by obtaining MFB licenses to diversify revenue streams beyond transaction fees
TechCabal
Up next
Ito ang Pinakamalaking WEALTH KILLER | At Ano ang Solusyon?
Chinkee Tan
Watch →