No Priors Ep. 6 | With Daphne Koller from Insitro
Skills:
AI Ethics & Policy60%
Key Takeaways
Explores the intersection of AI and biotech with Daphne Koller, CEO and founder of Insitro, and discusses the potential for machine learning to revolutionize the life sciences
Full Transcript
[Music] Daphne welcome to the podcast thank you Sarah it's a pleasure to be here so as we were saying we won't ask you to walk through every part of your amazing life story but you came to biology as a computer science application years into your career what sparked you going down that route my initial interest in biology came from the technical side in the sense that the data sets this is way back when in the mid 90s the data sets that were available to machine learning research at the time were kind of boring and not very inspiring so things like classifying text into 20 different news groups and I found that there were more interesting data sets technically to be had on the biology side back then as we were starting for example to measure the activity of uh genes across the entire genome in multiple samples so initially it was really more more from a technological perspective but then I ended up actually having an interest in biology in its own right and ultimately ended up having a bifurcated Lab at Stanford where half my lab did core machine learning work published in traditional computer science venues and the other half did core biology work that was published in nature and cell and Science and what was really interesting is that most of my computer science colleagues had no idea that I did biology most of my life science colleagues had no idea I wasn't a computer science department so it was a bit of a bifurcated existence but it was a lot of fun amazing and we'll come back to you know full circle working on this problem now one more historical question for you uh you wrote the book on probabilistic graph models and I spent about a half an hour looking at my house for this book I have it somewhere but I wanted to have it for the podcast when I asked a mutual friend Andrew what I should ask you he suggested you know what motivated that work and um and how that field has changed so I think that um just like like in most Fields there is a swing of a pendulum a lot of uh the early work in probabilistic graphical models was hugely influential in bringing um artificial intelligence more into the world of machine learning and uh and working with numerical data rather than just symbolic Ai and then I think the Advent of um deep learning pushed that to the side a little bit because there was so much power that could be gained from basically the kind of pattern recognition from from raw inputs raw images text and so on without having to worry very much about interpretable representations what I think we're starting to see right now is a pendulum starting to swing back in the sense that there is a greater understanding that you really need a bit of both you need that hugely powerful pattern recognition that we get from Deep learning but you also need the ability to reason about things like causality and you also need some interpretability of your deep learning models so that you can potentially convey to a clinician why you made the decision that you did and so what we're ending up with as a really powerful Paradigm is some kind of synthesis of the ideas from both of these disciplines coming together let's focus on the problem you're working on now and for the last um how long has it been four four years or four and a half years before we even dive into that I had a really quick question just on the career side still because you know you went from um Stanford I believe that then going and co-founding Coursera with andrewing and then uh you went to Calico right after that right and I or you know a few years after that I'm sort of curious what made you decide to go into Calico because you mentioned your career was split between life sciences and Computer Sciences and so you went on the computer science online learning route and then you went back into biology so I'm a little bit curious what drove you back in so actually I'm going to go back and answer the earlier part of that which is what took me to Coursera in the first place because I think it feeds into what took me away um so the throughout much of my career at Stanford I had an increasing sense of urgency that I needed to make an impact in the world a real impact on real people not something that was at one step or two steps removed by training great students and having them go and do amazing things but by something that I get to experience myself and so when the work that I was doing at Stanford on technology assisted education gave rise to the launch of those first Stanford massive open online courses and we saw just how much impact those were having I felt like it was too amazing of an opportunity to pass up and just assume that if I didn't do this then somehow other people would take on the flag and Carry It Forward I felt like there was an incredible need to go and actually have that impact myself and make sure that it was done right and so that led to what my departure from Stanford on what was supposed to be a two-year leave of absence to go and found courser and I had the full intention to go back to Stanford at some later point and resume my faculty life um that didn't happen Stanford has a very strict leave of absence policy and when they came two years later and said it's already coming back and I responded that it wasn't really the right time I needed to see the project through for another year or so and they said that that was not an option I ended up doing this completely crazy thing which is resigning in endowed chair from Stanford and um staying at industry my mother thought I was nuts I think she still thinks I'm not but um I ended up staying at Coursera for a total of about five years and so five years was kind of a reasonable point to take a step back and and reflect and when I did that this was in early 2016 I realized that while I'd been deep in the trenches building Coursera the machine learning world had totally transformed because as a reminder I left Stanford for Corsair in late 2011 just before the machine learning Revolution really took off in 2012 and so I suddenly lifted my head looked around me and said wow machine learning really is transforming the world but not really having much of an impact in the Life Sciences um and so um I left Coursera in in good hands Coursera is a wonderful company but it's not really a deep technology company and certainly not a science company and decided that where I could have a really disproportionate impact wasn't bringing these these two disciplines together because there's just not a lot of people who had the benefit as I did of spending basically you know 20 years doing machine learning and maybe a decade doing uh doing biology and could really speak both languages and figure out how to synthesize them but since I'd been in industry for five years and away from science and even away from machine learning I didn't quite know where I wanted to go and what I wanted to do and so I turned for advice actually more than anything else to Art Levinson who is the former CEO of Genentech the former chairman of of Google and apple and I figured that if there was anyone who would know how to bring those two Fields together he was probably uniquely qualified to do that and so I reached out to him for advice because we'd run into each other in a number of different places I'd been on the thesis Committee of his son Jesse for example um and so I asked him for advice and he was very um I think admittedly self-serving in his advice he said you should come to college though and yeah and honestly I didn't know much about what Calico did other than it worked on Aging which seemed like a really important problem to think about but I did know that it's not many times that one has the opportunity to work with the luminary like art Levinson and also by that point met Hal Baron who's another person I have tremendous respect for and I figured this was you know a really interesting way to spend some time and and learn from these wonderful people and so I did that um and so and I learned a ton during my time at Calico was only 18 months because ultimately I realized that I didn't want to be at a company that focused on a particular biology but really build a platform for doing drug Discovery differently addressing some of the points that you Sarah made in your introduction about how drug Discovery is this incredibly fraught um largely unsuccessful and very expensive Endeavor and so how could I uh how could I make that happen differently and it didn't seem like was necessarily the right place to take on a what was a platform company built and so that's why I left and founded in citro were there any specific insights from Calico that drove the founder of NC Trail was it just more the exposure to biopharmaceuticals and how things are developed that really drove your thinking that maybe ML and AI would have a real application area there I think that it was really the exposure for the first time to how to how Pharmaceuticals were developed as you said at uh Stanford had worked a lot at the intersection of machine learning data science and biology and realized just uh you know how much power these machine learning Technologies can have when applied even to small data sets and certainly as the technology had evolved tremendously since then data sets were becoming considerably larger and Richer there was an even larger opportunity to make a huge difference and so that's what led my move back into that intersection and then therefore the Calico but um but I think it was really the realization that I I guess twofold one is that uh the the way in which you turn the insights into therapeutic interventions was so old-fashioned and so um un um unaccommodating of the use of data that I felt there had to be a better way to do this which I think that the industry has since started to demonstrate across the board um in many different companies uh and I think the other thing that made me make that shift is that whereas data is uh in life science is growing tremendously data in aging and specifically human aging is really hard to get because human aging is a very long process and um and in order to get data on um the longitudinal trajectory of human aging today will need it to start collecting data you know 20 30 years ago and the cohorts are rather small and so I felt like there was a huge opportunity in this intersection but maybe aging wasn't the first place where one could most beneficially apply it from at least my perspective yeah when you look across direct development because I guess right now it costs a billion a billion and a half dollars to develop a drug successfully it takes a decade plus to actually get there and then a lot of the um when I look at the potential uh areas that are challenging in the industry the sort of the initial small molecule selection and design or alternatively the pathway or cell type that you're using um separate from that there's the clinical trial itself and how do you figure out who to enroll and how to deal with the data and the patients and everything else there's all the calibration around Diagnostics and endpoints and clinical endpoints and how you think and all those places seem like there could be real uses of AI how did you choose what encito is actually going to do given how much room there actually is to innovate in this area relative to data to your point I mean it's just it's shocking how Little's done right it's like awful I completely agree and um and yeah in some sense the wealth of opportunities here is one of the biggest challenges because everywhere you look there is a big opportunity for machine learning to be deployed in a potentially quite significant way sometimes I have these discussions with the increasingly fewer number of people within biopharma who think that yeah this machine learning thing is a fad that will go away or maybe that machine learning is going to be this thing that helps you in a particular Point um area like you know x-ray crystallography it can improve this narrow little vertical but that's pretty much what it's going to do and my analogy is that it's not like x-ray crystallography it's like computers you're going to use it everywhere and it's going to be transformative everywhere if it's not going to be the Silver Bullet unless you figure out how to use it most effectively but but the opportunities are pretty much endless across the entire process from beginning to end so with that how did we pick what um what we end up working on you know I thought about this and you could divide the the process as many do into three large chunks one is the original biology discovery which is what targets do we employ in what indications and maybe in what patient population is kind of the first chunk then there's turning those targets into therapeutic matter which is a molecular design process and then at the end there's the enablement of the clinical trials in terms of actually actualizing patient selection or biomarkers for efficacy and things like that and all of those are important and all of those are valuable but if you look at the actual numbers of what makes drug Discovery so expensive it is the fact that 95 of drug programs fail they just do not succeed and the biggest reason why they don't succeed is not because the clinical trial was poorly designed that still happens but it's not the biggest reason nor is it because the molecule doesn't hit its Target and modulated in the right way that too happens but again it's it's an increasingly smaller number of situations because Pharma companies have gotten better and better at making uh therapeutic matter it's a place where most programs fail is because um and because we're just not modulating the right thing it's the wrong Target in the wrong indication or the wrong patient population so if you really want to bring down that two and a half billion dollar number what you have to do is to bring down this completely mind-blowing uh statistic of 95 percent of drug programs fail into something that is much more manageable so that a successful program doesn't have to carry on its back all of the many failures um expensive failures of all the things that didn't quite make it and so I've figured that it was maybe the hardest thing to do but also the thing is going to be the most impactful so how do you approach that problem as a computer science and now computer science and biology person the target identification problem yeah you know it's really hard right because when you think about it it's the one area where you really don't have the right type of training data at least not obviously because the question you're asking yourself is if I make this therapeutic intervention in this patient what is it going to do clinically and that is the thing about which you don't have data until the very end of the process which is called a clinical trial and so how do you train a machine learning model that doesn't have training data to train it right and so the uh the direction that we've chosen to take is actually a two-pronged approach and it's the synthesis of the two that we think is particularly powerful uh we bring in we bring in data from two quite different sources one is data from Human individuals where we don't get to do experiments but we have experiments of nature each of us is an experiment of nature where Nature has modulated our genetics into um you know different types of activity levels or um of individual genes where some of them behave this way and others behave that way and we can look at that mapping from genotype to phenotype as a surrogate of what a therapeutic intervention would do in those humans so that's great but it limits you to those experiments of Nature and the experiments of nature are not necessarily um the same as what a therapeutic intervention would do and so what we've uh done in parallel is to create our own data in our own wet lab where we make interventions in Cellular Systems and measure the phenotypic consequences there again um using very large scale data with very high content modalities and so the machine learning is actually used I would say in three different ways one is to interrogate um the phenotypic consequences of genetic variation in a human looking at very high content data like Imaging where we know machine learning works really well like different types of ohmic modalities transcriptomics proteomics and so on to really understand that mapping between genetics and phenotype we similarly look at the mapping between genetic interventions which in this case we get to actually direct ourselves by doing genome editing of cells and say what is the phenotypic consequences of modulating this Gene in this cell background and reading out a large high content amount of high content data to really understand how cell States responds to these interventions and so the machine learning is used on each of those two separately and then also to bring them together so that you can kind of think about building cellular models that are predictive of human clinical outcomes which is ultimately what we're looking to do is to replace the sort of untranslatable animal models with something that is much more driven from human biology when you think about again just like the focusing of incitro what domains do you decide to work in first because this approach should be quite horizontal of of course then you have you know complexity of what that cellular model can be it for sure is and again uh focusing has always been a challenge in the sense that there's so many opportunities and how do we say no to some of them so we've uh what we've done is tried to go in areas where we think there is both a large unmet need in the sense that the current tools that we're deploying are just not very effective and at the same time where we think that the technologies that we are developing internally uh provide us with the unique differentiated Advantage so one of those areas has been um in Neuroscience because as we know the unmet need there is humongous there are so very few effective therapeutic interventions neuroscience and that's partly because the model systems that we've been using specifically animal models while one can quibble about in which other therapeutic errors they are more or less relevant in Neuroscience it is very clear that they're probably not and that's one of the reasons why things work so well in whatever curing mice of schizophrenia whatever the heck that means and then not having much of an impact in human schizophrenia because it's not really even the same disease right um the other so that's on the unmet Need side and on the opportunity side we know that induced pluripotent stem cells are actually um relatively easily um differentiated into neurons you can actually see cellular phenotypes that are quite disease relevant in those neurons can I stop you for a second just because we have mostly a like computer science not biology audience and can you just explain like how you get a daphnia and a lot of neuron at all okay so um in order to get a Daphne neuron in the lab you take either a white blood cell for me or a skin cell for me and you go through a process of what's called reprogramming which received a technology which received the Nobel Prize number of years ago which allows you to turn it into what is basically a stem cell which means a cell that can then be uh that can then take any lineage it doesn't have to form a skin cell which is where it came from it can form a liver cell or a heart cell or or brain cell and so and then with that um stem cell which is why it's called an induced because you force it to be pluripotent which means it can go on any different direction stem cell or it's called ipsc and you can depending on what you do to it it can now be transformed as I said into a neuron or uh or a cardiomyocyte which is a hard cell and so on and so forth and so you can effectively get the effect of our genetics in these Cellular Systems and similarly uh you can make an even more pointed change by editing those cells and say if there is a genetic variant that we know causes a particular disease or significantly increases the chances of such a disease we can introduce that into different genetic backgrounds and then do a sort of almost like an uh in vitro case control which is same cell with and without the genetic variant what are the differences and that very carefully positioned protect people is like an A B test um uh this this in vitro a B test is something that allows us to really get at those differences that are specifically associated with the disease-causing variant so so that is one aspect of the capability that drove us towards our um therapeutic areas the other is as I said we have a two-prong strategy one is the data that we produce in the lab and one is data that we collect from humans so we also looked for areas in which um the data from humans is relatively readily available and in Neuroscience we have an increasing number of brain MRIs I think there will be even more now with the approval of some of the earliest Alzheimer's drugs because it's going to be part of the process by which uh people are either selected to receive the drug or not depending on whether their brain MRI shows certain um certain aspects of disease the other areas that we've gone into are metabolism and oncology because again those are areas where relevant disease relevant data that is high content that is unbiased and truly informative about the disease state is collected quite abundantly as part of the standard of care and so those are again we tried to look for areas where there is large unmet need and where the two types of capabilities that we bring to bear can be deployed I was going to ask if you think about um something like uh you know neurodegenerative diseases Alzheimer's Etc like you know is it single cell who can who can say but feels feels unlikely uh what what's beyond single cell and do you guys do organoid research like what is that within the scope of incitro yeah no that's a great question so um a lot of complex diseases don't are not encompassed within a single cell lineage however I think even there one can study in many cases not always the disease state by looking at a cell type that is clearly relevant to the season perhaps pushing it out of its comfort zone so for example in some of the work that we've done in metabolic disease I mean it's clear that how to sites are not the BL and end-all of what it takes to make a diseased liver but you can push the hepatocyte out of its Comfort Zone by putting in the right combination of you know fatty acids and maybe various immune system factors or whatever to create a disease state that is much more um similar to what you see in its uh in its natural environment um that having been said it's clearly the case that we're not going to be able to recapitulate the entire complexity of a disease state for a lot of those diseases and so one of the things that we do and this is in the spirit of being pragmatic and and prioritizing there's plenty of things that we can do today where the um where the disease does manifest sufficiently in a single cell lineage and so we go after those first and we defer some of the other ones to a later stage because Technologies such as organoids for example that Encompass multiple cell types in a single you know little micro brain or microwave or whatever or sometimes these things called organs on chips which allow you to actually create things that are more than just even a single organ they start to create sort of the flow between different organ systems those are technologies that other people are currently developing they're getting better by the day and so we feel like there's a lot of value that we can bring with the capabilities that are out there even if we know the reductionist even if we know they don't fully capture the disease um but they capture enough diseases so that we can bring medicines to patients and maybe in three years we'll have another tranche of diseases that are unlocked by the technological tidal wave that we're all writing you mentioned there are sort of two areas of um exploration uh foreign Citra right now one was um metabolic disease and cancer I guess that's really three areas and the second is um neurological areas I was just sort of curious how far you want to take these in terms of the actual development of drugs in-house versus partnering out and then I noticed you had things like relationships with BMS and others for ALS and dementia and a few other areas so a little bit curious about how far you actually want to take the development of drugs yourself versus partnering with others and how you think about that in the context of building a company and culture that's a a great question and the answer is that we are going to be relatively pragmatic about this as well and and um do what makes sense in terms of maximizing the impact that we have on patients so one of the things that we have going for us I think over a lot of other companies is that what we've built is a as an engine for generating novel insights novel targets so it's not um the situation a lot of companies are which is you have one program two programs and if you kind of sell those off then you're left with an empty cupboard and then what do you do you're not a company anymore so what we think of is because we have this engine we have the opportunity to have some of those programs be done in partnership with others some of those perhaps even be entirely out licensed to others while the engine continues to give us additional insights maybe even better insights as we expand for example into new indications using new technologies on the other hand to think about it from the complementary perspective some of the targets that we uh that we find ourselves having emerged from our platform are ones around which there's already a drug available because you know there's only 20 000 genes and so sometimes someone may have developed a drug just didn't deploy it in the right indication or didn't deploy it in the right patient population and we don't believe that the only thing that makes our existence worthwhile is if we come up with a new chemical matter towards those targets so we might go to the asset owner and say hey let's work together to bring that asset to Patient faster and and that can usually shave off you know two three maybe even five years from the development of a program because you've already made the drugs sometimes you've already put it in people you've shown that it's safe you have a good biomarker for when it's working and when it's not all those things that can really slow down a program if you're starting from absolutely um square one and a brand new Target and so we we hope to be very pragmatic in terms of what we develop in-house and what we develop with others with the goal of really trying to maximize the impact that the platform can bring to as many patients as possible how much work if any are you doing on the biomarker side because I think one of the points that you just raised is really interesting when I look at a lot of clinical drug development a lot of it is waiting for clinical endpoints that may take months or years to really substantiate and so sometimes the FDA or others will be willing to accept certain clinical biomarkers as sort of intermediary steps or things that tend to vary relative to the trade or the outcome are you doing biomarker development as well because that seems like such a great area for the applications of the ML and yet it seems like there's so little work in terms of actually translating ml into the real world for biomarkers in particular I think there's there's research research owes that that drugs that have a biomarker are about twice as likely to be successful in the clinic is ones that do not by the way there's also data that show that drugs that have support in human genetics are twice as likely to succeed as ones that do not and so we are deep Believers in both of those and um I think that because our focus is so much on human data a lot of the insights that come out of analysis of human clinical data does actually give you a biomarker for which patients are likely to benefit from a particular therapeutic intervention and so in some ways you can think of biomarker clinical biomarkers as coming out almost for free if you will not for free but sort of as a consequence of the work that we're doing anyway as long as we pay attention and don't just say as a lot of a lot of companies do that oh we found the target we're just going to go and apply it in all comers because honestly that is one of the big things that causes drugs to fail is that you are trying to apply it more broadly if I'm being cynical sometimes so as to maximize the revenues that you can get from a drug versus trying to figure out exactly in which patients it's going to work and one of the things you asked earlier a lot which what did I learn at Calico and one of the things that I learned there there were a lot of former genetic people there as one would expect given the pedigree of the company um some one of them told me that if uh one of the earliest Precision oncology drugs was herceptin and that goes after her two positive breast cancer patients that if they had tried to run the herceptin clinical trial in an all-commer breast cancer population you would have needed a population of ten thousand in the clinical trial which is a very large clinical trial and even then you might not have seen a sufficiently strong statistical significant statistically significant signal because the adverse side effects in every drug has adverse side effects in the non-responders may have outweighed the benefits in with the very strong benefits responders so the fact that they had the right patient population in the clinical development of herceptin was absolutely critical to creating a successful and um and you know uh reasonably sized clinical trial and so I think that that is a a pattern that many more people in the drug development industry should be following and frankly a lot of them have started to see the benefits of this so we're not the only ones going there but I do think to your point a lot that we have a differentiated technology staff that will hopefully allow us to get even better more accurate biomarkers via machine learning on high content data Yeah you mentioned two really key points I feel to Expediting Direct Delivery there's a biomarker part and then there's finding the right patients relative to the drug and I think that that actually also is very famous for the hrd drugs where there's a specific set of Pathways that if you didn't actually select out the patients with specific mutations the drugs didn't work and the second you focus on that population worked extremely well right and so there's lots of examples of that where you just have to figure out who you're actually targeting there's a really great interview from a couple years ago with Jansen who started Jansen Pharmaceuticals where he talked about how he felt that a lot of drug regulation and the length of time it takes to develop drugs was driven by almost an overly safetiest view of the world like there wasn't a strong series of cost-benefit trade-offs or willingness to sub segment patient populations or really look at data in a rich way and we've seen recently with things like covid that we can really expedite both drug development vaccine development everything right we did things in six months that normally would take 10 years during covid because we decided we could do it how much time do you think an ml first company or ml first approach can really cut out of drug development or do you think it's purely a regulatory issue in terms of those timelines I think is that's a complicated question and I think has elements of both I think first there does need to be a discussion with The Regulators around uh what what might be feasible um from from a regulatory approval perspective about different kinds of biomarkers there's also elements that I think are very legitimate questions like how do you collect the relevant biomarker in a robust reproducible way from different patients what kind of lab protocols one would need in order to have that be collected robustly that's not always trivial you can have the most beautiful sophisticated biomarker that works in a very carefully designed research environment and it's not going to work in the wild as part of the standard of cure so I think the regulator does have legitimate questions that need to be answered there um so uh and so but I do think that with that um with that discussion and especially if you can front load that and have the discussion with the regulator is not at the very end when you show up with your whatever NDA package but in an earlier State saying okay what would it take in order to make this uh reasonable from your perspective what questions would you like to see answered I think there's a legitimate opportunity to actually accelerate things having said that I think one needs to be realistic about what is and is not feasible in covid we were in the fortunate or unfortunate position um that there's there were a lot of patience with covid um it was rampant and so you were able to get a lot of uh you're you were able to fill your clinical trials relatively quickly and the disease progression was relatively fast if you're doing an Alzheimer's trial the disease progression is what it is and you need to wait long enough to see a delta in the cognition curve in order to convince yourself that there is in fact a difference that your drug is making a difference now I think there is an opportunity to try and create proxy biomarkers amyloid beta is a um as an example of that there's been questions about is it the right proxy um for cognition or not my guess would be that it is for some patients and probably not others but um so it's a mixed bag to our earlier point about heterogeneity and finding the right patient population but I think that is a thing that we need to gain conviction around over time and so ultimately there's only so much that you can speed up biology in certain cases because biology takes as long as it as long as it takes yeah it's it's interesting because um I feel like that's a mindset that those of us have worked on both computer science and biology have to learn right you are so used to just being able to manipulate some data in the cloud and then you get an answer versus waiting for years for a readout or to make progress when you think about how you built out the team at incitro and how you built out the culture how did you think about having each side learn about the different aspects that each side provides and in general how did you think about the culture of a company that could Bridge both things you know it's really hard and I think building the right culture is one of the most challenging things that we had to do at in citro and at the same time I think a big competitive Advantage because doing it is really not very easy you have to bring in people who truly have both a learning mindset on their own in terms of being interested enough to learn about something that for many is a totally different um set of Concepts and and even ways of thinking about the world so you need computer scientists who are willing to learn about this fuzzy ill-behaved field of biology where things don't do what they're supposed to do you know when you program a computer yeah you can have bugs but ultimately assuming you did the right things the same thing will happen and that's not true in biology we just don't know that much and these things are living beings so they don't respond in the same way even day after day and so there's just it's really hard and then conversely you have the scientist mindset the that sometimes they get frustrated with the okay we can take those building blocks and put them together and this is what will happen and and science is not like that and so you have to create a bridge between the different cultures the different jargons the different mindsets um and really both get people who are willing to learn about the other discipline but also just engage in meaningful ways with people who are different to themselves what did that mean when you said science is not just not like that in terms of uh manipulating building blocks so it comes back to what I said earlier just a function call there are so many variables that have a huge effect on on the system that sometimes we only are only vaguely appreciate sometimes we don't appreciate it all a colleague told me an anecdote about um a uh an experiment where some days it went perfectly well and then the other days the cells just died and they tried to figure out what was going on it turns out that they the cells had died were the days when there was a particular technician who had really had a fondness for onion sandwiches and and so it turns out that the onion in on his breath actually um ended up you know making the cells be less happy and so you just don't even think about these things if you're an engineer right um the the other really interesting mindset difference between between how scientists and how Engineers approach the world is when you show an engineer or computer scientist a bunch of dots usually the natural inclination is to try and find the pattern the thing that explains as many of the points as you can because that is the thing around which you will engineer your system if you're a scientist oftentimes what you look for are the outliers the exceptions because those exceptions are often the um beginnings of scientific discovery because they're the beginning of a threat it's like why did this one behave differently from everybody else and that gives rise to a new discovery so again it's just the mindsets are just so different was there anything you did from a process perspective to help bridge these things so for example I remember a color we tried that often um embed a bioinformatician with a team of uh systems engineers and they'd learn off of each other but then everybody on the team you know it could be a variant scientist it could be somebody else would participate in a scrum which was a concept that they weren't used to right on the biology side for example it was more of a way to set that everybody does things on weekly cadences and it's you don't just do long-term planning you also do way more short-term planning than you're normally would in a lab or you know there's different approaches to almost try and Bridge those divides were there any things that you specifically did along those lines or were there other approaches that you took from a tangible perspective well so first of all we do bring in people with their different mindsets and and we try and create um sort of bridges between them so we have product managers who do scrums and do uh you know these these agile planning processes and we apply that also to our platform development even on the biology side but at the same time you know drug Discovery projects which are years long you don't do scrums you you know there is a timeline and when you have a whatever uh a 45-day differentiation for your IPS cells it takes 45 days and there's no point to doing an agile scrum in the middle you just need to wait for the cells to do their thing and so we have project managers we have product managers and we make sure they communicate with each other but they have but they each deploy their discipline in their own way but to your question about one of the things that we did um a lot of it comes down to really being deliberate about culture and values and so one of the things that we did at the very beginning of the company is we laid out a set of Behavioral Norms which you know you can think of as values and the one that um is I think among my favorites maybe my favorite is actually the last one they're ordered not an order of importance but from what we do to how we do it which is that we engage with each other openly constructively and with respect each of the words matter um engagement means we don't Silo ourselves and just sit with our tribe we really have an engagement with others openly being open to asking naive questions um and at the same time being open to naive suggestions from someone from a discipline other than yourself because sometimes the question of why don't we do things this way is actually a really good idea when you don't come in with a preconceived notion of oh because that's how we've always done it um constructively means that when you make these suggestions it has to be with the goal of making the outcome better rather than being the smartest person in the room which is a big problem in companies we have a lot of smart people and the respect is really the respect for what everyone brings to the table and I think that's really important because there's a lot of um and please forgive me a lot but a lot of tech people who come in to um life sciences and it's like we have that sell verbala we are the smartest we're machine learning we're gonna solve everything and they don't respect the challenges of the other discipline they sometimes they don't even take the time to learn what the challenges of the other discipline are and that creates immediate um Hackle raising on the other side and you know from there the conversation can only get worse so I think it's really important to have that respect for both for all sides we have a lot of tech people uh Engineers Founders researchers as listeners what would you be working on if you weren't working on in sitro like what else are you paying attention to in digital bio or AI assuming people are uh attuned to having that culture of openness and respect and constructive thinking so um I think that's a great question and this really is the Golden Age of um Ai and machine learning and there's just so many different ways in which that can be deployed in useful ways I mean my personal Compass has always been that we should be deploying this towards areas where we make life better for people so I've tried to Veer towards applications that are really about improving life improving Health versus you know selling more ads or whatever not that you know I mean I guess selling ads is good too but um but for me it's really about how do we make life better um so I think there is a lot of really exciting opportunities right now I think that intersection or that interface if you will between biology and technology is one of the richest areas that exist today because each of these field has been making a huge amount of progress in its own right we all hear about you know AI much more in the news because of Chad GPT and so on and it's something that everyone can really relate to and understand but the toolkit that biologists have available to them with crispr and pluripotent stem cells and the huge advances in microscopy and such are maybe not quite as visible to the everyday person but they are equally dramatic I think in terms of what they unlock and so bringing those two together creates so many opportunities for change in uh not just in drug discovery which is where I happen to pick my own trajectory but in agriculture technology in environmental technology in energy in biomaterials maybe materials that are much less destructive to the environment and and such with better properties in food Tech um I think there is just a tremendous wealth of directions that one can take those um those fields and bring them together in interesting ways having said that I think there's other are really beneficial societal directions that one can deploy this I think we're only starting to see the applications of machine learning and AI to say energy other than things like biofuels because the data just haven't been as readily available but I'm sure that will change similarly I think going back to my Coursera days and even my Stanford days the benefits of machine learning in education and really personalizing learning experiences to individual Learners maybe having a more beneficial experience than just letting chat GPT write their essays for them I think there is a lot of opportunities to really deepen and enhance learning experiences for for students so I think there's almost unlimited things that one could do one just needs to be committed to finding them versus falling into the sort of comfortable place of going to one of the tech Giants and just doing something that earns you a lot of money which is I guess nice for you but maybe not as good in terms of making the world better you've worked with great success in areas that are uh perhaps traditionally harder to make money in as a startup Ed Tech Health Tech so I guess you don't have anything else to compare it to but what advice would you give to Founders who want to work in these areas in particular uh I think that are you talking about financial aspects like raising money or just um no just if if there's a there's a way to look at problem spaces where you know there's not traditionally a ton of budget or there's um impedance mismatch you know you have regulatory controls or whatever it is that makes it more challenging traditionally than many other areas of software so I think that uh there is I'm hoping a realization among investors that there are entire unpacked ecosystems where technology can make a difference and hasn't and um so I think that as you look at what we did at Coursera for example ethic had always been a Backwater of investment um and yet we were very fortunate to have been able to attract fairly significant funding even at the very early stages because we had an idea that our investors found compelling and differentiated from what others had done so I guess I'm a Believer and maybe I'm an optimist that if you have a really good idea that is differentiated from what others have done where the impact is something you can make clear as we were able to do with those first early moocs people will have confidence that you can turn that into something that is revenue bearing and will be willing to you know go with it for for a while so um that having been said I would say that ultimately and this is I guess how I feel about maybe the other half of a question which is is this going to be the place where you make the most money with the greatest amount of certainty maybe not but I believe that we only have one life to live and that ultimately what you want to be able to do is to look back on your life at some point since I have done something that's really worthwhile and important and I think that's something that um is important for people to keep in mind as they decide where to spend their time Daphne thanks for an incredible conversation and thank you for joining us on the podcast thank you very much [Music] foreign
Original Description
Life-saving therapeutics continue to grow more costly to discover. At the same time, recent advances in using machine learning for the life sciences and medicine are extraordinary. Are we on the verge of a paradigm shift in biotech?
This week on the podcast, a pioneer in AI, Daphne Koller, joins Sarah Guo and Elad Gil on the podcast to help us explore that question. Daphne is the CEO and founder of Insitro — a company that applies machine learning to pharma discovery and development, specifically by leveraging “induced pluripotent stem cells.” We explain Insitro’s approach, why they’re focused on generating their own data, why you can’t cure schizophrenia in mice, and how to design a culture that supports both research and engineering. Daphne was previously a computer science professor at Stanford, and co-founder and co-CEO of edutech company Coursera.
00:00 - Introduction
01:49 - How Daphne combined her biology and tech interests and ran a bifurcated lab at Stanford
04:34 - Why Daphne resigned an endowed chair at Stanford to build Coursera
14:14 - How insitro approaches target identification problems and training data
18:33 - What are pluripotent stem cells and how insitro identifies individual neurons
24:08 - How insitro operates as an engine for drug discovery and partners to create the drugs themselves
26:48 - Role of regulations, clinical trials and disease progression in drug delivery
33:19 - Building a team and workplace culture that can bridge both bio and computer sciences
39:50 - What Daphne is paying attention to in the so-called golden age of machine learning
43:12 - Advice for leading a startup in edtech and healthtech
Watch on YouTube ↗
(saves to browser)
Sign in to unlock AI tutor explanation · ⚡30
Playlist
Uploads from No Priors: AI, Machine Learning, Tech, & Startups · No Priors: AI, Machine Learning, Tech, & Startups · 17 of 60
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
▶
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
No Priors Ep. 13 | With Jensen Huang, Founder & CEO of NVIDIA
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 8 | With Neeva’s Sridhar Ramaswamy
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 7 | With Stanford Professor Dr. Percy Liang
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 1 | With Noam Brown, Research Scientist at Meta
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 9 | With Perplexity AI’s Aravind Srinivas and Denis Yarats
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 10 | With Copilot's Chief Architect and founder of Minion.AI Alex Graveley
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 11 | With Matei Zaharia, CTO of Databricks
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 12 | With Noam Shazeer
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 14 | With Sarah Guo and Elad Gil
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 2 | With Runway ML’s Cristobal Valenzuela
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 3 | With Stability AI’s Emad Mostaque
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 15 | With Kelvin Guu, Staff Research Scientist, Google Brain
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 4 | With Zipline’s Keller Rinaudo Cliffton
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 16 | With Mustafa Suleyman, Founder of DeepMind and Inflection
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 17 | With Karan Singhal
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 5 | With Huggingface’s Clem Delangue
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 6 | With Daphne Koller from Insitro
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 18 | With Kevin Scott, CTO of Microsoft
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 19 | With Anduril CEO Brian Schimpf
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 20 | With Sarah Guo and Elad Gil
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 21 | With Datadog Co-founder/CEO Olivier Pomel
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 22 | With Instacart CEO Fidji Simo
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 23 | With Snowflake's CEO Frank Slootman
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 24 | With Devi Parikh from Meta
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 25 | With Palantir's CTO Shyam Sankar
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 26 | With Weights & Biases CEO Lukas Biewald
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 27 | With Sarah Guo & Elad Gil
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 28 | With Khan Academy’s Creator Sal Khan
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 28 | With Khan Academy’s Creator Sal Khan (Japanese Version)
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 29 | With Inceptive CEO Jakob Uszkoreit
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 30 | With Vercel CEO Guillermo Rauch
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 31 | With Cerebras CEO Andrew Feldman
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 32 | With NEAR’s Illia Polosukhin
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 33 | With Replit's CEO & Co-Founder Amjad Masad
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 34 | With Ginkgo Bioworks Co-Founder and CEO Jason Kelly
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 35 | With Sarah Guo and Elad Gil
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 36 | With Hubspot's Co-Founder Brian Halligan
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 37 | With Kawal Gandhi
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 38 | With Material Security Co-Founder Ryan Noon
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 39 | With OpenAI Co-Founder & Chief Scientist Ilya Sutskever
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 40 | With Arthur Mensch, CEO Mistral AI
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 41 | With Imbue Co-Founders Kanjun Qiu and Josh Albrecht
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 42 | With Sarah Guo and Elad Gil
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 43 | With Clara Shih, CEO of Salesforce AI
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 44 | With Former Square CEO Alyssa Henry
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 45 | With Reid Hoffman
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 46 | Best of 2023 with Sarah Guo and Elad Gil
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 47 | With Sourcegraph CTO Beyang Liu
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 48 | With Covariant CEO Peter Chen
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 49 | With Shopify VP of Core Product Glen Coates
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 50 | With Stripe Head of Information Emily Glassberg Sands
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 51 | With Notion CEO Ivan Zhao
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 52 | With Pinecone CEO Edo Liberty
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 53 | With AMD CTO Mark Papermaster
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 54 | With Sarah Guo & Elad Gil
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 55 | With Figma CEO Dylan Field
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep 56 | With Baseten CEO and Co-Founder Tuhin Srivastava
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 57 | With LangChain CEO and Co-Founder Harrison Chase
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 58 | The argument for humanoid robots with Brett Adcock from Figure
No Priors: AI, Machine Learning, Tech, & Startups
No Priors Ep. 59 | With Sarah Guo & Elad Gil
No Priors: AI, Machine Learning, Tech, & Startups
More on: AI Ethics & Policy
View skill →Related AI Lessons
⚡
⚡
⚡
⚡
The AI House of Cards: Why Revolutionary Tech Breeds the Best Ponzis
Medium · Startup
The New Geography Of Entrepreneurship—How Founders Are Rethinking Where To Build
Forbes Innovation
Esports Company BLAST Reports Record Growth Following US Expansion
Forbes Innovation
Explorers Get Naming Rights. Infrastructure Builds The Future.
Forbes Innovation
Chapters (10)
Introduction
1:49
How Daphne combined her biology and tech interests and ran a bifurcated lab at
4:34
Why Daphne resigned an endowed chair at Stanford to build Coursera
14:14
How insitro approaches target identification problems and training data
18:33
What are pluripotent stem cells and how insitro identifies individual neurons
24:08
How insitro operates as an engine for drug discovery and partners to create th
26:48
Role of regulations, clinical trials and disease progression in drug delivery
33:19
Building a team and workplace culture that can bridge both bio and computer sc
39:50
What Daphne is paying attention to in the so-called golden age of machine lear
43:12
Advice for leading a startup in edtech and healthtech
🎓
Tutor Explanation
DeepCamp AI