Jason Lopez: Our immune system can detect viruses, but sometimes cancer cells are invisible. Cancer cells can appear normal and the immune system allows them to go about wreaking havoc. In immunotherapy, the DNA of the immune system’s T-cells are changed to recognize the genetic strands of the tumor cell and destroy them.
Debo Dutta: Let's see how AI gets into this space. So the programming, the existing T-cell can be done with gene therapy, but how do you know what to program? That's where AI comes in. Using AI, you can rapidly design sequences. People are today looking at a lot of sequence data as well as literature, feeding them into large language models. And they are actually trying to generate candidate sequences for lab experiments.
Jason Lopez: Advances in large language models as well as computer infrastructure are cutting development times dramatically. Debo says what would have taken years might take a 10th or even a hundredth of the time.
Debo Dutta: I'm not an oncologist. I'm not a gene therapy researcher or an immunotherapy researcher, but I do believe that as a systems person, if I can help the lifecycle of the machine learning go faster, I can help them to make immunotherapy get better and better. We can cure more types of cancers and more diseases.
Jason Lopez: Debo was born in India. His mother is a retired doctor and his father a retired engineer. Those two professions are some of the most sought-after in the country and getting into a good university to pursue those professions is very competitive, and he was able to enroll in one of the top colleges in engineering where he opted to study computer science.
Debo Dutta: But I kept thinking about biology, but I didn't do much about it.
Jason Lopez: After graduating, he came to the US to study for a PhD at the University of Southern California, and that's when he saw another dimension to computer science.
Debo Dutta: It looks like pure science in many ways because it's a lot of math, applied math and discrete math, and a lot of engineering too. But it can be applied to biology.
Jason Lopez: At USC he encountered a large team doing computational biology. And after his PhD, he had this thought:
Debo Dutta: No matter what I do in life, I need to spend some time in computational biology at USC before I actually get into my career and just be a researcher and learn how computer science can be applied. And that's when I first picked up what is now known as machine learning.
Jason Lopez: What do you think about the AI boom?
Debo Dutta: Yeah, that's a very interesting and loaded question. I could spend the entire session just talking about that.
Jason Lopez: Models are at the heart of machine learning. Data is used to train models. Which are deployed to infer from new data. And in the ChatGPT and the generative AI space, models are getting bigger.
Debo Dutta: These models can be as large as 1 trillion parameters. And typically when you see new data, you have to run this data through the model. Big models take longer time, more compute resources. And so measuring the performance will lead to more innovations in the infrastructure space.
Jason Lopez: That will make models cheaper to develop and cheaper to deploy. But, Debo points out the quality of the model still matters.
Debo Dutta: When you're generating text, how good is the text? Is it hallucinating? Is this text believable? How was this model developed? Was it developed on open data? Was it developed on somebody's private data? And if that is the case, what's the liability of the model if I use that model?
Jason Lopez: When Debo did his postdoc in computational biology at USC, it was called statistical algorithms. He says he was on the right track as he explored signals in biological experiments, especially proteomics: the study of proteins.
Debo Dutta: I got to learn about machine learning as a part of understanding how the human body works.
Jason Lopez: Statistical algorithms were one of his tools at his job, later, at Cisco. He did cloud computing for a while. But then decided to investigate a new area.
Debo Dutta: And what struck me is cancer therapeutics in the US was still not advanced enough by a computer scientist standards.
Jason Lopez: One day, he found himself at a meeting at Stanford with some leading machine learning experts. That group became ML Commons. Subsequently, they began discussing what ML Commons ought to do. It wasn’t long before a medical-focused working group was formed.
Debo Dutta: And that's when the idea of this med perf paper was born.