Colin Murdoch (Google DeepMind): “Gemini will transform the way billions of people live and work”

Google has been the great dominator in the development of artificial intelligence (AI) systems for years. The acquisition in 2014 of DeepMindthe startup London company focused on AI research that developed a program, AlphaGo, capable of defeating a great champion of Go, the complex Asian board game, and opening the debate about whether the capabilities of this technology would end up surpassing the human mind.

But Google's placid dominance was interrupted last year by another startup, in this case OpenAI. The launch of ChatGPT, the most successful application in history, caught big technology companies off guard, as they were forced to accelerate their AI programs. In April of this year, DeepMind, which until then had functioned as a relatively independent research laboratory, and Google Brain, the technology company's other major research division, merged into a single organization, Google DeepMind, in which several of the best scientists in the world in this discipline.

Colin Murdoch (Glasgow, 45 years old) is the business director of this new AI super division of Google that has just presented its first toy: Gemini, a multimodal generative AI platform that can process and generate text, code, images, audio and video from different data sources. Those who have used it say that it far surpasses the latest version of ChatGPT and that it puts Google back in the fight to dominate this market.

An electronic and computer engineer by training, Murdoch joined DeepMind nine years ago, after accumulating experience managing startups and large corporations. His job is to ensure that the advances in AI made by Google's scientific team end up having an impact on the general public. He attends to EL PAÍS from London by video call.

Ask. Is Gemini the definitive answer to ChatGPT? What does it bring new to the popular OpenAI application?

Answer. Gemini is a significant advance in the development of AI. It's our largest and most capable model to date: it understands and reasons text, images, audio, video and code, so it can help people be more creative or learn. For example, let's say your child brings home physics homework and needs help understanding what he has done right and wrong. If you took a photo of the page, Gemini would not only give you the correct answer to the problem, but would read the letter and explain what the child has done right, what he has done wrong, and the underlying concepts. Users can also interact with Gemini through Bard, which now works with Gemini Pro and is more effective for understanding, summarizing, reasoning, coding, and planning. It is already available in English in more than 170 countries, and in the coming months it will be available to billions of people through other Google core products such as Search, Ads, Chrome, and Duet AI. In the long term, tools like Gemini will transform the way billions of people live and work around the world.

Q. What do you think of the commotion caused by the departure and subsequent reinstatement of Sam Altman as CEO of OpenAI?

R. They were a very interesting few days in the industry. But we remain focused on our work of launching world-class products and research. We've had an incredibly busy few months: since the announcement of Lyria, our advanced AI music generation modelwhich will increase creativity and generate new forms of innovation for artists, creators and fans in the future, until the publication of GraphCast, our weather forecasting system latest generation, and Gemini. We are very confident in our technology portfolio and are excited about the year ahead.

Q. What is artificial intelligence capable of right now?

R. Our research lab seeks to improve people's lives, and I think AI is a good tool for that. As long as you work carefully, which is in DeepMind's DNA. One of the areas I'm most excited about is what we call science at digital speed: AI helping to advance scientific progress. I'll give you an example. Proteins are the building blocks of cells. When they malfunction, they can cause problems or illness. So science has been studying the structure of proteins for years, specifically the shape of those structures, which is what really tells us how they work. The number of shapes that these structures can acquire exceeds the number of atoms in the universe. Well, two years ago, a DeepMind team managed to develop an algorithmic model, AlphaFold, capable of determining the structure and appearance of amino acids, the essential element of proteins. We know the structure of 200 million proteins, and we have unlocked that knowledge. Our tool is being used to accelerate research into methods to combat antibiotic resistance. It is also being used to accelerate research into enzymes that eat plastic in the oceans. And in cancer vaccine research.

Q. Have they made progress in other areas?

R. We are hopeful that there will be progress with nuclear fusion, the cleanest energy source that exists. To achieve this, plasma is circulated through superconductors at high speeds, about 10,000 revolutions per second. Magnets are used to reduce the friction of the plasma in the tubes. We are using AI to try to optimize, in real time, the calibration of these magnets so that the resistance is as low as possible.

Q. The interest of the general public, and it seems that also of companies, has turned towards generative AI. Do you think that could harm progress in many other areas of AI, like the ones you just mentioned?

R. We have been working on generative AI for a long time. In fact, the models that are now successful are based on an architecture called Transformer that Google scientists developed five years ago. What has happened in the last 12 or 18 months is that things have escalated very quickly: we have bigger models and more data. The fundamental change is that we can relate to those models with conventional language, like the one you and I are talking about now, and that makes it more accessible. Before, only computer scientists could relate to this technology; now, anyone who knows how to speak and write.

AI is very powerful and promising, but we must be very careful because it is a very powerful technology

Q. You have mentioned the importance of being careful when developing AI. What kind of rules are they following?

R. AI is very powerful and promising, but we must be very careful because it is a very powerful technology. We have a number of operating principles for how we can conduct our investigations. A second element is that we do research ourselves in areas such as bias and equity, to ensure that we address those challenges correctly. Thirdly, it is important to have a correct institutional configuration in the organization and the appropriate culture. We have multidisciplinary groups that include ethicists, engineers, and a wide range of professionals with different specializations, who test and analyze the benefits and risks of each system we develop. We also invite external specialists to help us.

Q. How do you think this technology should be regulated?

R. Regulation is important. I think it has to be measured and proportionate so as not to constrain innovation and at the same time it can mitigate the big risks, because I think this is an exceptionally promising technology.

Q. Do you think the approximation of the regulations on artificial intelligence that the EU has just approved is correct?

R. I think so, it establishes a proportionate and risk-based approach to each tool. It seems to me that this is a good starting point for the global debate. It is important that we try to promote that kind of coordinated approach to regulation and policy around the world so that we can maximize the benefits for everyone – and there are many – and we can also adequately mitigate their risks.

Q. DeepMind was until now Google's advanced AI laboratory. Has it changed after your business integration with Google? Do they now have to orient their work more towards commercial results?

R. I think the merger has been a very successful move. On the one hand, we have a scientific team unparalleled in the field of AI; on the other, a gigantic market that we can access thanks to Google, which offers us the possibility of trying to solve people's problems. My job is to find ideas at the intersection between these two spheres. And, when we find them, incubate each idea and take it forward.

Q. Give me an example in which those two elements coincided.

R. At Deepmind we have software, MuZero, capable of playing chess, Go and other complex games. One day, talking to someone at YouTube, he told us that they needed to reduce the bandwidth needed to get our videos to people all over the world, so that you can watch them regardless of the speed of your internet connection. There was a very creative moment when we realized that a video, in essence, is like a game of chess: it can be viewed as a succession of individual still images, and there are transitions between those images. Each of those images can be a position on the chess board, and the transitions, chess moves. So we applied MuZero to a video and gave it the goal of reducing its size, of compressing it. We saw that it had a dramatic impact on the weight of those videos, and now that technology is already built into YouTube.

You can follow EL PAÍS Technology in Facebook and x or sign up here to receive our weekly newsletter.

Limited time special offer