From Software Engineers to AI Word Artisans: Filip Kozera of Wordware

Filip Kozera sees parallels between Excel’s democratization of data analytics and Wordware’s mission to put AI development in the hands of knowledge workers. Drawing inspiration from Excel’s 750 million users (compared to 30 million software developers), Wordware is creating tools that balance the rigid structure of programming with the fuzziness of natural language. Filip explains why effective AI development requires working across multiple abstraction layers—from high-level concepts to detailed implementation—while preserving human creative control. He shares his vision for “word artisans” who will use AI to amplify their creative impact. Hosted by Sonya Huang, Sequoia Capital Mentioned in this episode: Lovable : Generative AI app that builds UIs and web apps Her : 2013 Spike Jonze film that Filip uses as an example of how voice will not be the best modality to express knowledge work. Descript : AI video editing app that Filip uses a lot. Granola : AI notetaking app Filip uses every day.. Gemini 2.0 Pro : Google’s newest long context model that can handle 6000 page pdfs. Limitless pendant : Wearable device for collecting personal conversational context to drive AI experiences that Filip can’t wait for to ship. DeepLearning.AI : Andrew Ng’s amazing resource for learning about AI 3Blue1Brown : Grant Sanderson’s incredible channel on YouTube that explains math and AI visually.

Published: Published Mar 25, 2025
Uploaded: Uploaded Jun 11, 2026
File type: Podcast
Queried: 00

Full transcript

Showing the full transcript for this episode.

AI-generated transcript with timestamped sections.

0:00-1:32

[00:00] We have 30 million software engineers in the world. [00:03] And we have 750 million active users of Excel. [00:08] And now you'll ask, like, how does Wordware compare to Excel? And what Excel did in the 80s to data analytics and numbers is what we are trying to do to AI. So in the 80s, you either had to have a team of data analytics people or engineers or you're using a calculator. [00:29] And I would say the calculator equivalent here is the ChatGPT that you every time you need to redo the conversation and every time you need to instill your own needs into it. What WordWare is trying to do is saying, hey, a lot of the things that you do are repeatable, similar to Excel, and you can encode your taste into it with WordWare. [00:59] Bye. [01:07] Today we're joined by Philip Cazera, co-founder of Wordware, who's building tools to help bridge the gap between human creativity and AI. [01:16] Philip shares his vision for why English is becoming the new assembly language for LLMs, why he believes that the future belongs not just to coders, but to people he calls word artisans who can communicate effectively their creative vision to an AI system, and what that means for the future of programming computers.

1:32-2:56

[01:32] - Philip, thank you so much for joining us today. I'm excited to learn about you and WordWare, and get your take on how programming computers is fundamentally gonna change with large language models. - Thank you for the invite. - Let's dig right in. I wanna start with Andre Karpathy. He had a tweet in 2023 that went viral. The hottest new programming language is English. [01:56] What do you make of that? And what does it mean relative to what you are trying to build at WordWare? [02:00] I think syntax will not be as important. You know, everyone will be somewhat of a coder. People used to have to know Python. Now, English is enough. However, you still need to know what you're trying to say. [02:12] And in that way, I would say, you know, [02:17] Not everyone will be able to use it because some people don't have that much to say. And I would maybe rephrase it a little bit in a little bit more of an exciting way for us, is that the assembly language to LLMs is English, but you still have to structure it in the right way. And you still have to use some of the concepts even from typical programming in order to actually make sure that it does what it's supposed to do. [02:44] You heard it here first. Okay. English is not the hottest new programming language. It is the new assembly language. And at Wordware, you were trying to build that new programming language to work with that new assembly language.

3:14-4:47

[03:14] and um [03:16] deterministic with something that's intrinsically fuzzy. [03:20] And that marrying of these concepts and choosing the right affordances and the right abstraction layers for that communication to be incredibly important. [03:32] easy yet enable people to do complex things is hard. And this is what we are trying to do with WordWare as the engine to enable the right mix of the two. [03:43] Let's talk a little bit more about no code. I remember back when I first got to Sequoia in 2018, I remember no code was the hottest thing ever. And it was like, you know, we're not going to program anymore. And, you know, obviously some no code companies have done extremely well, like Retool, for example. But by and large, we still have, you know, we have software engineers today. And so, so far that no code promise hasn't really come to fruition yet. [04:06] What's different now? Like what has changed? [04:09] I think coming back to what I just said as well, the assembly language is English and [04:16] You [04:18] I kind of have this [04:19] small insecurity when people call wordware a no code tool because we have not yet reached a ceiling with any of our clients so you can achieve absolutely everything we've created you know an ability to put in code execution blocks which if you want an escape hatch and wordware is not fully capable of everything you still can do it and in that way you know the document format of

4:49-6:28

[04:49] and how to write them down one by one is still very similar to, [04:54] to how code works. We still have loops, we still have conditional statements, and we have flows calling other flows, which really is function calling. So in a way, we don't see ourselves as a no-code tool. [05:07] And we kind of believe that the word crafters, the word artisans of the future are still coding. It's just, you know, they are structuring the English in a very precise way. [05:21] to make sure that the prompt is populated in the right manner. [05:24] Hmm. Wordcrafters and wordosens. I like it. World War Engineers. You've heard it here first. [05:31] How are you using the English language then to make, you know, since you bristle at the no code term, how are you making no code more code like? And maybe this is a good time to just give a 30 second overview on, you know, the word where product and how people use it to build things. Sure. So by trying to bring that structure to the intrinsically fuzzy English language, we've created an editor where you can use the similar concepts to product. [05:56] software engineering, looping, conditional statements, functions, calling functions. [06:02] and... [06:03] marry it in that editor in that natural language IDE. [06:08] in a way that people can construct agents. Something that we're not, we're not cogent, we go straight to agents. And in my definition, agents are almost like they are still software, they're still a little bit like software taking in inputs and outputting outputs, but some of the stages of what they do is fuzzy.

6:28-8:07

[06:28] So, [06:30] Merring that structure in the editor, whatever you build there, whatever you iterate on there, you then have three different ways of deploying it. You can deploy it as an API that will power your product or that AI button or that AI chatbot that is a little bit more complex than just doing a vanilla API call to Cloud or OpenAI. [06:50] That's number one. Number two is you can deploy it as a workflow where, you know, the main part of the workflow is not like Zapier. It's more AI native. And really the brain of the AI is a little bit more complex than a couple prompts strung together. And the third way, the third thing that we're building is the GitHub for AI, let's say, for people to share and what they've done. [07:20] things as components. Again, marrying some concept from software engineering, you need other people's components and libraries. And, you know, you want to be building on top of the, on the shoulders of the giants of these AI thinkers. So again, hardware engine, editor, way to actually create these agents and then three different ways to deploy. [07:39] I think one of the beauties of code is just its expressibility and its precision. You know, you know exactly what you were telling the machine to do and you were expressing it in some languages in the most precise way possible. English is not like that. To your point, it's fuzzy. And so as you are turning the English language more code like, should I think about that as, you know, helping with the controllability and the steerability of that? Or, you know, what is the, I guess, abstract thing you were doing with English to make it more programmable?

8:07-9:37

[08:07] Yeah, I think you're exactly correct. We are trying to bring a little bit more structure. It's not all the way because if you go all the way to a programming language, you lose the fuzziness and you lose the power of it. But it's hard. And right now, most of the people, you know, we had this. [08:24] way for evaluation software companies at some stage. And what we've realized with a bunch of companies that we work with is that they don't know [08:33] Like they don't have these data sets to use for evals. And what we came up with is that editor very quickly gets you to understand the [08:43] and develop an intuition of what works and what doesn't work. And for now, this is the most important thing. It's clicking run 100 times quickly and making sure that what you've written here [08:55] has enough structure in order to output things on the right side that you want. [09:00] And in that way, you know, as long as you know what you're trying to achieve, [09:04] And this is very hard. You know, we have a lot of companies coming in and just saying, hey, I predict weather. I literally had a big customer say, can you guys like predict weather for me? And, you know, that's not the case. You need a document where you outline what you're trying to do. And, you know, even that documented that has enough structure. [09:23] if you have done you know an intro those are the inputs you will be you'll be playing around with images and pdfs and then you will manipulate it in this way and then you will get some outputs and [09:35] developing that intuition

9:38-11:09

[09:38] is just enough structure for today. [09:41] Soon, maybe we'll be able to do better evals. [09:45] But for now, a lot of people really don't know [09:48] at the moment when they start playing around with Wordware, they don't know what they are trying to achieve. And one of our customers coined this term of, [09:55] speed of creativity with word where [09:58] is higher. So they learn what they are actually trying to do [10:04] as they... [10:05] encounter problems with the underlying models and they realize Gemini 2.0 Pro might be better, and maybe Gemini can take huge PDFs, and Claude can kind of take PDFs but smaller, and then GPT-4 cannot take PDFs. And you know, they develop that understanding and that helps them to structurize their faults. [10:29] How do you instruct the machine to go from intent to outcome? Like, let's say I'm a brilliant filmmaker and I want, you know, I want to use wordware to create the next the next hit. What do I do in wordware in order to make that happen? [10:44] Yeah, so for now, we focus on knowledge workers, where that is a little bit easier. I think using your example for figuring out how the future will be, you know, if we have, let's say, you know, [11:00] George Lucas playing around, hypothetically playing around with GPT-7 and trying to create Star Wars.

11:10-12:43

[11:10] He might type in just the prompt being like the two sentences. And he would just say, hey, create a movie about wars between star systems. And that's just enough to give a model an ability to run on its own. And this is actually not what you want. You want to convey your creative vision, which for now in Word, where it's just knowledge work. But soon it will be all work. [11:35] work where it needs that human sprinkle, that human taste. And these are the things that I really value is like, [11:44] People say, oh, nobody will have a job whatsoever. I don't agree with it at all. [11:49] I think the human taste and how you do things will matter even more. And I use this George Lucas example because it's a little bit easier to understand how taste is influencing that. [12:00] Everywhere. [12:01] Writing a good email is dependent on good taste. Figuring out I was just hiring for an executive assistant and... [12:11] Like everyone in our company needs to show that taste and needs to show... [12:16] a little bit more conviction in the way that they do things. So for executive assistant... [12:20] like she needed to choose a right restaurant for our offsite, you know, and that also has taste. I don't want to trust an AI with this. Um, [12:29] So yeah, that's kind of that sprinkle of human touch is very important. So taste as the last bastion of humanity. I think so. I think creativity and taste. Do you think machines can learn human taste?

12:43-14:15

[12:43] I think they can, but that's not the point. There is an interesting analogy here. They put humans into an MRI machine. [12:54] And they've shown them two different pieces of art. [12:57] Both of them were done by AI. [12:59] And they told the people, hey, one is created by a human artist and another one is created by AI. Our brains work completely differently and our different parts of the brain fire when we assume human intent behind something. So, you know, I can create a song and just like it will be a good song with Suno or whatever and send it to my friend and he'll be like, yeah, it's a cool song. [13:26] But if I ingrain my intent and I'll create a song about our, you know, skiing trip to Chamonix and I'll make it funny and I will... [13:37] that intent will be there i'm pretty sure his brain will be firing in a completely different way giving him [13:43] Like, [13:44] giving him a completely different experience in that way. Does that make sense? It makes a lot of sense. Yeah. I love it. [13:49] I'm going to transition to talking about the next billion developers. And you've alluded to this a few times in the conversation, and I really want to just pull on that thread. [13:57] So you started this conversation by saying, you know, there's a certain set of people in the world that know how to code, but there's a different set of people in this world that have creativity and have ideas. [14:10] Do you think that set of people is larger? Is that how we get to the next billion developers?

14:16-15:46

[14:16] There's an interesting analogy here. We have 30 million software engineers in the world. [14:21] And we have 750 million active users of Excel. [14:26] And now you'll ask, like, how does Wordware compare to Excel? And what Excel did in the 80s to data analytics and numbers is what we were trying to do to AI. So in the 80s, you either had to have a team of data analytics people or engineers, or you're using a calculator. [14:46] And I would say the calculator equivalent here is the chat GPT that you every time you need to redo the conversation and every time you need to instill your own needs into it. What World War is trying to do is saying, hey, a lot of the things that you do [15:03] are repeatable, similar to Excel. And you can encode your taste into it with wordware. [15:11] Um, [15:12] And I believe that [15:15] that taste will be important, as mentioned before. And I think the next 500 million or a billion users of AI might be calling them, I don't know what will be the term, hopefully it's hardware engineer, but, you know, word artisan or whoever. [15:31] The really important part here is that they need to know [15:35] what the AI is supposed to do. Many, many, many times, you know, we are a horizontal tool and... [15:42] people come to us and they say, hey, what can I do with AI? Yeah.

15:47-17:26

[15:47] And... [15:48] I tend to explain it as if for now it's an intern, but intern after university, and you need to write out on a piece of paper, a couple of things that you would want to do with the intern. So you need to say, hey. [16:04] This is your job. This is the title of what you're trying to do. Here are some of the documents or input that you will be working with. Here are the data sources. And here's the output that I'm expecting of you. [16:17] And the important caveats here that people don't often understand is that the data sources has to be something that you trust. You can't just say, hey, go and search the Internet, because often you end up with things that, [16:30] you don't [16:32] agree with and if the intern works on top of that that's a problem and another one is and this is very important is that you're going to trust the intern with this so you know if you want to send a thousand emails uh to every person that that that needs a response in your inbox with some people you just won't trust an intern to do this um and this is how like ai works uh right now so as long as you have a job right now that you say hey if i haven't one intern i could easily explain [17:02] them. And a lot of our work is like this right now. We often read an email, go search in Dropbox, go search Notion, and then we create a response that is essentially based on this database that we curate. Um, [17:15] then you can be using AI. And I think more and more, this knowledge work is going to be automated. And I think, you know, in that way, the next billion people are going to be...

17:26-18:58

[17:26] You're going to need two things. [17:28] intent [17:29] what action needs to happen. [17:31] and taste. How do you want to do this? And all of the rest will feel like CEOs of the biggest enterprise because we will have a thousand knowledge workers working beneath us. And [17:42] trying to actually execute on these two things. [17:46] What I heard just now was a lot of automation about knowledge work. [17:50] I mean, the thing that... [17:52] I'm most intrigued about within AI and within generative AI is its generative capacity, including the ability to create, you know, you mentioned the George Lucas example, but also to create, you know, new applications, new marketplaces, you know, new products. And so do you imagine, do you see WordWare primarily serving, you know, making the knowledge worker more productive? Or do you see it also assisting in kind of the creation of new products, services, you know, pieces of art? [18:22] for now, it's mostly about the productive work, I would say. [18:27] It's, you know, the AI engine is... [18:30] The AI heart of your product is Wordware. Currently, we have not dipped into the generative UI part of things. We're not Lovable. We've actually used Lovable to, you know, wrap our AI heart for some of our customers. And that has worked great. I'm just so impressed with their product. Talk more about this. Lovable, I think, also sees themselves as, you know, enabling the next billion developers.

19:00-20:35

[19:00] vision. You know, how do you think your view of how the world will go foots with with their view? [19:07] Um, [19:09] And why didn't you choose to make that style of no-code tool? Yeah, because the way that I see the word developer is a little bit different. They see the word developer as what developers do today, which is, [19:24] A lot of SaaS is a wrap around a database with some dashboard and ability to manipulate that data. They are creating a much more personalized dashboards, you know, and... [19:39] a lot of people are going to create incredible vertical SaaS based on Lovable. [19:44] And I think that's incredible. And the one thing that was missing through all of this is that they not only... [19:50] grab the UI part, they also grab the database part, which many people do not know how to manipulate. And hence, they unlocked a lot more use cases. [19:59] whatever but what we are trying to to say is that this part of like creating a generative you're creating a ui on top of a database is not the future the future is to you [20:12] actually utilize this reasoning engine that an LLM is in a productive manner. And we focus on that [20:20] that substance of AI at the beginning. In the future, we might want to expose that engine, [20:28] as in a UI, maybe it's a chatbot, maybe it's digesting some images, et cetera.

20:35-22:26

[20:35] But the real important part is the AI engine. And so if you think about an app as, you know, there's the UI, there's the application logic, and there's a database. What you're saying is, you know, you really want to just... [20:45] You know, knock it out of the park on the application logic, so to speak. Yes. And I think, you know, [20:50] Um... [20:52] the databases that we used to work on were discrete. [20:57] And right now we're able to work on a lot more data, which is not structurized. And this is the big, big difference right now. A lot of SaaS right now will still work in a similar manner, just the database is fuzzy. [21:14] And the database might be what you see every day. And how the hell would you put that in a typical, normal, you know, database? [21:22] And I think working on top of that context is the really exciting part for me. [21:28] Let's go back to this concept of word artisans or wordware engineers, if everything goes right, of... [21:35] You know, what's your vision of what a hardware engineer looks like in 10 years? [21:38] Oh, that's a tough one. [21:41] I've... [21:43] have been thinking a lot about... [21:47] What does work look like in 10 years for human beings? [21:52] And... [21:54] I was struggling with this at the beginning because [21:59] It's really hard to understand people's jobs even today. And often I boil it down to the software that they use. They can talk a big game about strategy and, you know, I set the mission. And in the end of the day, I ask them, do you do meetings? Do you do email? Do you do PowerPoint presentations? Do you work in Excel? Or do you work in code? And I just want to understand what does work look like in 10 years? And like, what are you really working with?

22:29-24:10

[22:29] we hear like interface when you just talk to the AI and it does a lot of work for you. I think, to be honest, voice is kind of not the best modality to express that. Hence, [22:41] I kind of think that in its simplest form, WordWare is a document where you jot down your thoughts and you do it in a more structured way. WordWare, a co-pilot AI is helping you throughout structuring it. And in the end of the day, you behave like a CEO, which sets the strategy intent and all of that on that piece of paper, essentially on this blank canvas. And you draw that vision carefully. [23:09] Maybe it's even more than words. It's just, you know, you generate this vision of how your own enterprise works. And, you know, I look at different things around us and I see furniture or shoes or whatever. And I think there is taste ingrained into what kind of shoe would you like to make. So in 10 years, if somebody wants to become a creator of the best brand of shoes, you [23:36] It becomes about, that shoe becomes a luxury object, which has ingrained taste and intent in it. And then a bunch of things in the end will be... [23:48] will happen on its own. The really tough parts, even manufacturing it and so on, will happen on its own. But what's your job in the future is talking to other CEOs. I think we'll not, humans don't want to lose that control. So you will talk with other CEOs about maybe doing a partnership with your shoe brand and somebody else.

24:10-25:53

[24:10] you will have to be still critical about the intent of the other person. And you have to instill taste and your own creative vision into that shoe. Do you think that, you know, do you think a billion people globally will be capable of programming a machine in English language, the way you describe or in wordware documents, the way you describe? Because it does require, you know, it's almost like pseudocoding. And, you know, there is logic, there's loops and things like that. [24:40] Like, do you need to be technical in order to use Wordware? Like, who is the ICP today? And what is needed to move that ICP so that you can reach a billion developers? Yeah, I think... [24:50] Right now, what we've enabled is people who are somewhat technical, CEOs, technical PMs, high up in the org chart. [24:59] to ingrain their own kind of [25:02] think that they know what needs to happen and get there quicker, you know? So, um, [25:08] Max from Instacart, for example, he is a founder and he spent four days just, you know, refining his idea. [25:16] in wordware instead of hiring a whole team, but he is somewhat, you know, analytically minded. [25:23] And for now, that's the case. We did not want to make too much magic because the models were not there. [25:30] And right now what we're doing is we're moving more into that blank canvas when you just describe the idea and we take care of guessing the right structure. And you still will be able to, you know, in a very fine grained way, edit it. But you will start playing a lot more. We'll probably use all free to get you to the first draft of how that flow works.

26:00-27:33

[26:00] really whether we will have 1 billion developers, you know, working in wordware. [26:05] It becomes a much bigger question here. It becomes a question of like, will a billion people want to do productive work? [26:11] Like, you know, we just talked about the shoe. How many people will have the drive to... [26:17] put out something [26:18] to the world and they will want to express that creative vision. Maybe in, you know, post-resource scarcity world, most of it won't work. But I think we'll still have the equivalent of billionaires. And it will be about influence. It will be about taste. And it will be about how you utilize your own resources and how do you multiply it to have the equivalent of future money. And I went a little bit deep here. [26:48] deeply human drive. And I think that exists in a post-capitalistic world. I also have that opinion. And I really believe in humans. Like I want them to succeed. Like somebody asked me, one of our prospective employees asked me, what, like Philip, in 10 years, what do you want? [27:05] there to be like, what have you done? And I want to save like the human creative vision. I don't want everything to be AI. I really have the pleasure when I go to an artisan shop on my holiday and I know that somebody put in the intent and put in the work and I want to interact with it and I want to interact with the story of it. Okay. So today your ICP is the analytical creative, which is a little bit of a unicorn. And over time, as you kind of lower, as the models get better,

27:35-29:10

[27:35] lower the bar so it'll really be just more of the creative is your ideal user. You're going to lower the bar of how analytical you need to be in order to use word. Yes. But at the same time, you know, my use of the word creative is not... [27:48] to what most people associate it with right now. I think a good creative is also, you know, using growth channels in the right manner. They are creative about everything that they do in this new, uncertain world of AI where everything is changing. And, you know, I'm not thinking only about an artist that's painting on the canvas. It's [28:10] I think creativity can basically show itself in so many different aspects of work. [28:18] Let's talk about user interfaces and, you know, the future GUIs, the GUIs of the future. Right before we filmed this podcast, you made the analogy that, you know, transformers are the new transistor. Maybe say a little bit more about that and what you think the new GUI is going to be. [28:34] So I think the analogy here is that if... [28:38] The LLM is the, well, if transformer is the new transistor and it's being packaged as the model, the model is kind of the mainframe, let's call it, you know. And then we took our sweet time to utilize the power of that mainframe in a GUI that's accessible by billions of people. You know, there has been really two big spikes there. The first was the desktop. [29:05] you know, and Apple came coming up with their GUI. And the second one was mobile.

29:11-30:53

[29:11] And... [29:12] You know, right now we are almost like [29:15] exposing the numbers and the logic in a chat [29:18] style thing and nobody has had a better idea. [29:22] We think that the document style is better for creating, like doing... [29:27] kind of more complex work. [29:30] Because often when you're trying to achieve something, you just give it two sentences and the model just runs on its own. And it's just enough. The two sentences, our lead investor recently said that the two sentences is just about enough for a model to hang itself on. And, you know, you will get something completely different than what you actually wanted. And this is a problem of like lovable endeavors of this world as well. Yeah. [29:55] But... [29:56] I basically think that there are better GUIs coming and, you know, whether they will be based on AR or, you know, there will be an assistant that's listening to everything that we do. That was actually my first company. Augmenting human memory of always on listening devices using GPT-2 and BERT. I've been in this. You've been in this since the GPT-2 days. I mean, my research wasn't into LSTMs, which are the precursor to the transformer architecture. And I've been in this for a while. [30:26] has yet delivered on this. I want everything that I hear to be somewhere in a searchable database that also has the perfect context about me, you know, the way that I want to do things. And I think those affordances and those like, we called it GUI, but it's really the underlying like way of interacting with intelligence is not going to be mainly chat. I just don't believe it.

30:54-32:34

[30:54] Programmable documents. Do you think that is fundamentally what Word Word looks like UI-wise in, call it, five years? [31:01] I think there is more and more magic in it. And... [31:07] I would believe that I want people still to be able to do that fine grain work. You know, we've linked it with George Lucas doing the movie, you know, in a way you almost want to firstly start with the high level thing, the two sentence description, and then zoom in and zoom in and zoom in and create, you know, modules which make the best scene that is five seconds and then combine them together in that way. So what I would like World War to be is to transcend the, [31:37] abstraction layers and, you know, be able to zoom it all out [31:42] Start with a sentence and have it run. Maybe see whether it's working in the right manner. And then as you're seeing that some things are not doing the thing that you want them to do, is to be able to zoom in and see maybe, you know, four sentences of exactly what it's doing. What are the inputs to this? What it's trying to do in the middle? And what are the outputs? You know, that's kind of the most simple one level in. [32:12] iterate on your idea of [32:15] how this should be done. [32:17] How did you arrive at the current user interface? I think it does feel really novel compared to how others are, you know, enabling AI builders today. How did you arrive at the current user interface? Was it more experimentation, listening to users? Was it you philosophizing about, you know, what it should be?

32:38-34:10

[32:38] Currently, the [32:40] And currently and before, the approach to creating these agents was a block based on a 2D canvas. [32:48] And once, you know, I've been building agents for a long time. You know, I think, you know, March 2023, I put out the first article about how to build agents. And me and Robert, my co-founder, we've been in this for a long time. And the more... [33:03] the better the models got. [33:06] the prompting became more difficult because you can do more complex things with it. So at some stage, there was this movement of like the prompt is going away, so on. We actually really disagreed with it. And that idea is gone a little bit. It's like, you know, we came back, did a loop again, and be like, actually communicating your vision is really important. And when we tried to communicate our vision, which was a little bit ahead of what the models could have done at the [33:36] canvas is just not enough. Like if you do a reflection loop inside of a reflection loop, you run out of dimensions. [33:44] and [33:45] We basically really like the way that code is structured. Code has an ability to express very, very complex concepts in a way that is still... [33:59] Like you can still manipulate it and understand it. [34:02] Think about trying to structure the whole Uber app with everything in it on a 2D canvas.

34:10-36:03

[34:10] It would become so cluttered and so messy. You can do the big picture thing, but not really the, you know, you don't want engineers to be interacting in that way. You want the engineers on the future world war engineers to be interacting with something that's easy to grasp the structure of very complex systems. [34:28] Whereas the Uber app actually could probably be described in pseudocode. And it seems like you're getting people closer to that vision versus the 2D canvas. [34:38] Yes. And I think, you know, the most important part here is that Uber has an agent equivalent. And this is what we are trying to build. You know, if you want an agent to decide where is that person going and where are they starting their journey and where they will accept that charge or, you know, you want to maybe make sure that the charge is right for that particular person, there is an agent equivalent there. [35:08] where it's not like you're going to create that whole UI. [35:12] for Uber. And I think, you know, probably Uber... [35:17] is the right abstraction layer. You don't want to be ordering an Uber through a chatbot or through like a voice based thing or, you know, but you might want an Uber to be ordered for you if you have a calendar invite. So, [35:32] You know, in a way that like for your personal use, Uber is nice because you can click around and the agent will not always know. [35:39] But I was coming here and I wanted a Waymo. Actually, Waymo can't get that far yet. But I wanted a Waymo to be ordered and to be ordered perfectly when I need this. And it's almost like a personal assistant would do this for me. And now that capability is open to everyone. So we'll soon have these kind of affordances and these kind of abstraction layers there. I think that's a great note to end on. Should we end on a lightning round?

36:03-37:38

[36:03] Let's go. Okay. One or two sentence answers only. [36:07] Okay, first question. What is your most hot take or contrarian take in AI, not related to wordware or everything we just discussed? [36:17] Uh, pre-training was still going to matter. And Deep Seek is a little blimp that people liked to, uh, people jumped on because people love a good drama and it was connected to China. And actually it doesn't matter that much. Okay. I know I said lightning round, but you have to say more. What do you mean it doesn't matter that much? [36:41] I mean, they utilize some cool techniques, and the rest of the community is going to learn from that. However, you know... [36:50] Like the fact that they like trained it for a little bit cheaper for like a lot cheaper does not involve all the experimentation that they did before that. And, you know, I don't know if I'm supposed to say that, but I'm pretty sure they had access to the best NVIDIA as well for that experimentation. And yeah. [37:09] It's not that novel. People jumped on it because they were like, oh my God, China is taking over the race and so on and Nvidia stock price plummeted. And I just think it's another place where some models were trained that were open sourced and we're not going to remember it in like 20s. [37:28] a year or like even six months or maybe they will take over. But the model doesn't really matter that much. How you kind of work with that best model out there. That's what matters.

37:38-39:12

[37:38] That is a hot take indeed. Okay. Next question. Who's going to have the best frontier model next year? [37:44] Oof, um... [37:46] I think OpenAI is always super bullish and they always promise a lot. And then I was just on a talk with Sam Altman on the YCAI retreat and the O3, the way that he pictured it sounded great. But I think we both know that they overpromise a little bit, a lot. And... [38:06] I love Antropic. [38:08] I think their kind of vision and their kind of the way that they've created this is great. But recently, Gemini 2.0 Pro launched. [38:16] with their abilities to ingest 6,000 pages of PDF is really blowing my mind. So end of the story is, I have no clue. This is a place where it's super fragmented and people have zero loyalty. Pre-training is hitting a wall. I think, you know, famous people, including Ilya, have been quoted saying something to that extent recently. Agree or disagree? [38:40] disagree right now. I think, you know, it's [38:45] The intelligence of a model is linked logarithmically to the resources that are needed to train it. [38:53] But, you know, doing a 2x of intelligence is... [38:57] And it's on its own. [38:59] exponential. Like if I'm smarter 2x than somebody else, it doesn't mean I'll do 2x of the work. It means that I'll find ways that probably mean I'm a 10x or even more.

39:13-40:57

[39:13] Favorite new AI app, not WordWare? I would say I started to edit content because we need to explain and educate people a little bit more about both WordWare and AI. So Descript is something that I've been loving. And I use Granola every day. And the newest model that I'm really impressed is the Gemini 2.0 Pro. [39:33] I really like it. [39:35] That is, that's a hot take as well. I haven't heard much of that from people. [39:38] I think it came out like four days ago. So people have not been playing around with it. Their PDF capabilities are awesome. [39:45] What application or application category do you think will really go mainstream and hit this year? [39:51] I would love to see. I'm personally very, very involved with that whole... [39:57] AI, having the context of your life and being able to, you know, basically make better decisions based on the context. And, you know, I've rewind, which, you know, I think they are called limitless right now. I've ordered their pendant. By the way, it's been like a year and a half and I still don't have it. [40:17] I don't know. Send it to me. And I had to change the color because they didn't have the color. But I would love for there to be a provider which has a lot more context and can do the personal stuff for me. Don't you think that's Apple over time? [40:38] I was just about to say, I think [40:41] Ideally, that N421 model or whatever it's called of the AR glasses that they are trying to push out there, which I think Facebook has taken over a little bit. Maybe we'll see early stages of that. And I think they're the only ones.

40:57-42:31

[40:57] where the privacy really like they have a good brand around privacy and two even if you're [41:05] new AR glasses run out of battery, it's still cool to be wearing a $5,000, you know, a piece of hardware. And maybe that's the UX. [41:17] But I don't know what that UX and like a microphone so far failed. [41:22] Yeah. [41:23] single piece of content that an AI aficionado should read. [41:28] Or watch. [41:30] I would say all of the deep learning.ai resources, everyone, like we have a bunch of candidates apply for jobs. By the way, we are hiring whatever I should be looking very, very aggressively. So come join hardware. [41:46] The deep learning that the resources are awesome and they explain everything from [41:52] from the bottom layer all the way to the practical layer of how to actually get it done. I also think if you don't understand the underlying technology, go see FreeBlueOneBrown, an incredible channel on YouTube, and they explain everything super well. [42:09] Yeah, I think. [42:11] Wonderful. Your lightning round was full of hot takes. I didn't even have to ask you for a specific hot take. Well, Philip, thank you so much for coming on. I really enjoyed chatting about, you know, how you see the world evolving from developers to word artisans or, you know, wordware engineers, if everything goes right. And appreciate you sitting down to share your vision and your hot takes.

42:31-43:02

[42:31] Thank you for having me. Thank you. [43:01] you

Want to learn more?