technologyeducationbusiness

Web News: Trying Claude Code for the First Time

HTML All The Things - Web Development, Web Design, Small Business·Mar 14, 2026·33:52

About this Episode

AI coding tools are evolving quickly - and the latest generation of “agentic” development tools are changing how developers interact with their codebases. In this edition of the Web News, Mike introduces Matt to Claude Code for the first time. While Matt already uses tools like ChatGPT to assist with coding, he hasn’t yet adopted the newer workflow where AI agents can plan, generate, and modify entire projects directly from the terminal. During the episode, Mike walks through a live demo of Claude Code by attempting to generate a brand-new website for the HTML All The Things podcast and blog. Along the way, they explore features like plan mode, discuss how agent-based tools approach software development, and examine how these tools compare to more familiar AI assistants. Throughout the demo, Matt reacts in real time - asking questions, challenging assumptions, and trying to understand how these modern AI development workflows actually fit into a real developer’s process. If you’ve been hearing about tools like Claude Code, Codex, or AI coding agents and wondering how they actually work in practice, this episode offers a firsthand look at the experience of using them live.

‍Show Notes: https://www.htmlallthethings.com/podcast/trying-claude-code-for-the-first-time

Hosts & Guests

Matt Lawrence

Host

Mike Karan

Host

Transcript

All right, everybody, this is the web news and we are going to be doing something different.

I have no context for this, which is why I'm kind of hesitant in my intro because I don't use

cloud code at all. I do use AI to help me code here there, but I do not use cloud code. I don't

use any of this agentic workflow stuff at all. And so Mike said for this web news, just sort of

live. And I again have no context, other than maybe I get a YouTube video or two, he's going

to show me some stuff. So Mike, why don't you why don't you take it away? Yeah, absolutely. So

I think it'd be cool for us to just kind of experience this live together. I don't have anything

like super, super planned. I want us to stumble. I want us to get through this. I want you to ask

questions as you go. But since you've never touched cloud code, I think there's going to be a lot that

will present itself as maybe confusing or maybe interesting. So I figure this will be a good

experience. And I'm kind of glad you haven't touched it. I kind of glad that I get to be the first

one to show you what you can do with it. We're not going to do anything too crazy because we obviously

want to keep this web news long, like so 15, 20 minutes. But we are going to at least try some

things. And I'll show you kind of the agentic side of things. So I do want to show you how you can

like kind of launch a browser, ask it to do stuff in a browser, help you in your like day to day

life, as well as the programming side. Well, to give you any context, I have opened codex and

done one demo project, which is actually what the next episode we're going to record is going to be

about. And then I've watched a cloud code video on YouTube because I knew we were going to do this.

And I just at the very least wanted to understand generally what we're doing. But I have not I don't

have cloud code. I don't have any subscription. I never downloaded it or anything. So just to

give people like if people are like, Hey, you kind of sounded like you do that. It's like I've seen

a, I've seen a YouTube video and I've literally opened codex and for like five minutes. Yeah.

Well, that makes sense. That makes perfect sense. But yeah, hopefully, hopefully we'll get

somewhere or maybe we won't. Maybe this will be an epic failure because cloud is down at the moment

or something. It should be fine. I think I checked. Although we just tried to do a cloud code

thing with web flow and web flows down right now. So fantastic. Yeah. My first demo already failed.

This is attempt number two. Anyway, just jump it in right now. What we're going to do is like

to start with any project. We're going to actually create a directory for it. So that we're

going to call this McDermot one, right? And then CD into demo one and clear the screen. Everything

should be nice and cozy for us to begin. So right now, are we just on like a terminal? Like

we're just on a Mac term on Mac either, by the way. So this isn't this isn't just a Mac terminal.

This is called ghosty. It's just a terminal emulator that has a lot of extra functionality built in,

makes it faster. It's more like compatible with two ease, which are terminal user interfaces.

Oh, yeah. Ish term. But yeah. So it's just a better way to kind of interact with cloud code.

The terminal like the Mac terminal actually is pretty glitchy with cloud code.

Oh, that's thanks. And then again, just to give context, I do use terminal and command line.

Just I'd use that on Windows primarily. Yeah. So with this, all we have to do to start a

cloud code session is type in cloud. And that will give us, you know, it'll tell us what MCP

tools are called. Don't worry about this isn't really not important. That's more for my work.

And here, this is where actually in cloud code now. So this is still terminal. But this is,

again, that too we were. This is a terminal user interface. Yeah. So we don't have,

I can't perform traditional terminal commands in here. Like I can't do like a, you know,

a McDurren here. This is literally like a prompt box, like a text box where you can talk to the

the cloud, the cloud code interface. And this is your chat GPT like chat box kind of kind of

the only difference between this and chat GPT is the orchestration and the agentic side of it

is quite more interactive with your computer. Chat GPT is interactive with the web.

It can't really action on your computer. You can give it context. You can, you can paste in

some screenshots. But with this, it has access to the terminal. Right. So it has access to pretty

much anything that's on your computer. And depending on what kind of level of access you give it,

it has, it can do more things autonomously or less depending on if you allow it to like go free

rain. I haven't given it the the free rain yolo access because I like to least like for the

dangerous commands, like, especially with like in production work that I do, I don't let it like

push anything to a repo. I don't let it remove any, any files on its own. Like it has to ask me

for everything. But you can give it like an unsafe command essentially that will allow it to do

anything, including, including like literally delete your entire disk if it gets that crazy.

It has, you know, commands inside of it not to do that, but it's, it's happened before.

So this is why people use like a Mac mini, right? Like this is why, because although they mostly,

I guess use that for open claw is what I hear, but I mean, a separate Mac mini or just a separate

computer or bluntly said, that's correct. Open claw is essentially just an automated version of

cloud code. Like it has cloud code built in a lot of times to the open claw and it just allows

cloud code exactly what you're saying to be kind of sandboxed in, right? So that even if even if

it's running unsafely, the worst it can do it is wipe itself rather than access your, you know,

personal files and wipe them or whatever. That is exactly it. So have you said that?

Our first example, what we're going to do actually is we're going to take our HTML

of things website, which I love and it's, it works great. But I want to see what

Claude's design capabilities are. Like I'm not really that good at design and Claude is

apparently getting better. Opus is getting better at design. So I'm curious what it can do like

with our current website. So I'm going to tell it to go into our current website and look at all

the content. Well, I'm going to it's going to it's going to it should be able to gain context

of what it is. Like it should be a, you know, a podcast website with blogs. And I want it to

design a better, more catchy cover and I want like I want there to be a significant difference

than what we have right now. Well, to be fair, the thing I built was years ago. We did it

quickly and we haven't facelifted it. So I definitely think it could use a facelift.

Yeah. So maybe this will give us some ideas. I doubt it'll be able to replace the website

because we only have 15 minutes, but maybe this will give us some cool ideas at least, right?

So let's try let's do that. So in this, it would be just like chat GPT. So can you please,

I'm saying please to the bot, take a look at HTML, all the things.com. And I want to redesign the website

and the website to be a bit more modern and have a much more actionable cover portion of the site

and be clear about what we, what we are, which is, which is a web development podcast.

And which web development podcast and I don't know what else. What do you think? What else do I add

here? Like, I don't know how much context you have to give it like because we cover AI,

we talk about some small business stuff. I don't know if we need to do that. Yeah, let's do that.

We're going to a podcast. We cover AI web development web dev careers like dev jobs, careers and

freelancing. Do we want to limit it? Maybe like, should we, what should we say? Like maybe

three, let's do three AI web development and small business or we talk about dev jobs quite a bit.

Dev jobs. Okay, let's do that. And then, okay, so that's that. Then we could talk a little bit

about text actions. I'm a svelte person. Let's create a svelte kit site with tailwind. We don't

have to give it this. I want to be clear. It'll probably just make a either a regular HTML CSS

JavaScript site or a next JS site. If we don't give it any information about the text stack,

it's just my own preference to do this because I have more context into the code that it will write

if it does this. So that's literally just a preference thing. With tailwind, with tailwind,

you make sure to grab all the logos and all the logos and colors that match the logos.

Right. Right. From the site as well as all the content you can grab by scraping. It's

creating it. We can't. So my, my initial intention was actually to have it be connected through

an API key because you can do that. You can give it an API key. You can give it a .e and v file.

And it's able to then use that to actually connect to the web flow API and pull in all the API

information, which it would be able to do a lot better because there's a lot more structure in the

API information. Whereas right now, it's just going to have to go through the site map,

which, I mean, it's structured enough. It should be able to do most of what I'm asking for it.

Yeah. And just to be clear, we just can't generate an API thing because web flow is down currently,

unfortunately. Yeah. Exactly. Okay. And I'm going to put, don't make any mistakes with the

shits and giggles. And we're going to let it go. So now it's going to start thinking. It might

start asking us some questions and giving asking us if we can, for example, it'll probably ask if I

can even access HTML with things.com. That's the first thing it does. And I'm going to say,

yes, do whatever the heck you want with HTML with it. Improvising. It says it's improvising.

What the heck is it doing? Yeah. So this text right here is going to be just gobbledy good. It's

literally just kind of a joke. It's not actually, it doesn't actually mean anything that it's doing.

This text up here is the real like thinking and reasoning path.

This is the mind. It's the Minecraft menu yellow text. Pretty much. Yeah. That is correct. So you

can see what it's doing. You can see where where the tokens are going. You can also see how much

it costs, by the way. I am on an enterprise plan for my job. So thank you, Cypherin, for letting

us burn some tokens here. But overall, it's, it's quite expensive. Enterprise plans as well. The

catch there is that they don't allow the typical cloud code subscriptions, which are $200 a month.

Already a lot. They just give you tokens. Like they'll, it'll be $200 of the tokens, which is

way less than the $200 you get with a subscription from the non enterprise plan. So it's just 22 cents

there. You've used 22 cents of computing. Yep. Is that on top of your plan? No. So it's like the

plan itself, I believe, is like $200 per developer. And then so I'm using those that money from

straight from the plan. And then as soon as I go above that amount, it'll start just be on top.

Now, now there's a question. Does it stop and tell you? No. It does give the administrator

a warning. It does give the, you know, like the administrator can set limits. Okay. Yeah.

The administrator can set limits. Yeah. Like I, yeah. So it's, it's controlled by the admin of,

of the enterprise. Yeah. So how long do we think this is going to, because so it's been going

for a minute and 33 seconds, you know, again, counting, it could actually take quite a long time

depending on what you're doing. I'm hoping it will do it in phases. We'll see. And I actually

I already screwed this up, to be honest, and not really screwed up on, not really screwed up on,

not on purpose, but to save time, I skipped a planning phase, which is bad. And that might,

that might result in a worse result. But the planning phase actually takes quite a bit of time,

and then the implementation would take time. Well, what's, like, like, lay that out for me,

what is the planning phase? Like, is it, I understand you're planning, but are you planning

and the bot is doing anything? Is it just gaining context? Is it taking tokens? Like, is it using

tokens to plan? Absolutely. Yeah, it's taking tokens. Yeah. Planning phase is more or less the most

intense phase, because of the fact that it has to go in and research everything. Like, it does a

lot more research. There's a lot more thinking. It'll, it'll almost like think against itself. So

it'll get, it'll make an idea, and then there'll be a reasoning, a reasoning bot that looks at the

ideas, like, no, that's a bad idea. And then it'll, like, it'll, it'll have a lot of back and forth

internally as it, as it's making ideas. And I can show you that while this is going, by the way,

we can launch another one of these. We can go cd.dot and create another maker demo too.

We can launch another cloud instance here so we can have as many cloud instances open as we want.

And here, let's, let's actually do a plan while the, while the other thing is going.

Here, what I want to do is actually a little bit different. I don't want it to make a website or

anything that what I wanted to do is help me with my taxes, right? I did this already. This is

something that I've done. I think I did this in cursor, but I'm, I'm assuming it'll work in cloud

code as well. I need some help with my bookkeeping. I want you to be able to fire up a browser or

me to log into my open AI account. And then for you to take over and download all of my invoices,

so I can match them to their transactions and book. Now we're on plan mode now. So right now,

what it's doing is just considering this. It's not executing anything. It's just going to come

back with a plan like, Hey, this is how I think I'm going to do this. Are you okay with that?

And then you have an option to accept the plan or you have an option to actually like ask it

more follow up questions, do some different things. It could also itself ask you questions,

right? So it's not doing that right now. I was just asking for permission right now, but it can

itself ask you some questions in turn and be like, Hey, how do you want to access it? How do you

want to do this? And then based on those questions, it'll generate a better plan. Let me ask you

a question here then. So like, this is not a question that is out of the realm of what I would

potentially type into chat GPT is what is the reason, I suppose, for you to do it here in

cloud code and not in like a chat GPT. So how would chat GPT do this? You would have to do the

browser extension thing like you, I think there's a browser that chat GPT can control. It's very

similar in this case to that. I would have to I would have to go into my open AI account and

actually like grab context and numbers and things. Yeah, I see that you're prompt, you're literally

telling it to fire up the browser and to do stuff. So this is this is mostly actionable. I suppose

exactly. And we're going to go back to this. As you can see in the terminal here, you can see a

little star. That means this is stopped. So if we go back here, you can see that it's already

asking us a question here. So we can ask it to go on. There's a little thing there that says 23

percent. Is that your tokens? So that's my context. It's a little bit it's different than tokens.

So that context limit is how much it can keep in memory before the model degrades in performance

quite a bit. Oh, and then you have to open up a new chat. No. So it actually auto compacts. So

by the time it hits about 80 percent of that context, it'll ask you, Hey, do you want to

compact this chat, which essentially means it'll summarize the most important parts of the chat

into like a markdown file and leave that mark down file in a temporary space. And then it will

continue on and reference that markdown file as like a memory. Okay. Yeah. Because I so in the

YouTube video, I watched somebody was doing this and they were using cloud code plugin for VS code.

And they had switched. Like I said, we're still in the same project. I might be describing this

correctly. They were still in the same project, but they were going and opening like new chats.

And one of the reasons when they did that at one point, they mentioned that it was because it

would degrade like you're saying. So is this better? Like does the VS code not do that? Or is

that a new thing that cloud code will compact and summarize and then start a new chat?

So if someone's using the cloud code in VS code extension is pretty much exactly this.

When it first came out, it was a little bit behind the CLI and I never really adapted it as much

as I should have probably, but the reality is is that right now it's probably cut up like 95%

of the way. So there's just I think there's a couple of things that it still can't do the same way

that the CLI can do it. But it's essentially the same thing as you're seeing here. Just in your

interface rather than the CLI. Okay. So we can see here that it's finished something. So we can we

can start to like we can hear. I'll I'll actually open up my browser in another window

and see if I can get anywhere with this. Oops. That is. Oh, I see. So you have to actually write run

npm dev. Okay. So it's telling us what to do. You can either do it or you can actually just allow

it like to do it itself, which I will do right now. Run it yourself and open it in a browser or

you don't actually even need to follow the commands that it gives you. The web dev podcast you've

been missing. Man. So just for the listener out there, it's made a website. We're looking at it.

It's got like a big hero section, multiple buttons, some badges and they're 458 episodes

and counting it says. And then it has a course on content boxes and other things so we can see.

And it'll mention stuff as interviews or whatever. It definitely messed up the interviews.

Yes. You can see that it has the web news, which I think that's accurate when clients

ignore it. Yep. That's a act. But then if for some reason, it there's them all all the other

ones interviews, which is not unless you and I are interviewing each other each week. Yeah. I mean,

it could it could assume that. I mean, what do you think? Like what other than the small nitpick

there? What do you think about like the minimal amount of prompting that we did to what we got?

At least the click. I mean, it's it's pretty impressive. I mean, I will say, you know, it's got a

Spotify button, Apple podcast button. It's got some other badges there to say. These buttons should

work. Yeah. The buttons work. Yeah. You clicked on Apple podcasts. If you click on browse

episode, it has an anchor that takes you down the page. Like it's interesting. It's something that

you know, would take one of us a day. Maybe something, I don't know, to like lay out the HTML and CSS.

Obviously, you mentioned there was no planning phase with code with cloud itself. There was

obviously no planning phases in a lot of a website is, you know, especially within a web ad agency

is going back and forth with the client on designs and what they need and all those other things

depending on their project. Of course, do those other buttons work? Can you see the, I see there's

a nav bar that articles there. It looks like everything just kind of anchors you down and it looks

like it did not build out each individual page, but we can do that. I will ask it to build out each

individual. Wait, so what do you mean by that? So if you click, oh, I see. So this has become like a hub

and then it links out to I see. Yeah. So the first step of this was just building out the

cover page, which I mean, it is a reasonable step. And I'm kind of glad it did it that way,

because again, it would have, it would have taken a forever if it did build out each individual

page. What I like about this is it did grab our logo. It didn't quite nail the colors. Like,

I would say the colors are colors at all. They're incorrect. Yeah, they're incorrect. And I did

ask it to use the colors based on the logo. So it wasn't able to do that. Something we can

reprompt so that that's something. What's that? Listen, now button do in the top bar because you

have like other buttons for spotifying that. What is listen now? I think it's the same as the podcast

button. Oh, latest episode looks like the same button. Oh, hang on. Can you scroll down?

So there's latest episodes. Is there like latest articles as well? Oh, no, there is. Yeah,

there is. Well, I was wondering if if that listen now was actually like a like the latest.

Like, I see some websites will have like that page that says like the latest and I'll have the

latest like five episodes, the latest five blog posts, the latest five newsletter. So okay,

well, like I mean, I don't know if it's going to take a while. Why don't we try a correction

live? Why don't we try to get it to fix the colors? That's something that's like very visual,

very obvious. That's that's incorrect and shouldn't take you know, a long prompt. I don't think

right? It's like, what would you write here? Are you writing something? Sorry. Yeah, I'm writing

right now. So I'm writing. Can you fix the colors to be more represented of the colors in

our logo? And I think that should be enough personally. That's what I would say to a designer.

I would I have a question here. So you you can kind of have like that. I always hear

the the official name of it. Let's kind of like the design the brand guidelines, the one sheet,

if you will, where some brands will have that where it has, you know, these are our primary,

secondary, tertiary color, our font sizes, our font families, you know, the list goes on and on.

I've done that for people in the past on like a web page. So I assume could you give it that

file? We don't have that. But if we if we did, that would give it more context. So I recently

created a presentation using this using cloud code. And I literally fed it the cipher and design

guide. And it created a like a perfect designed presentation based on the cipher design guide.

So it looks like it finished. We can probably go and check that out. It's done. And I would say,

where is it? It's definitely better. And where is this living? Is the question? Like, is this

something that's just in a file? Yeah, local host. Okay. It's in local host. It's living. It's living in

that folder that I created for it. That demo one. Yeah. That demo one folder. It's a svelte kit

application. So there's a bunch of code that you can open up and curse or whatever and you can

change yourself. So it's it's living. I wasn't sure if that was in the cloud or not. Because you

can do cloud. I remember when I was kind of like poking around Claude or codex. Man, there's two

many services when I was poking around codex. It there was some stuff where it was like, hey,

you can keep working on this on the go or on the web or whatever. If you like allow like some sort

of cloud or web project. Yes. That's correct. So like we can see, I just hopped into my other one

that's still doing the plan for for the automation tasks that I wanted to create. Oh, it's still

a wow. It's still doing well to be fair. It had a question that I had to answer like it. It was

asking for questions about something. So let me let me ask you a question here. What is the point

of plan mode in terms of in terms of okay. So the bot is doing all of our stuff. It's not like

I'm asking Claude, you know, make a plan and then it gives me a PDF and then me the human goes

and does the plan. So what is the point of plan mode? Is it not running? So for example, if we have

we had used plan mode for the website, would it not build the website? It would let you know I'm

going to build it. I need NPM. I need it would like say all that stuff. It's a more generic thing.

So I can we're running a little bit at a time here, but I can do a quick plan mode here probably.

Can you land out a page a page for the blog and podcasts? Right. So so that we aren't just linking it

to the to the current site. So now you you type that out. It yeah, because you said plan

it's plan mode or did you have to do something like hotkey on it? Yes, it's a very good question.

A shift tab activates plan mode. You can see plan mode is on down here.

And so I did shift tab. I forgot to say that. That's my bad. But having said that, if you do just

say can you plan out, it should activate plan mode on. So that isn't it's not a requirement to do

the shift tab. Okay. Just like it. When you ask it to plan, it does plan. So we can see that this

is asking me something. So let's see what it's asking. Yeah, okay. Just one time. So it's asking me

a bunch of questions. The first question was do you want to check for API or just chat GPT plus?

I said chat GPT plus. The second question there was do you want reusability like you want to be

able to create a script that can do this multiple times? I said no. And here it's asking me what language

to do it in. And I'm going to say type script because that's the language that I usually like.

So it's just going to write a script. That's what it's trying to do. It can continue. So that

that was that's part of plan mode plan mode. We'll ask you questions that it comes up with.

If you don't do plan mode, it has a very small chance of actually asking you those questions.

So loosely, I mean, we get like content from clients when we like that would be plan mode.

So those initial meetings of like what design, what color, what's your brand guidelines? Do you have

that? What's your slogan? Are you going to have a blog? Are you going to have any commerce store?

That's kind of what Claude is doing. That is exactly that is exactly it. But you'll you'll get the

plan. So the thing that we haven't gotten to yet that I wish we did is the actual plan. Right?

It's still asking us for permissions here. Again, it's a little bit annoying with permission

asking, but I'd rather have it asked than just do. Well, especially when it's driven through tokens.

Well, I mean, again, the tokens are too, too bad right now.

Well, see that so this is something and maybe this would be a more

not for this web news. But I've been seeing people that use Claude code or other things. And

they're like, yeah, I spent $200 today on tokens. And I was thinking to myself, I think I maybe

even posted this on X is it's like, people were complaining about the $200 a month.

Chatchy BT plan that came out about a year ago, I think. Now people are spending $200 a day and it's

sort of like, okay, but the $200 a month is not. It's just a little interesting. Now I understand

that this is quote unquote, replacing or augmenting a software engineer. And so maybe there's a

certain amount of value you can affiliate with that. But for the like the indie hacker, I mean,

I don't anticipate like I have friends who worry about $5. I don't anticipate them being like,

I have a great app ID. I'm going to open up Claude code. I'm going to get a $200 a month plan on

them. I'm going to start coding with Claude code and then, you know, paying $200 a day while I

build an app for a week. Oh, the app didn't do well. You know, it's so the indie hackers kind of

still left behind a little bit here. I agree with you. Like the indie hacker that's cost conscious

is definitely left behind because these things are only going up in price, right? Like there are some

cheaper alternatives, but they're also worse. And worse right now is actually a pretty significant

thing. Like only recently did the top tier models become so good that we're really relying on them

to do most of the code. That's very recent like last three months thing experience. So if you can

talking like four months ago, like yeah, we have we have cheaper models that maybe four months ago

would have been top tier quote unquote that you can do in a fairly economic way. And you can still

get a lot done with them, but it's not going to be this like the top top level. And that does

matter quite a bit. So currently, yes, price conscious people are priced out. That will change,

in my opinion, because we're seeing the cheaper models catch up fairly quickly. So I think in three

to six months time, the cheapest model is going to be almost at this level that we're seeing right now.

And therefore will be a very viable solution to for everyone, right? Where you could get get away

with like a $10 subscription actually built something quite good, but we're not there quite yet.

There are again, there are some cheaper models. If you're an engineer and you can play around with

the cheaper models, you're you can still get a lot for with them, but it's not it's not going to be

like as hands off, I should say, it's going to be a little bit more like reviewing a little bit more

reviewing is going to be required a little bit more back and forth with the plan. So right now,

we can see the plan is actually complete. There's a multiple sections to this plan. There's a summary

add four new routes podcast podcast slug blog blog slug. So it's giving you a summary of what it

wants to add. It's going to extract reusable cart components, expand data files. I guess it's

going to extract and scrape more content, which makes sense. It's going to update navigation.

It's going to, you know, this is going to generate a bunch of new files. It tells you which files

it wants to generate. It's going to create new components tells you which components is going to

generate. It tells you some of the data that it's going to select, right? The navigation changes

are going to be slightly slightly different. The navigation is going to be slightly different. The

footage is going to be slightly different. And there's going to be new homepage sections. So

there's a bunch of things and then it talks about design as well. Like what's the podcast design

going to be? And this is where you can again, this is the where you can go back and forth with it,

right? Like if you don't like this idea of all, you know, all, all episodes, search, input,

pill style, like you can tell it to do something different. At this point, before it's implemented

anything, saving you some money. Like this, the plan mode is going to cost a lot less than a full

implementation. Right. So you can, you can have a better back and forth without actually spending

as many tokens. And that's a really keeping here. And then the other really, really big advantage

in the main advantage is the plan mode, this plan will be fed into the model and it'll have a much

more structured to-do list of what it needs to do. So look, but I have to go soon in a couple minutes

actually. But if I click on, yes, an auto accept edits or clear context and auto accept edits,

it's going to start ripping through it. And you're going to see it's actually like doing a to-do list

of things that it got from the plan. And it's going to go step by step through the to-do list

and actually be much more like structured in what it changes. So usually you're going to get

much better output out of these models. If they have a plan and they are able to execute that

plan with a to-do list versus a model that's just going to eat it. And I mean, we saw, like,

this isn't bad. I want to be clear. Like the, the site that it generated without a plan is

probably fine. Like, it's not the greatest website in the world. It's not revolutionary,

but it's fine. Like, it's okay. It definitely looks like an AI-made website, but it is modern.

I mean, it is definitely modern. It looks better when we have because ours is dated.

Yeah. Like, sure. But I guarantee you, if we spent like 20, 30 minutes back and forth going

through it during the plan mode and getting exactly what we wanted out of it, it would be 10 times

better. Almost a guarantee. Do you have a rough estimate as how much that would cost?

It wouldn't, it wouldn't be like maybe a dollar more or something like that. It wouldn't

actually be that much because during the plan mode, it's not that expensive. But I mean,

like the whole project, if you did the plan mode, like, roughly, if you did the plan mode,

if you did the development, and then we were going to deploy this, well, you could easily be covered

for us. It would be easily covered under the $200 subscription. It might even be enough to do

like the $20 subscription on Codex. Okay. Yeah. It wouldn't be that much for this specific project.

It would not be that much. There are some projects that could be much more complex that would need more.

But for the most part, I haven't run into any issues using the Codex subscription. I use that

personally for a lot of things. And the Cloud Code $200 subscription is much more generous than

there. Again, their enterprise level stuff. So it is, it is going to be enough. But yeah, you can see

here, this is this is the to do list that I was talking about. This is really key to these models.

Once it has a to do list, it's much better at keeping on track and following instructions,

obviously, because it can just check these off. And you'll see it as it goes. Once it finishes

something, it'll just like check these off, which is really cool to follow. And it can work on

multiple at once, as long as it knows that there's no dependency. So internally, this to do list

looks like a kind of a to do list with dependencies, right? Like it knows that you can't verify and

spell check until all of this is done. But maybe a couple of these you can run in parallel and

is doing that. Yeah, like maybe spinning up the blog pages and the podcast pages because they're

different design. Yep. That's exactly it. Interesting. Yeah. We're kind of out of time on this part as

well. But suffice to say, like, yeah, we're going to get to a plan. It's going to tell us how it

wants to execute. And then it'll open up the browser similar to how you're seeing it in here.

And it'll ask it'll ask me to sign in. And then it'll be able to take control of that browser.

Impressive and concerning. Yeah. Because why am I here? Yeah. Well, you're here to control it for

now. For now. Yeah.

I don't know exactly. But yeah, hopefully, again, let's go back to the site. Hopefully, this was

information. At least you got to see some of the moving parts. You got to understand it. And I

think we're going to have more of these kinds of episodes where we go through stuff like this

together. Because again, a lot, I think a lot of people haven't touched this stuff quite yet.

You know, even if it is in the zeitgeist and it'll be important for people to see it in action

to understand what it can and can't do as we move forward. And hopefully, like, again, it can do a

lot. It can't do everything. It can do a lot. And we need to be aware of that. Because if you're

kind of putting your head in the sand out there, if you're not using it as much, it could bite you

in the ass. If you're not, if someone else is using it and is able to do stuff faster than you.

Right. I'm not talking directly at you, Matt. I'm just talking directly at like the audience

looking for jobs and stuff like that. It's not really something you can ignore anymore.

Yeah. Like, this is definitely before we used to advertise like a page builder to be quick. And

that was like very quick. Like, that's even faster than a page builder. Yeah. Exactly. And you

speak to it in English, right? Like, you speak to it like a regular, you would speak to a designer.

That's the difference here, too, is like with a page builder, you have to learn the interface.

Here is there's no learning really. There's some like adapting to why? Like, there's some

adapting to the LMS and how to talk to them properly, maybe. But even that's kind of going away.

There's less and less needed as they get better. It becomes more and more natural of how you speak

to them. But unfortunately, I've got to go. Maybe we'll post what we get here as like a screenshot

somewhere on our on our socials. So look out for that. And that's up to you because you're you're

I'm just watching. Correct. Yeah. For sure. All all all posted. But yeah, hopefully this was at

least informational for you, Matt. We didn't do anything crazy, but we kind of scratched the surface

of this a little bit. Well, no, it's good. It's good context because I actually understand the

context of our website. So it's a good place to start for sure. Yeah. That's what I was thinking too.

It would have been cooler with the the API key, I think, who would have been a little bit more

demonstrable, but either way. Yeah. I mean, it worked good even without leaving with that hurdle.

But I think that's it. It can. It can. It can. Sorry. All good. But that's it.

I think that's it. That's it for the web news. Thanks for hanging out. If you liked it. If you

liked this episode, slash if you liked this is sort of a new video for us. Please let us know in

the comment section on various places. If you're on the audio version, there's video version should

be on Spotify, but definitely on YouTube. And we are signing off. Goodbye.

More from HTML All The Things - Web Development, Web Design, Small Business

View all episodes →

How Engineers Stand Out in 2026 (Skills That Actually Matter Now)

HTML All The Things - Web Development, Web Design, Small Business

Apr 14, 202657:47failed

Web News: Is Anthropic’s Mythos Too Dangerous to Release?