Sitemap

Member-only story

Windsurf Model Benchmark: Which AI Builds a Vue TODO App Fastest?

6 min readJun 7, 2025

--

I asked Claude, GPT-4.1, Gemini, SWE-1, and more to build the same Vue + Pinia TODO app in Windsurf. The results? Most models used outdated patterns, struggled with folder creation, and bloated global CSS — despite producing working apps

Windsurf offers many models, and it’s sometimes hard to choose what is the best. Today I’ll show you results of a test, when I ask these models to create a simple Vue application with Pinia store showing TODO list, and with ability to add new items and delete them.

In the test I measured time from hitting enter until starting the server. So it includes time to accept console commands, installing dependencies, and starting the server. While these are mostly same, they didn’t change comparable results in this set. I didn’t exclude these to show realistic time spend on creating the app from user perspective.

The prompt isn’t complicated:

In folder xxx create a Vue app with
Pinia that shows a TODO list. User should
be able to add items, and delete them

Claude 3.7

The model crashed at this command:

--

--

Tom Smykowski
Tom Smykowski

Written by Tom Smykowski

I help startups ship stunning, scalable MVPs—fast. With deep frontend expertise and AI-powered development workflows, I build and audit. Programmer

No responses yet