Peter van Onselen

The Boring Path to Actually Shipping with AI

2025-10-31T08:00:00+00:00

Or: How I Learned to Stop Vibing and Love the Spec”

OMG. This spec driven development process is BORING!

Okay okay, for reals though, following this process of using a spec and a clear breakdown of tasks is tangibly yielding results and making remarkable progress forward in the game.

In the past week I have:

Created 3 new creatures: one that moves fast, hits hard and stuns enemies; another that spawns minions and multiplies when it dies; and one that shoots a bolt that blows things up and destroys land all over the place
Around 9 new abilities created and working
Got the AI to hack some terrible models together so they would be unique enough to be playable
Made the islands generate more interestingly
Completed a horde of UI cleanups
Handled some general refactorings and got a bunch of systems working
Total changes: 52 files modified, +3,718 lines, -474 lines across 34 commits all for about 10 hours effort.

By basically all metrics… productive?

So Where Did the “Boring” Comment Come From?

It comes down to what following the spec driven development process has actually become. Now that I’m being militant about making AI follow a todo list, what I’ve functionally done is put on multiple hats:

Product Manager hat: Created a complete game design doc. High-level, aspirational, covering combat systems, creatures, abilities, victory conditions — the whole vision thing.

Delivery/Feature Lead hat: Took one section (the combat system) and broke it down into actual features. Not just “build combat” but “what does combat need? Movement? Attacks? Status effects? Death?” The unglamorous work of turning vibes into verbs.

3 Amigos hat: Turned those features into a massive todo task list. Every checkbox a micro-commitment. “Add blink ability.” “Implement stun on hit.” “Make multiplying enemy spawn minions.” The kind of granular breakdown that makes you feel like you’re doing corporate sprint planning for your hobby project.

Engineer hat: Actioning the tasks one at a time. No wandering off to make prettier models. No “oh but what if the islands had weather systems?” Just: checkbox, code, commit, next checkbox.

QA hat: Testing behavior. Does the stun actually stun? Does the explosion destroy terrain properly? Do the spawned minions inherit the right stats? The tedious-but-essential validation loop.

The realization: I’ve become …. an entire agile team.

And what that practically means is that I’ve made gamedev into work. My day job. God damn it.

Spent a lifetime developing habits on how to do engineering, and then you get a newfangled tool and you just… follow the process. Good job me. Yay! Right? …Right?!

The Tangents I Didn’t Follow (And Why That Hurts a Little)

I’ll be honest: getting lost in tangents and running away with the vibes is a whole hell of a lot of fun.

In past weeks, I would have absolutely gone off on any of these:

Visual polish: Using all the gorgeous tiles from Kenney’s asset packs to make everything look beautiful instead of just functional. Making each creature feel distinct and characterful instead of “placeholder cube with stats.”

Procedural generation rabbit hole: Diving deeper into Wave Function Collapse algorithms to generate more dynamic, interesting terrain. Making islands that feel hand-crafted even though they’re algorithmic.

Creature personality: Actually modeling unique designs for each bot. Giving them visual identity, animations, character beyond their mechanical function.

Worldbuilding: Fleshing out the lore of the different factions. Their motivations, their aesthetics, their place in this weird sky-island world I’m building.

These are the fun parts. The parts where you lose track of time because you’re following curiosity instead of a checklist. The parts that make gamedev feel like play instead of work.

But here’s the thing: none of them get me closer to a playable game.

They’re all polish on a foundation that doesn’t exist yet. They’re the dessert when I haven’t finished the vegetables. So the spec says: not now. Stay focused. Ship the MVP first.

It’s the right call. I know it’s the right call. Oh heavens please let this be the right call …

And I hate how boring the right call is.

The Discipline vs. Fun Paradox

Being strictly disciplined with myself about how to dev with this tool is super productive. Lots of forward momentum in an actual direction is really fantastic.

And also… boring.

There’s something deeply satisfying about seeing the commit graph fill up. About checking off todo items. About watching the line count grow in a structured, intentional way. It feels professional. It feels like I’m actually building something instead of just playing around.

But it’s missing that chaotic energy that made the early weeks of this project so intoxicating. The “what if I just try this wild thing?” moments. The tangents that turned into features I didn’t know I needed.

The spec process works. It’s just not romantic.

Where I’m Headed

My plan is to stay disciplined until I hit what I’m calling an “exit point” — a milestone where the game is functioning just enough to validate the gameplay and experience. Right now, that means:

2 unique decks that feel different to play
Basic strategy cards that offer meaningful choices
Fog of war (because exploration matters in a tactics game)
A simple victory condition

It won’t be done. But it will be playable. And testable. A real artifact I can put in front of someone and ask: “Is this fun?”

Following this process is giving me something I’ve never had before in side projects: predictable, consistent progress.

Not explosive bursts of inspiration followed by month-long abandonments. Not chasing vibes until I hit a wall and lose interest. Not 10,000-line notebooks that collapse under their own weight.

Actual, measurable forward motion toward a concrete goal.

The Bottom Line

Is it boring? Yes.

Is it working? Also yes.

And maybe that’s the trade I need to make right now. There’s a time for tangents and vibes — I spent weeks in that mode and learned a ton. But there’s also a time to put your head down, follow the checklist, and actually finish something.

The irony isn’t lost on me: I spent four months learning how to use AI as a collaborator, only to discover that the real unlock was bringing back all the boring engineering discipline I use at my day job.

Turns out “vibe coding” still requires structure. Who knew?

Next week: More abilities. More discipline. More checkboxes. And hopefully, one step closer to knowing if this game is worth making at all.

P.S. If you’re following along with this devlog and thinking “wow, this sounds like he’s sucked all the joy out of his hobby” — yeah, a little bit. But also: I’m actually building something now instead of just dreaming about it. So maybe boring is the price of shipping.

We’ll see how I feel when I hit that exit point.

AI Spec Driven Development

2025-10-30T08:00:00+00:00

A brief summary of what I have learnt

This is an exert of From AI Skeptic to Constant Collaborator: What I Learned Vibe Coding.

Practical Workflows

Through trial and error, I developed specific patterns to manage AI’s weaknesses:

1. The Planning Folder Pattern Keep numbered specs (1-initial-feature.md, 2-pay-by-discard.md, etc.) that document feature discussions. These become persistent context across sessions.

2. The Todo Accountability System Break specs into granular checkbox lists. Use them to hold the AI accountable during implementation.

3. The Git Save-Scumming Strategy Commit frequently. AI will overwrite working solutions without memory of what worked before.

4. The Role-Based AI Selection

ChatGPT: Brainstorming, exploration, asking “what’s wrong with this design?”
Claude: Implementation, code review, pair programming
Copilot/Codex: Ticket-style work where you hand off and come back later

5. The Discipline Override Set hard rules to counter AI’s momentum:

Force refactor cycles
Write tests even when AI makes it feel unnecessary
Question every tangent: “Is this the MVP?”

Minimum Viable Prompt Literacy

I have no idea what a “perfect prompt” looks like. But I know one rule that consistently works:

No matter what you ask the AI to make, the last sentence should be: “Ask me questions.”

Get the AI to ask you questions. Ask it “what am I missing?” type questions. This back-and-forth is where the real value emerges, not in the first response, but in the dialogue.

I Actually Stayed On Task (For Once): A Dev Miracle

2025-10-24T08:00:00+00:00

Breaking news: Developer completes planned features … who would have thought?

Remember all those high-minded ideas I had about staying on task with AI-assisted development? All that big game I’ve been talking about “this is how you do dev with an AI and keep it on task” and “this is the way you got to do it”?

Yeah, about that.

If you’ve been reading the wonderful adventures of the tangent king over here, I know what you’re thinking. He can’t do that.

For context: I’m building Horizon’s Edge: a turn-based tactical wargame inspired by NetStorm where floating islands battle for control of the skies. I’ve been developing it with AI assistance in Godot, and staying focused has been… a journey. Previous highlights include: diving into Wave Function Collapse procedural generation instead of finishing game rules, and a grand refactor that ate five entire evenings because I accidentally committed the .godot cache folder.

Well, I will have you know… that this week I set out to get a bunch of combat systems up and running. Working tip top. No funny business with wave function algorithms or hexes or anything! And I managed, for the first time in almost 4 months, to stay on task and not meander (too much) and actually get some things completed.

I now have 3 creatures, each with at least 1 or more special abilities and those abilities actually work. I know! I am surprised too. I’m meandering my way through my todo list and this week I have a test mage with 5 abilities actually working:

Blink - Teleportation ability
Ethereal Phase - Phasing ability
Arcane Missile - Ranged magic projectile
Divination - Detection/reveal ability
Mana Drain - Resource manipulation ability

I might have wandered off and made one of the spells do terrain destruction too. Arcane Missile is the first ability in the game that actually destroys terrain rather than just building it. I just love the idea of the battlefield being dynamic and changing under the players’ feet. Right now it wipes out the hex the target was standing on, plus a 50% chance on each of the 6 surrounding hexes. The idea is that if a creature is no longer standing on anything, even if it has a lot of health, it will plummet to its death. It should create tension around expansion and positioning. Do you risk getting close for that attack if one wrong spell could drop you into the void?

I also figured out how to get rounds and turns working together (this was needed for Ethereal Phase). And I might have spent some silly time hacking in a few animations to make the effects a bit more obvious when they happen.

Admittedly none of this is “let other people play with it” yet. And it doesn’t yet have a win condition. But it is meandering in a direction.

See? Like I told you. Todo list! Let’s see if I can do it again?

Next up on the todo list: Three more creature archetypes, each with their own special abilities.

test_voltage_bot with Overcharge and Turbo Boost abilities.
test_biomass_spawn with Regenerate and Spawn Swarmlings.
test_flux_walker with Chaos Bolt and Lucky Strike.

…assuming I don’t get distracted tweaking that wave function collapse algorithm again.

Incidentally, the stats for the week:

32 files changed
3,029 additions
508 deletions
Net addition of ~2,500 lines of code

For an hour and a bit each night, the progress here is just down right fierce!

From AI Skeptic to Constant Collaborator: What I Learned Vibe Coding

2025-10-20T08:00:00+00:00

The Question That Started Everything… am I going to lose my job?

The Question That Started Everything

How does one actually vibe code? And the follow-on questions that kept me up at night: Is it any good? Can I actually generate real code with this? Is this going to take my job? Am I running out of career runway?

Around the start of June this year, I was an AI optimist while not really engaging with it. I used ChatGPT, DeepSeek, and Anthropic’s Claude, ran some thinking through them, maybe did basic searches. Honestly, I wasn’t using it in any meaningful way. Functionally, I was just playing with it. Nothing more.

Then I got stuck on a problem. By simply following my curiosity, I went from not having a clear idea of what I wanted to accomplish with AI to actively using it as a constant collaborator across multiple domains of my life. My approach to AI has fundamentally changed over the past four months, and it continues to evolve.

The Catalyst: Magic the Gathering (Naturally)

TLDR: I tried to make a Jumpstart cube. ChatGPT couldn’t solve it. Co-pilot couldn’t solve it. Co-pilot vibe coded a solution that kinda worked. I vibe coded a new solution that actually worked.

I couldn’t get the damn thing out of my mind, so I vibe coded this portfolio site to document what happened. Then, while working on the cube, a board game idea struck, and I couldn’t get it out any other way except by building it with AI. That board game has somehow morphed into a video game that’s far too complicated for the “get an MVP into prod fast” approach I keep trying to follow.

The time investment: 1-2 hours a day, either in the morning before work or while watching TV with my wife in the evening. This became an all-consuming obsession for four months. I sacrificed learning urban sketching, which I’d spent the first half of the year actively pursuing.

The cost: I’m paying £16/month for Claude Pro and £20/month for ChatGPT Pro. I use Claude as my primary coding assistant and switch between ChatGPT and Claude for thinking through problems. It’s worth it, without AI, none of these projects would exist.

Pre-AI, my side projects were timeboxed to a couple of days and small, achievable problems. Anything more would rapidly collapse under its own weight, too much code, too little time. Basically I didn’t do tech side projects. AI changed that equation entirely.

What I’ve Learned: The Core Insights

AI is a Tool and a Multiplier

You have to treat AI not as a magic box that will automatically solve whatever you hope it does, but rather as another person you’re working with over Slack. If you tell a coworker “make me a feature!” you can’t be upset when they return junk.

The best way to use this tool is to assume it doesn’t actually know what you want. I’ve found the most effective approach is to start conversations with lots of negative validation questions: What am I missing? What could be improved? Be critical. Be objective. Get the AI to shoot holes through your ideas.

Once you’ve had this conversation, write that plan to file. Congrats, you now have a high-level plan. This becomes useful context for future chats. However, this alone won’t give you consistent, reasonable, progressive progress. Because basically, AIs like to write code, and they write an awful lot of it.

So get it to make a todo list with a painful amount of tick boxes.

When building, use that todo document to hold the AI accountable. It makes testing and building more predictable and manageable.

The “Junior Engineer” Mental Model Goes Deeper Than You Think

Treating AI like a junior engineer isn’t just about tone, it’s about workflow. Through my projects, I discovered I needed to:

Use different AIs for different roles: ChatGPT for exploration and brainstorming, Claude for implementation and code review
Create specs as “shared memory”: Documentation that gets committed to the repo so the AI can reference it across sessions
Break work into granular todos: Not just for you, for holding the AI accountable to what actually matters
Pair with it through code review: Not just generation

This evolved from my Magic cube project where I had specs numbered 1 through 15, each documenting a feature discussion. These weren’t outputs, they were context that survived beyond individual chat sessions.

The Dangerous Patterns: What They Don’t Tell You

The Refactor Paradox

Here’s something crucial I learned the hard way: AI accelerates the “green” phase so much that you skip “refactor,” leading to massive technical debt.

During my Magic cube project, I went from manually patching decks to having a 10,000-line IPython notebook that was completely impossible to understand. I had hit cognitive overload.

As an ardent TDD advocate in my day job, I realized I was missing two critical pieces of the red-green-refactor cycle: I was just writing code. No tests. No cleanups. Rookie mistake.

I had to start from scratch, consciously embracing a build-and-refactor loop, following the code smell patterns that years of clean code practices had drilled into me. AI doesn’t just multiply your output, it multiplies your technical debt if you’re not careful.

The game dev project repeated this pattern. I’d run git ls-files | grep '\.gd$' | xargs wc -l and see files well over 2k lines. I’d missed refactor cycles again.

The Tangent Amplification Problem

AI doesn’t just enable scope creep, it actively encourages it by making every side quest feel achievable.

My board game project is the perfect example. I set out one week to work on creature combat. By the end of the week, I had:

Created test creatures
Built movement and attack systems
Added height-based defense
Implemented dice roll combat
Created a radial menu for unit actions
Downloaded 3D models from Kenney.nl
Rebuilt roads with proper models and rotations
Added a blink ability

One feature became an ecosystem. And here’s the thing: it is beyond exceedingly simple to wander off on completely unrelated tangents when the AI makes everything feel possible.

I kept telling myself “get an MVP to prod fast” while simultaneously building procedural island generation with Wave Function Collapse algorithms.

But here’s why I didn’t stop: AI provides so much momentum that even when it’s frustrating, you can think about the problem slightly differently and feel like you’re making progress. When direct AI generation hits a wall, I switch to having it build the broad structure while I manually tweak settings. This makes working on side projects genuinely fun in a way they haven’t been before.

The momentum AI provides is a double-edged sword. It keeps you engaged through the frustration, but it also keeps you building when you should be stepping back and asking “is this the right thing?”

The Direction Problem: AI’s Spatial Blindness

Some tasks reveal AI’s sharp limitations. During my road system implementation, I discovered that AI spatial reasoning is terrible.

When work involves orientation, rotation, or physical space, the AI’s sense of direction doesn’t match how the world is rendered. Trying to explain rotations in a way that makes sense to both of us is like teaching a goldfish to drive.

And if the AI ever accidentally gets something right, it will immediately overwrite it in the next change.

My eventual workflow:

Get the AI to build the big stuff, toggles, switches, base structure
Manually go through and tweak everything myself

This led me to develop what I call “Git save-scumming”, treating Git like a video game save system because AI will thoughtlessly overwrite correct solutions without remembering what worked.

The Momentum Trap

AI gives me the same benefit I get from using Audible for reading books: momentum. It’s a lot easier to keep working on a side project with AI than without, especially when you have no time at all to do the work.

But here’s the tension my blog posts reveal: momentum without direction leads nowhere useful.

I built 24,000 lines of Python code for a board game that probably should have been paper prototyped first. I kept reminding myself to “get an MVP to prod fast and learn lessons,” but the AI made it so easy to keep building that I kept following the fun instead of following the plan.

The momentum is addictive even when it’s pulling you away from your goal. You have to be disciplined about direction, or you’ll end up with beautiful code for the wrong thing.

The Transformation: Four Months, Everything Changed

The most remarkable thing about this journey isn’t what I built, it’s the speed of transformation.

June 2025: AI optimist, barely engaging October 2025: 24k lines of game code, active collaborator across multiple projects, writing blog posts documenting the journey in real-time

Getting to the “treat AI like a junior engineer” mindset took about 2-3 weeks. I had to unlearn the “AI is magic” assumption and figure out how to actually use it.

This wasn’t gradual learning, it was catalytic. Each success made the next leap feel possible:

Magic cube problem → vibe coding solution
Couldn’t stop thinking about it → portfolio site
One blog post → entire blog series
Board game idea → 24k lines of video game code
Video game reimplementation → another 24k lines of video game code

The cascading confidence is real. Once you see AI help you solve one “impossible” problem, you start seeing possibilities everywhere.

Bringing It Back to the Day Job

The spec-driven workflow I developed through these side projects has now become how I work professionally. I take tickets and reframe them into specs with task breakdowns. I use AI to analyze complex codebases I’m barely familiar with.

Right now I’m refactoring a monolith written in Go into a commons library with five microservices, using the AI spec-driven workflow with AI-assisted code development, working in small increments. Everything I’m doing, I learned from these side projects.

The irony: The Economist (where I work) has embraced AI tooling internally, while engineers in general remain reticent. I get it, I was there four months ago.

But here’s what changed for me: I still do traditional hand-crafted coding in my day job. I regularly work through code katas, which are fun and enjoyable in and of themselves. AI hasn’t replaced my coding skills, it’s multiplied what I can accomplish when I need to move fast or explore unfamiliar territory.

Practical Workflows That Emerged

Through trial and error, I developed specific patterns to manage AI’s weaknesses:

1. The Planning Folder Pattern Keep numbered specs (1-initial-feature.md, 2-pay-by-discard.md, etc.) that document feature discussions. These become persistent context across sessions.

2. The Todo Accountability System Break specs into granular checkbox lists. Use them to hold the AI accountable during implementation.

3. The Git Save-Scumming Strategy Commit frequently. AI will overwrite working solutions without memory of what worked before.

4. The Role-Based AI Selection

ChatGPT: Brainstorming, exploration, asking “what’s wrong with this design?”
Claude: Implementation, code review, pair programming
Copilot/Codex: Ticket-style work where you hand off and come back later

5. The Discipline Override Set hard rules to counter AI’s momentum:

Force refactor cycles
Write tests even when AI makes it feel unnecessary
Question every tangent: “Is this the MVP?”

What About the Code Quality?

Let’s be honest: the code AI generates can be good, can be overly verbose, tends toward duplication. But it can be nudged in the right direction quite easily.

The game currently works. It’s not feature complete, not even a pared-down, super-trimmed version. But it’s playable, testable, and iterating forward.

That’s the trade-off: you get speed and momentum in exchange for code that needs shepherding. You’re not writing every line, but you’re still responsible for the architecture, the patterns, and the quality.

Minimum Viable Prompt Literacy

I have no idea what a “perfect prompt” looks like. But I know one rule that consistently works:

No matter what you ask the AI to make, the last sentence should be: “Ask me questions.”

Get the AI to ask you questions. Ask it “what am I missing?” type questions. This back-and-forth is where the real value emerges, not in the first response, but in the dialogue.

It took me 2-3 weeks to figure this out, but once I did, everything clicked.

The Bottom Line

AI hasn’t replaced my thinking, it’s changed how I work. The best analogy I’ve found: it’s like pairing with a junior engineer who:

Never gets tired
Has read everything
Has no memory between sessions
Will confidently suggest terrible ideas alongside brilliant ones
Makes everything feel achievable (which is both blessing and curse)

You have to bring the discipline, direction, and judgment. The AI brings speed, exploration, and momentum.

After four months of solo exploration,watching YouTube videos, AI Engineer conference talks, and lots of trial and error,I’m not worried about my career ending. I’m worried about not learning these tools fast enough.

Why This Matters (and Why I’m Writing This)

I’m writing this for two audiences:

Future me: So I can succinctly explain “this is what I learned” when the details fade.

You: To give you an idea of how to approach AI development that’s more than the nebulous “what the hell do I do here” feeling I had in June.

This is an invitation. Not a tutorial, not a manifesto,an invitation to experiment, to treat side projects with these tools as “learn how to AI” projects, and to discover your own patterns through building.

Because here’s what I know now: the developers who learn to work effectively with AI aren’t going to replace the ones who don’t. They’re going to outpace them by an order of magnitude.

The question isn’t “will AI take my job?”

The question is: “Am I learning to multiply my effectiveness, or am I just playing with shiny tools?”

For me, the answer finally became clear somewhere between a Magic the Gathering cube and a procedurally generated sky island wargame.

I’m building the plane while flying it. And documenting the journey as I go.

Because maybe, just maybe, someone else is standing where I was in June, wondering “how does one actually vibe code?”

And maybe this helps them take the first step.

The Road to Combat Is Paved with Tangents: A Devlog

2025-10-17T08:00:00+00:00

I set out to make a combat system. I returned with roads, models, and a blink ability…

Tangents, Roads, and Blinking Archers

okay okay okay, so I know last time I was all:

“I am going to work on the combat system.”

and I meant it. I really did.

I started the week’s development figuring out how to get a second creature working — this time one that could shoot! and it shot! with animation! and a lovely red number floating overhead when it hit.

Then I spent some time trying to convince the AI to make an archer… which came out as a cylinder and a torus mashed together like some long-lost relic from Lord of the Rings.

Somewhere between the floating numbers and cursed geometry, I ran into a movement bug where the creature could only move upwards but never downwards — so, you know, just another normal day working on a game with a super-powered AI as a pair.

And that’s the problem with having a super-powered AI as a pair: it is beyond exceedingly simple to wander off on completely unrelated tangents. Which is, of course, what I did.

The Tangent: Shiny Hexes ✨

When I was working through a game dev course on Jumpstart to 2D Game Development: Godot 4.4+ for Beginners last year, I came across a really cool site: kenney.nl.

They’ve got models, textures, and all sorts of game assets. And wouldn’t you know it, I found a bunch of hex models that perfectly matched the vibe in my head.

And just like that, all thoughts of combat systems and specs and rational planning evaporated.

No, now was obviously the perfect time to get models and textures in. So I immediately downloaded the GLB files and started plugging them into my wave function collapse algorithms.

Instant vibe shift. The world suddenly looked like something. I’m not using all the models yet… but give me time 😄.

The Roads to Madness 🛣️

With proper models in hand, I finally turned to something that’s been quietly tormenting me for weeks: roads.

Now that I had actual path models to work with, I thought this was going to be a cinch.

Nope. No. Oh hell no.

This is the kind of task that is inherently infuriating to get an AI to handle. Its sense of direction doesn’t match the way the world is rendered, and trying to explain rotations in a way that makes sense to both of us is like trying to teach a goldfish to drive.

And if the AI ever accidentally gets something right, it will immediately overwrite it in the next change. Because of course it will.

My eventual workflow:

Get the AI to build the big stuff — toggles, switches, base structure.
Then manually go through and tweak everything myself.

Thank god for Git and my obsession with save scumming my way back to a sensible state.

Meanwhile… Combat System? 🫣

Hang on though. Wasn’t I working on a combat system? Damn it.

Okay, where was I?

Ah yes — my ever-growing planning folder to the rescue.

I’ve developed a habit of getting the AI to output the result of any long feature discussion into a spec or planning file that gets committed into the repo.

Which is how I ended up with… about a dozen specs.

Some were implemented, some were “I realized this was a tangent and saved myself,” and some were very much still alive. After some reorganizing, I was down to a high level planning doc and a solid todo list that was a solid active plan.

➜  horizons-edge git:(main) ✗ ls planning/6-
6-implementation-todo.md
6-test-deck-archetypes-energy-abilities.md

Then came the fun part: getting the AI to validate how accurate my current TODO list was against the plan — and then, obviously, not trusting it at all and just starting to build anyway 😄.

Back on Track: Blink ✨

At last, finally, I was back on combat system track. And the first order of business: give my test creature a blink ability.

Let’s see how far I get next week.

Until then, I hope you’re enjoying whatever tangent is currently distracting you from your side project. I salute you, fellow tangent adventurers. 🫡

📝 Progress Summary Since last time.

🚧 Road System: Added straight roads, corners, intersections, and 3–5-way connections. Complex rotation logic, road-to-hex connectivity.
🧱 3D Asset Library: Integrated Kenney Hexagon Kit (180+ models, textures, documentation).
🏹 Combat & Abilities: Projectile system, Archer & Scout cards, ability system improvements, radial menu, energy planning.
🖱 Input & UI: New input controller with hex highlighting, better radial menu, multiplayer safeguards, grid visualization.
🧭 3D Model Integration: hex_tile_model_config.gd, updated rendering, road path visualization.
🗂 Documentation: New spec for energy payment (spec 7), reorganized planning docs.

Scope Creep Chronicles: Creature Combat Devlog

2025-10-10T11:00:00+00:00

Crouching Creature Combat, hidden tangents…

Combat is a cornerstone of the game’s core loop — it’s a wargame after all. If the creatures can’t attack or move, nothing else really works. So it felt like the perfect place to focus on getting one thing working at a time.

You know that feeling when you promise yourself to build just one simple feature… and then accidentally end up building half a combat system? Yeah. That happened.

I set out to make creature movement and attack work. Just that. And technically… I did.

But also…

So What Happened Since Last Time?

Created a test creature
Added stubs for abilities and creatures
Can play the creature onto the world
Creature can move
Creature can attack (with a cheeky little “nudge” animation!)
Creature is persistent in the world
First version of a combat system is in place
Height contributes to defence
Combat uses dice rolls
Added a UI element showing active units

Somewhere between “just movement and attack” and the inevitable barrage of scope creep that is this list… I may have overshot my initial scope a tiny bit.

The Big Question I Didn’t Think Through

When I first added creature cards, I realised something obvious but important:

“What happens to the card once the creature is played?”

Does it:

🃏 Get discarded like a one-off spell?
🏗️ Stay permanently on the battlefield, RTS-style?
🧍 Act more like an ally in Marvel Champions?
❄️ Or a unique character like in Undaunted?

I wasn’t sure which direction would be more fun, so I had a long chat with the AI to explore the implications of each — and in classic not-thinking-it-through fashion, this has the feeling of being a dramatic question that might shift the whole damn game all over again. Hello scope creep, my old friend. I’m trying to keep you in check this time, I promise — setting clearer boundaries for what belongs in the MVP and what gets kicked down the road for future me to deal with.

The key takeaway: I want players to get cards out fast, keep them on the board, and have those creatures feel unique. So the “remove from deck” idea (like Undaunted) felt too punishing. RTS swarm spam didn’t fit either.

The sweet spot seems to be something like Marvel Champions: unique characters with meaningful impact.

A Fun Rabbit Hole: Rewarding Combat

This led me to thinking: what happens when you win a combat?

That’s when Battle Realms came to mind — an underrated, vibey-as-hell RTS from way back, with samurai, werewolves, vampires, and mystical Eastern weirdness. It used combat to generate resources for upgrades.

I love that idea. It adds stakes and momentum to battles without punishing players for losing units. And in yet another stunning moment of self-realisation, I promptly undid all my changes related to this feature — because it was scope creep of the highest order!

UI Tangents and Side Quests

UI clarity is what really shapes how the game feels to play, so making it intuitive is critical. When the interface gets in the way, the whole experience slows down, so I’ve been extra sensitive about how these elements work.

Of course, I also got distracted by UI.

Playing a card and then having it disappear so you can see where to place it just makes sense. So I ended up tinkering with creature display windows and UX polish far beyond what I planned. I also found myself wrestling with the active unit display — trying (and failing repeatedly) to get Claude and Codex to understand exactly what I wanted changed. It turns out it’s surprisingly difficult to use plain English to precisely specify how a UI element should behave.

But I did get a shiny radial menu for unit actions out of it…

The Actual Goal? Achieved.

At the end of the day, I really did achieve what I set out to do:

✅ Creature movement and attack — working (ish) 🌀 Plus a whole lot of unexpected side questing, because game dev is never a straight line.

Next Steps

Onward to the next rabbit hole.

This sprint was messy but productive — a reminder that even the tangents feed back into making the game feel more alive.

Refine creature persistence and combat flow
Continue working through the massive creature card spec doc — I barely made it through the first two abilities, and I still want to explore resource generation and upgrades from combat wins
Keep iterating without accidentally building a full-blown RTS 😅

TL;DR: I aimed for one thing. I built a small ecosystem. I’m proud, a little tired, and very excited about where this is going. And I’ve learned more about using AI tooling than I ever imagined.

Cards, Chaos and the subtle art of Claude Code

2025-10-03T11:00:00+00:00

Last Time on Madness Boulevard…

Last time on the wonderful adventures of a random walk down the boulevard of madness in making a ~~board~~ video game, I was busy refactoring cards and trying to get them to actually work.

Now I have working CARDS! They discard! They look devy! They click! They discard! The UI is janky, but hey—it works. I’ve got… 3 cards. Three. OMG. Only 3 cards. :mindblown: This is going to be harder than I thought (obviously).

Factions of Madness

When I nudged Claude toward “cards instead of buttons” as a way to unify the gameplay, I immediately got ahead of myself. Naturally, I dreamed up two factions, a bit of lore, some shiny card concepts, and before I knew it… two half-baked 20-card decks plus a half-baked strategy deck. It was going to be over 9000!

The concepts were fun:

Nezumi Swarm: Honorable rat samurai + endless undead swarm tactics. Weak but unstoppable. Multiplying and spawning more and more. For the SWARM!

Aetheric Empire: Victorian British Empire × Girl Genius mad science romance. Cyborgs, mechs, airship supremacy, love and madness.

The Romance. The Swarm. It was glorious.

Overload Incoming

Then reality hit: instead of my usual TDD-style “one thing at a time, working confidently,” I was doing everything, everywhere, all at once. Unsurprisingly: overloaded. Cue the “who’s on first, what’s on second, I don’t knows on third” situation.

So, the logical next step was to pause the chaos and implement turn/round structure. Makes sense, right? …Except instead I doubled back to cards. Because of course I did.

Planning Docs, or Madness in Writing

Here’s the interesting part: working with Claude (and Codex) has led me to generate deep planning docs for features. I’ll spend hours iterating back and forth, treating them like living documents. Each doc builds context for the next, and since I’m writing them inside the codebase, they get deeply grounded in the actual work. It’s been a surprisingly useful feedback loop.

If you’re bored, you can peek at the planning doc for Turn and Round Structure. But spoiler: instead of building turns/rounds like a sane person, I realized maybe… just maybe… I should start small. Like, test cards first. Lego pieces before Royal Albert Hall.

Small steps. Right?

Next Steps

So the plan now: get a test deck with basic cards sketched up and playable.

Next steps: MORE CARDS!

I’m focusing on functional requirements like:

Complete archetype coverage: Territory, creatures, spells, interaction, combos
Energy-gated abilities: Powers tied to energy types
Ability framework: Up to 4 abilities per creature
Energy-scaling bonuses: Special effects if fueled by the right energy

And covering these card types:

✅ Territorial Claim – Place core nodes
✅ Terraforming Push – Expand island
✅ Road Network – Connect islands
✅ Build Generator – Energy production
❌ Creatures with abilities
❌ Combat mechanics
❌ Opponent interaction
❌ Energy-specific bonuses

You’d think I’d take it easier this time, right? Nope. My card planning doc is 1,299 lines long. But this time, I’m getting Claude to implement one card at a time.

Let’s see how that goes.

Will I actually build a turn structure next time? Or just make even more cards? Until then, may your rats be honorable and your airships only slightly on fire.

Also, if you are counting, I am now up to using Claude Code, Claude, Codex, ChatGPT and Gemini to do this project ….

When Refactors Eat Your Game (and Your Evenings)

2025-09-29T11:00:00+00:00

From Buttons to Cards: A Sideways Journey

The Grand Refactor!

Last time we chatted I was making excellent progress with the video game implementation of a board game I’m randomly working on. I had island placement, roads, chunking, waveform-collapse procedural generation. It was smoking. I was learning how to get Claude to code well. The way of the Force was strong.

And then I ran a little terminal command:

$ git ls-files | grep '\.gd$' | xargs wc -l

…and what I saw was a bunch of files well over 2k lines. Clearly I’d missed a refactor cycle (or three).

Meanwhile, I’d spun up a Claude sub-agent to do research on tech topics. Naturally, one of the first queries was: “best practices for Godot.”

The results:

Use a signal-based architecture (which matched my half-remembered notes from a Godot course last year).
Use singletons for global access and cross-cutting concerns like game managers and event buses.
And, buried near the bottom as a tiny afterthought: “Do not include the .godot directory in your repo.”

Guess what I had in my repo.

I deleted the folder… and unleashed a week of refactor drama. Turns out the AI had written multiple cyclic dependencies that only “worked” because the cache made it look fine.

Cue the AI loop:

“To fix cyclic deps, remove types.”
“No wait, add types.”
“Think harder.”
“Ultra-think.”

This went on for hours until I forced a combo:

do ultrathink,
do not remove types,
change the dependency pattern,
refactor large files into smaller files.

That finally got things moving. Between Claude and Codex, the refactor slogged forward. But it cost me five evenings of work.

When the dust cleared, I… immediately rethought the UI.

Up to now, core gameplay actions (building islands, adding chunks, placing roads and buildings) were just buttons. Cards existed, but only for creatures and spells. Then I thought: what if everything was a card?

That would unify the control system: play a road, play an island, play a creature — all from cards. It lays the groundwork for an MVP, and it leans into what I love about games like Undaunted and Memoir ’44, where your options each turn come from your hand.

So naturally, I dove straight into a UI refactor.

At least this time I started with wireframes in my sketchbook, thinking mobile-first. Right now the UI shows:

your resources,
your current hand of cards,
a test deck with only the working cards,
and the ability to create islands by playing cards and paying with other cards.

To call this “progress” would be… generous. It’s more of a profound sideways move. But that’s coding sometimes — discovery through chaos.

The one genuinely new thing I’ve started doing is feature planning with Claude. Basically it goes like:

I describe the feature
Claude grills me with clarifying questions
I save that Q&A as documentation
then Codex implements
Claude reviews.

Complete aside: I also started playing with OpenAI’s Codex to see what it’s like. Codex feels like an agent you hand a ticket to and then 30 minutes later it drops a PR on your desk. Claude, on the other hand, is more like a junior dev you pair with — lots of back-and-forth, explaining, nudging, but it stays with you in the problem.

This flow has meant:

less back-and-forth,
better documentation,
and an easier way to track what I’m actually building.

Which is good, because this project has been the wildest, flux-crazed coding ride I’ve ever been on.

What’s next

Honestly? No idea. Probably “make more cards actually work.” and “make them look more informative than tiny text you got to squint at to figure out what you are trying to do!”

Right now I’m wrestling the road network into the card system. The “play a card → build roads” flow keeps fighting me. It works as a button; it sulks as a card.

Near-term plan (aka cope notes):

Make roads a first-class action. Card triggers a BuildRoadNetwork action with inputs (start, end, cost), not a kitchen-sink utility.
Signals over reach-ins. Card emits “build-road-requested”; the road system owns the how.
Pay-before-play. Validate cost and constraints first, then build; if it fails, no state touched.
Isolate the graph. Get road-graph ops (connectivity, loops, costs) tested in isolation so I’m not debugging UI + rules at once.
Tiny wins. One card → one road segment → then chains → then networks.

If that behaves, I’ll circle back and keep converting the old button-y stuff into cards until the whole game flows from a hand. If it doesn’t… I will perform ritual sacrifices to the debug gods and try the other other thing.

Building AI Before Building the Game: A Cautionary Tale

2025-09-07T11:00:00+00:00

The curious case of Sky Islands and the Endless Roads leading nowhere ….

Lost in the Sky Islands: My First Attempt at AI for a Board Game

I’ve been on a mission to design a board game — Sky Islands — and build a simulation of it in Pygame so I can playtest it digitally.

As a seasoned engineer with 14+ years of experience, you’d think I’d take a well-organized, disciplined approach to building my game: a clear Trello board, carefully prioritized tasks, and a smooth, methodical path from prototype to polish.

Except… no.

Instead, I’ve been wandering from random idea to random idea, in a meandering creative hike with zero map and no water bottle.

Lately, I’ve been closing in on a pretty complete rules implementation: combat is figured out, island control works, magic buffs and damage systems are in place, and the Pygame simulation is almost bug-free.

Which naturally meant it was the perfect time to… add AI.

Enter the LLM AI Player

Traditional game AI is this slow, careful, deeply technical craft. It takes months — sometimes years — to tune and polish.

So in my infinite wisdom, I skipped all that and said: “Nah. Let’s just use an LLM as the AI.”

Better yet, let’s make it fully agentic. Surely if I give it a prompt and some game state data, it’ll play like a pro.

Here’s part of what I gave it:

You are an AI player in Sky Islands, a strategic card game. 
You control {self.faction_name} with {self.strategy_name} strategy.

Core principles:
- Maintain resource economy
- Protect your base
- Control valuable islands
- Use command points efficiently

Always respond in this exact format:
ACTION: [action_name]
TARGET: [target_if_needed]
REASONING: [brief explanation]

CRITICAL: Only use actions from the "AVAILABLE ACTIONS" section.

It seemed foolproof. Even better, I didn’t have to pay for every token — I hooked the OpenAI API up to a local LLM and got free AI gameplay. Perfect, right?

The Dream… and the Reality

The AI did exactly one thing well: write extremely convincing reasoning for its moves.

Like this:

“I’ll acquire the Roads island to improve connectivity and expand my territory, while also gaining a foothold on the board. Since it’s a low-cost island and cannot have generators built, it won’t disrupt my resource economy. This action will also give me more options for future expansion and defense.”

Sounds smart, right? A master strategist!

Except in practice, it did this for 20 turns straight — acquiring road after road, until it had a glorious chain of roads leading absolutely nowhere.

My digital empire was basically a medieval fantasy version of the UK’s motorway network.

Turns out, getting AI to work right is hard. Who knew?

What Went Wrong

I had made two big mistakes:

No actual game tools. I was giving the AI text descriptions and asking it to pick actions, but it had no real way to interact with the game world. I should have built proper MCP tools so the AI could act, not just talk.
Building before understanding. I jumped into AI development before I had a clear idea of what I was even making. It was scope creep disguised as “innovation.”

Basically, I was trying to build a self-driving car while still figuring out how to assemble a bicycle.

Takeaways

If you’re going agentic, build tools first. Don’t just toss prompts at an LLM and hope for intelligence. Give it structured ways to interact with the game world.
Figure out what you’re building before adding AI. AI shouldn’t be a replacement for clarity — it should come after you understand the game design itself.

For now, my AI dreams are on pause. Back to the basics: fixing my Pygame simulation and actually finishing the game rules.

Because the real Sky Islands challenge wasn’t the enemy factions, or the balance issues, or even the road spam.

It was me.

I Just Wanted to Make a Board Game and Now There Are Procedural Islands

2025-09-07T11:00:00+00:00

Why I Stopped Worrying and Learned to Love Chaos ….

So… I’m supposed to be making a board game. At least, I think I am. Probably? Maybe?

What started as a simple idea about floating islands locked in war has slowly mutated into something far stranger.

Where We Left Off

If you caught my last post, Building AI Before Building the Game: A Cautionary Tale, you’ll know I’d been trying to simulate Sky Islands in Pygame. The goal was to playtest the rules, track combat and control, and maybe get some balance insights. Then I got ambitious and plugged an LLM in to act as an AI player.

That went… poorly. The AI spent twenty straight turns building roads that went nowhere, while writing deeply convincing justifications for every decision. It was like watching a medieval city planner gone mad. Funny? Absolutely. Useful? Not so much.

The Escalation: From Boring to Beautiful to Wild

After that fiasco, I stepped back and looked at my simulation. It worked, technically—but visually, it was about as inspiring as a spreadsheet. Dry. Functional. Boring.

I thought: It just needs a little something …

So naturally I did the completely logical thing of iterating on the base concept, creating some physical components and playtesting the hell out of this… right? No.

Instead I:

Downloaded Godot.
Built a basic isometric grid.
Added hexes stacked on other hexes.
Added colours and a cloud-like background.
Added roads.

And, then naturally, decided to throw in Wave Function Collapse for procedurally generated islands.

(Wave Function Collapse, for the unfamiliar: you feed an algorithm a set of tiles and rules for which tiles can sit next to which. It then generates new maps that look organic but still follow those constraints. Think Sudoku meets Minecraft.)

Each new step felt like a tiny, harmless addition—until I looked up and realized I was knee-deep in procedural generation instead of working on my board game.

The Identity Crisis

At this point, I had to stop and ask myself … what the hell am i doing?

Am I still making a board game?
Or is this now a video game prototype masquerading as a board game?

Because here’s where I stood:

A physical prototype with real pieces, great for fast iteration.
A digital build with pretty visuals, dynamic maps, and increasingly complex systems.

Two games. One idea. Total confusion.

What I’m Learning About Chaos

Yes, there’s a certain logic to focusing purely on a physical prototype first. But this is a side project meant to be fun, and I honestly have no idea where it’s heading. So instead of forcing order, I’m leaning into the fun and seeing where it takes me.

Right now, chaos is bubbling everywhere because I’m basically doing both at the same time—tinkering with a paper version and iterating on a digital one.

And that’s not entirely unprecedented. Building a paper version of a game before implementing it digitally is a time‑honored tradition. One of the best tactics games I’ve played in years (Mario + Rabbids Kingdom Battle) was built this way, and it worked beautifully.

Here’s what I’m starting to realize:

Building flashy visuals and complex systems is fun, but they’re add-ons, not the core of the game.
Chaos is part of the creative process. The trick is knowing which parts to hold onto and which to let drift away.
Having two prototypes isn’t inherently bad—as long as you know why each exists and what you’re testing with it.

The Plan (For Now)

For now, I’m planning on following the fun—playing with procedural generation and trying to make this into a video game that can be played quickly.

And I might still be making a physical version because drawing is fun and making physical things is entertaining from time to time.

…yes, I know I just went, “I am going to do everything everywhere all at once…”

Embracing the Weirdness

Maybe this project was never going to stay neat and tidy. Maybe my creative process just looks like a cloud of floating hexes.

And maybe that’s okay.

So, here’s to embracing chaos, even when you’re not entirely sure what the hell you’re building.

Islands at War: Designing a Board Game with AI

2025-08-21T11:00:00+00:00

Or my on going adventures in playing with AI development tools …

Inspiration: Rediscovering NetStorm

Sometimes the strangest of projects begin with a simple question: do you remember NetStorm?

For most people, the answer is no. NetStorm was a quirky real-time strategy game from the late 90s where players built chains of bridges to connect floating islands, deployed priests to capture enemy units, and unleashed elemental spells to control the battlefield. Matches were fast, tactical, and strange in a way that made it unforgettable, even if few played it at the time.;

But for me, it triggered an itch that just couldn’t be scratched. For the past year it’s been popping into my head like a song that gets stuck on repeat — especially the memory of floating islands, the roads stretching between them, and the chaos that ensued once those bridges connected. And since it’s impossible to play on modern tech without resorting to figuring out complicated emulation for a legacy Windows game on a modern Mac, I found myself wondering: what if I could capture that vibe from a board game instead?

It was a ridiculous idea in some ways, rooted in nostalgia for something obscure. But it stuck. And once the cube project was done, I found myself diving headfirst into a new design challenge. This time, though, I wasn’t working alone.

Lessons from the Cube Project

The Jumpstart cube project had taught me something important: AI could be a collaborator, not just a tool.

The breakthrough wasn’t about the right prompt or the cleverest model hack. It was about treating the AI like a junior engineer — someone who could help generate ideas, point out problems, and accelerate iteration, but who still needed guidance, context, and decisions.

I learned to:

Ask AI to critique ideas, not just produce them.
Frame questions in terms of gaps, edge cases, and “what doesn’t make sense.”
Use the back-and-forth to sharpen my own intent.

That mindset became the foundation for the new board game project.

From Digital to Tabletop: Goals and Challenges

The initial vision that drove me was the feeling of islands floating in the sky at war — a mashup of inspirations:

Magic: The Gathering’s Jumpstart ease of deck building and fun with interesting themed decks,
Marvel Champions’ card management,
and Star Wars: The Deckbuilding Game’s capital ships — which at first felt like they could fit the vibe I was imagining, but in my version, those ships were islands.

I started with a speculative chat that led to a rough design doc — high-level, but with enough of a skeleton to start iterating toward a solution. Then I followed it up with discussions about resource management, card flow, and how creatures might interact with the islands.

At each stage I translated these prompts ithrough Claude Code into a sprawling Python codebase, building rough but tangible systems so I could test how the mechanics meshed and influenced each other.

And then came the breakthrough …. or perhaps it was a bombshell. This was the small decision that went on to change every other single decision, as the consequences kept cascading out from it.

The Sky Island Moment

The question that changed everything was deceptively simple: what exactly are the islands, thematically?

Somewhere in that back-and-forth, the idea of sky islands emerged. The idea went from an abstract hand wave of “there is a thing” to grappling with how to tie them into gameplay — what does it mean to interact with them, and why does that even matter? Suddenly the game wasn’t just a dueling card game. It shifted into something closer to a dueling wargame — a clash of floating fortresses, each vying for control of the skies.

That one shift cascaded into every corner of the design:

Creatures became more than just units — they had roles tied to the islands themselves.
Resources weren’t just mana, they were generated through interactions between creatures, cards, and island abilities.
The turn structure morphed to resemble wargames like Bolt Action or even the action phases of Terraforming Mars.
The definition of the “player” avatar in the game changed, along with how victory was determined.

It was dramatic. The feel of the entire game had transformed. And it had all come out of a collaborative conversation with the AI.

How Collaboration Looked in Practice

One of the things I’ve been experimenting with is using different AIs for different roles. For this project:

I’d often workshop prompts and do deep exploratory discussions in ChatGPT, digging into mechanics and edge cases.
Then I’d switch to Claude when it came time to pair on writing code or fleshing out structured text.

I’ve been leaning into a spec‑driven design philosophy: writing in‑depth specification files that dive into each game system, then keeping them directly in the codebase alongside the code itself. This way they act as living documents the AI can reference when implementing features or validating correctness.

~ ls ~/workspace/sky-islands/specs
1-initial-feature.md              14-streamline-resources.md        5-creatrure-combat.md
10-resource-management.md         15-streamline-upkeep.md           6-turn-structure.md
11-indepth-turn-structure.md      2-pay-by-discard.md               7-leader-homebase.md
12-upkeep-payment-supply-lines.md 3-sky-islands.md                  8-victory-conditions.md
13-deck-composition.md            4-ui.md                           9-hand-size-and-draw-mechanics.md

The rhythm became clear: spend more time up front in detailed conversation about a single feature or mechanic, then move to execution.

The most valuable questions I asked were:

Where are the gaps?
What doesn’t make sense?
What’s wrong with this design?

Those questions made the AI less of an idea generator and more of a critical partner. And that’s when the collaboration felt most real.

Where I Am Now

At this point, I have a partially refined board game with a comprehensive set of rules that cover deck design, map interaction and play, resources, hand management, combat, magic, and more. I also have a partially implemented working Pygame simulation of the board game. It’s still very much a developer interface and only kind of works, but it’s there. It’s already over 24k lines of Python code.

What’s exciting is how the different systems impact each other in ways that feel earned and interesting — at least from my high-level thinking so far. I don’t yet have a fully functional version of the game, but the foundations are solid and interconnected. Every new rule or mechanic ripples through the system in a way that makes the design feel alive.

Looking Back

AI didn’t design this game. But it helped me test ideas faster, spot flaws earlier, and push through creative blocks.

What surprised me most wasn’t the cleverness of any single AI output. It was how much momentum I got from the dialogue itself. The interaction between my intent and the AI’s output was where the creativity lived.

I started this whole thing chasing a nostalgic itch for a half-forgotten 90s RTS. I ended up building the foundations of a brand new board game.

And I don’t think I would have got there without treating the AI as a collaborator.

The Zero-to-Vibe Coding Jumpstart Cube Catastrophication

2025-08-04T11:00:00+00:00

From Pauper to Pandemonium: Building a Magic Jumpstart cube when you can’t change anything

The Problem: Jumpstarting a Pauper Cube

This story starts, as with all good stories, with a bad idea that just wouldn’t get out of my head.

I hadn’t played Magic: The Gathering in 20 years, but the idea of building a Pauper Jumpstart Cube was too enticing to ignore. It all started when I came across this Reddit post that described how trading card games like Magic could be treated as curated board game experiences. That led me to discover thepaupercube.com, and I was captivated.

For the uninitiated:

Magic: The Gathering is a collectible card game that mixes strategy, fantasy, and an unholy amount of rules.
Pauper format only allows commons, the lowest rarity of card. It’s great for design constraints and budget builds.
Jumpstart is a format where you mash two themed 20-card packs together for instant deck-building fun.

But there was a snag: where do I even get these cards?

At first, I tried buying them — until I saw the price. About £250 for the cards and an eye-watering £500 for shipping. Absolutely not.

Eventually I figured out how to print proxies using MakePlayingCards.com and sourcing the images and data from mpcfill.com. This workaround gave me control, affordability, and the ability to experiment freely.

Now I had a freshly printed Pauper cube in hand… and absolutely no idea what to do with it.

That’s when I came across Hasted’s Cube on CubeCobra — a Jumpstart cube built from a Pauper cube. It was basically perfect. Everything I wanted! The archetypes were there, the card synergy was elegant, the deck themes were fully formed. Instant salvation!

Except… there was a problem.

Hasted’s cube was built in September 2024. My version of the Pauper cube — freshly downloaded and printed — was from June 2025. As I soon discovered, the Pauper Cube is a community-maintained project that’s been actively updated every three months since 2008.

So I was 3–4 patches ahead of Hasted. Many of the cards his cube used had been removed or swapped out. When I tried to rebuild or expand his design, I quickly realized I was missing about 30 cards. Only 260 of the 450 cards in my cube were being used, and the 10 dual-color archetypes from his Jumpstart structure? Gone.

My brilliant idea? Build the Hasted decks as-is, then look at the leftover cards and do my best to patch in replacements that matched the original themes.

Which, given I hadn’t touched Magic in 20 years, turned out to be far harder than I expected — I barely understood what the cards did, let alone how they synergized.

Smooth sailing? Nope.

Wrong. Welcome to cube catastrophication.

The Solution(s): Iterating Toward Sanity

🛠️ Step 1: Do It By Hand

My first instinct was to patch in the decks manually. I started with Hasted’s archetypes and filled in gaps using leftover cards, trying to match themes as best as I could. It took hours. By the end, I had something that looked playable — but I had zero trust in it. Colors kind of matched, synergies sort of existed, and I kept second-guessing every choice. These were artisanal, hand-crafted, duct-taped abominations. They technically worked, but just how good (or bad) were they? I had no idea.

🤖 Step 2: Enter ChatGPT

Clearly it was time to bring out the big guns. AI to the rescue! It would magically do everything I wanted, right?

So I chucked a CSV with a list of cards, Oracle text, and some additional metadata at ChatGPT, along with — in hindsight — an absurdly naive prompt:

I am considering making a jumpstart cube from a pauper cube (see attached file). I need themes for the decks given the pauper cube. Could you generate 20 mono-colour themes (4 of each colour) and 10 dual-colour themes? Could you also include a list of potential cards that would fit that theme given the cube list attached?

What could go wrong?

At first glance, it was perfect. It generated a whole bunch of themed decks and cards to go with each one. It looked amazing.

But… it didn’t take constraints into account — like the fact that each card only appears once in the cube. Or that many of the cards it selected didn’t even exist in the list I had provided.

I found myself going back and forth, trying to mangle the data into a structure that was vaguely plausible. But eventually, I just got lost. The AI was enthusiastic, but this wasn’t working.

🧠 Step 3: Better Prompts

It was becoming increasingly obvious that the prompt I had started with just wasn’t working out. So I spent time hand-crafting a new one. (In hindsight, I probably should’ve asked GPT to help write the prompt — but that idea came much later.)

The new prompt:

Hi, you are a deck designer working on building game decks in Magic: The Gathering focusing on Jumpstart decks. You need to build a set of decks from a Pauper cube (see list of cards available in attached file).

Your goal is to create 20 mono-color decks (4 of each type) and 10 dual-color decks. Each deck should follow the rules of Jumpstart deck construction.

Let’s start with creating themes for each of these decks and assigning some cards to match those themes. Key constraint is that we cannot assign a card more than once from the Pauper cube list as they are uniques.

CONSTRAINTS:

You may only use a card once across all 30 decks

You may only use cards that are in the list of cards provided

You must ensure that the decks you construct are valid; cards in each deck should be usable with mana from that deck

You must ensure the chosen cards match the themes you have defined

This time, I had a clearly defined role, concrete constraints, and structured tasks.

This was it — the magical incantation that would solve all my problems…

Nope.

It struggled to generate valid cubes. It kept reusing cards. Decks were sometimes the wrong size. And worst of all, the data lived entirely inside GPT’s reply: in a format that was hard to verify or visualize.

I had to keep going back and forth to fix errors, find inconsistencies, and validate logic. Eventually, the prompt-response loop collapsed under its own weight. I ran out of context window, and the outputs started to degrade into nonsense.

🧠✨ Step 4: Better Model, Same Problems

Okay, so maybe the prompt wasn’t the problem. It was clearly much better than the first one. Maybe the real issue was the model I was using.

I had initially been working with GPT-4o, but I decided to switch to GPT-4-turbo (o3). And oh my word — the reasoning was incredible. Reams of Python code, clearly explained logic, thoughtful breakdowns. This was it. This was going to work!

…Nope.

Despite the improved explanations and structure, I ran into the same validation problems. Constraints were still being willfully violated. Cards were duplicated. CSVs had to be regenerated and re-debugged over and over and over again.

There was just too much context needing to go over the wire. Too much reliance on remote execution and ephemeral chat memory. I had no local reproducibility, no way to rerun anything myself — and too much of the process was invisible, buried in model responses I couldn’t reliably audit.

This was not working.

📓 Step 5: IPython Notebooks with Claude

I had recently been playing with agentic programming in IPython notebooks and thought to myself, “Hey! An IPython notebook might just be the ticket!”

So I set up a repo with a notebook and used GitHub Copilot along with Claude 4 Sonnet, treating it exactly the same way I’d treated ChatGPT: start with a prompt, feed in some context data, and ask questions.

Claude wrote all the Python code in the IPython notebook, and I could rerun things. This was great — I now had the data locally. I could iterate. I could export everything to a CSV in a format compatible with CubeCobra. I was finally developing a working validation process.

It was really working!

Until… it wasn’t.

Rapidly, the notebook exploded to over 10,000 lines of Python and Markdown. It became completely impossible to understand. I was noised out. There was too much code, too many hidden assumptions, and I couldn’t maintain any meaningful context anymore.

💫 Step 6: Vibe Coding

There’s a saying: when all you have is a hammer, every problem looks like a nail. I’ve been coding since 2000 — I should have known better.

Just getting the AI to dump everything into a single file and hoping for some semblance of sanity clearly wasn’t working. So, I decided to try vibe coding consciously for the first time. I started writing reusable functions: one to export CSVs in a consistent format, another to validate that cubes were generated correctly, more to parse the data and swap cards in and out.

Bit by bit, a new structure emerged: a collection of a dozen files, each with around a thousand lines of code. It was big, messy… and actually working.

I uploaded a CSV to Cube Cobra — and it was starting to look right. Not perfect, of course — there were still bugs. But these were bugs I could work with.

Encouraged, I kept going. I asked more questions. Tweaked more things. And then I hit that familiar point in every real-world software project when you skip refactoring. Everything becomes chaotic, fragile, and painful.

Some files grew so massive they defied comprehension. I had hit cognitive overload again. I couldn’t understand what I was reading anymore.

As an ardent TDD advocate in my day job, I realized I was missing two critical pieces of the red-green-refactor cycle: I was just writing code. No tests. No cleanups. Rookie mistake.

🔁 Step 7: Start from Scratch, Methodically

I decided to start from scratch, taking a leaf from Code Retreats and embracing the idea of throwing away code. I began with creating clean modules to construct a deck, refactored out constant values, and organized logic around key ideas. I added the ability to balance decks using different metrics, gave themes the ability to value cards differently, and even got the AI to extract themes from all existing Magic cards and map them against the cards I had available.

This time, I embraced a conscious build-and-refactor loop, following many of the patterns and code-smell habits Uncle Bob Martin drilled into me back when I first read his books. Slowly, things started to click. I had a system that could take a list of cards and — hands-free — generate a Jumpstart cube that actually worked.

And it was beautiful. The decks looked reasonable (at least to my untrained eye), and they respected the constraints of working within a limited cube.

The experience of vibe coding here was totally eye-opening. It required a different mindset — more like pairing with a junior engineer. There was back-and-forth about design patterns, code smells, duplication, and testing. It was a collaborative, iterative process, not what I had anticipated at all when I set out to “just get the AI to do it for me.”

Learnings

I came away from this with a set of learnings I didn’t anticipate. Over the course of this project, I used a wide range of AI tools — ChatGPT, Claude, GitHub Copilot — not just as assistants but as creative collaborators. The work would’ve been infeasible to complete by hand; AI didn’t just make it possible, it changed how I approached the problem.

AI is a Force Multiplier — With Strings Attached

AI made exploration and iteration possible at speeds that felt impossible before.
But it was only effective once I provided structure: clear prompts, reusable functions, validations, and constraints.
It was never “write this for me”; it was “pair with me while I figure this out.”

ChatGPT

Great for brainstorming, naming, structure, and kicking off ideas.
Struggles with long-running logic or deeply stateful tasks.
Context limits are real — and painful. Once you hit them, coherence breaks.
Loves to generate Python, but not always consistently or responsibly.

Claude + IPython

Being able to run, re-run, and inspect code locally was a game changer.
Having longer context helped — until the notebook turned into a 10k-line mess.
Still, this was my turning point: the shift from “AI as magic” to “AI as collaborator.”

Vibe Coding

Treating the AI like a junior engineer was the breakthrough.
I focused on function-level design, modularity, and code smells — and the AI followed along.
Once the codebase was structured, the AI stopped hallucinating and started contributing meaningfully.
But skipping testing and refactoring led me right back into chaos. The rules of clean code still apply.

The Meta-Learning

The process changed how I think about coding, prompting, and problem-solving.
It wasn’t the model, or the prompt — it was the interaction between human intent and machine output.
I didn’t just build a cube. I learned how to build with AI — iteratively, imperfectly, and eventually successfully.

The GitHub Repo

Want to see the madness? The final codebase — notebooks, card lists, helpers, and all — is here:

👉 github.com/vanonselenp/magic-jumpstart

👉 Cube Cobra: The actual cube generated

Final Thoughts

This project started as a light-hearted return to Magic and spiraled into a lesson in prompt engineering, AI pair programming, and the perils of working with 13k-word datasets inside stateless chat threads.

Would I do it again?

Absolutely. But next time I’ll write the tests first.

Charting a Path to Senior Staff Engineer

2025-08-03T10:00:00+00:00

Reflections on defining and growing into a Senior Staff Engineer role at The Economist.

Over the last few months, I’ve been thinking a lot about where I want to grow next in my career. I’m currently a Staff Engineer at The Economist, and while the organisation doesn’t (yet) have a formal Senior Staff Engineer role, the shape of that next step has started to come into focus. Rather than wait for a title to be defined for me, I’ve been working to define what Senior Staff means in this context — and how I might grow into it.

This post is a reflection on that process: how I evaluated my current impact, aligned with the broader goals of our engineering organisation, and defined personal goals that push me toward broader, deeper influence.

What is a Senior Staff Engineer?

From internal drafts, industry definitions, and personal reflection, a Senior Staff Engineer is not just a Staff Engineer with more time or scope. The role is about driving org-wide technical strategy, amplifying the work of others, and influencing engineering culture at scale. Some of the key traits include:

Architectural and strategic leadership across systems and teams
Influence without authority – aligning work through relationships and trust
Mentorship and capability-building, especially for other Staff+ engineers
Navigating ambiguity to create clarity and momentum
Deep alignment with business and organisational goals

Reflecting on My Work

Looking at the past year, I saw evidence that I was already starting to operate in this way:

Led technical initiatives with high impact, like CP2 performance improvements and the Cloudfront rollout, which doubled performance.
Drove architectural clarity through shared interface design (e.g. Rich Topics, TWIB) and API cleanups.
Mentored and paired regularly, particularly promoting TDD and clean code.
Shaped cross-team alignment on Homepage and caching strategies.
Worked systemically, reducing tech debt, improving observability, and simplifying services.

But I also saw areas where I could grow more deliberately into a Senior Staff role — particularly in scaling influence, amplifying others, and linking deeply to strategy.

Aligning With Organisational Goals

The Economist’s engineering strategy for FY26 includes goals such as:

Building a high-performing engineering organisation
Increasing engineering productivity and impact
Scaling foundational systems globally
Embracing AI
Embedding security throughout the stack

And in the Content & Web pillar specifically:

Making CP2 the source of truth for all content
Locking down APIs and content access
Improving content discovery
Deploying services globally for performance
Reducing tech debt and streamlining architecture

To grow into Senior Staff, I want to make sure my personal goals directly support this strategic direction.

Personal Goals for Growth

Here are two goals I’ve set for myself over the coming quarters:

1. Lead Strategic Modernisation of Media Workflows in CP2

Unify and simplify audio, video, and image ingestion and retrieval in CP2 to enable globally available, secure, and consistent content management.

Facilitate cross-team discovery (CAAS, Enablement, B2B)
Propose and validate new architectural models
Deliver a reference implementation
Define reusable contracts and security patterns

2. Grow Engineering Culture Through TDD, Mentorship, and AI Tooling

Elevate engineering quality and developer efficiency by fostering TDD culture, mentoring future leaders, and championing the use of AI tools.

Launch an “Engineering Craft Circle” for pairing and clean code
Mentor 2–3 engineers with Staff potential
Pilot AI tooling (e.g. Copilot, Cody) and measure impact
Drive adoption of 1–2 shared packages to reduce duplication

These goals stretch me beyond my current scope, while still delivering immediate value to the organisation. They also model the behaviours I believe a Senior Staff Engineer should embody.

Final Thoughts

Not every organisation has clearly defined Staff+ roles — but that shouldn’t stop us from shaping them. By reflecting on your work, aligning with strategic goals, and choosing deliberate areas of growth, you can build the case (and the capability) for the next step.

If you’re on a similar path, I’d love to hear how you’re thinking about growth beyond Staff. What’s helped you? What are you working on? Drop me a message or share your own story.

What Does a Staff Engineer Actually Do?

2025-08-02T10:00:00+00:00

Originally written as an internal working spec while I was at Cazoo, this post is my take on the responsibilities and mindset of a Staff Engineer. It helped me reflect on what the role requires, not just in terms of technical ability, but also leadership, communication, and constant adaptation.

Introduction

The Staff Engineer role can feel nebulous. It’s not quite management, but it’s also not just “Senior Engineer, but more so.” When I stepped into the role, I found myself constantly questioning: What should I be focusing on? What does success look like?

To answer that, I wrote down what I believed the role should encompass. This post is the cleaned-up version of that spec. It’s part philosophy, part operating manual.

The Role in a Nutshell

A Staff Software Engineer brings advanced technical skill and a strong sense of quality, leadership, and communication. It’s a role that blends design thinking, mentoring, business context, and servant leadership.

This is someone who:

Drives technical excellence.
Guides teams without pulling rank.
Mentors and grows engineers around them.
Communicates fluently across technical and non-technical domains.
Keeps one foot in the code, and one in the strategy.

Core Tenets

These are the pillars that, in my view, define the role.

1. Servant Leadership

The Staff Engineer leads by enabling others. They remove obstacles, give context, and help teammates succeed — without needing to be the loudest or most visible person in the room.

2. Mentoring and Coaching

They actively help other engineers grow — sharing experience, offering feedback, and creating space for others to step up.

3. Advanced Technical Skills & Quality Focus

In my case, this included deep expertise in:

TypeScript
AWS Lambda and the Serverless Framework
TDD and BDD practices

But more importantly, it meant advocating for a “Shift Left” approach — tackling quality early, writing strong tests, and building confidence into the development process.

4. Software Architecture

The Staff Engineer should be able to design scalable, secure, serverless systems — understanding trade-offs, constraints, and how to align architecture with business needs.

5. Problem Solving

This means breaking down complexity, identifying root causes, and coming up with solutions that work in real-world systems — not just whiteboard diagrams.

6. Communication

A key part of the role is acting as a bridge — translating technical ideas for stakeholders, and helping engineers understand the “why” behind the work.

7. Domain Understanding

Good decisions require context. A Staff Engineer needs to understand the business domain deeply enough to make trade-offs that serve real user and company needs.

The Mindset: Be Less Wrong Over Time

I tried to approach the role with a Bayesian mindset: always updating my understanding based on new evidence. In practice, this meant:

Re-evaluating what’s most important, constantly.
Adjusting plans as work reveals new complexity.
Changing approach based on feedback.
Keeping a habit of continuous learning.

This mindset helped me stay adaptive and avoid getting stuck in “default” thinking.

Measuring Impact

Performance at this level can’t be measured by ticket output. Instead, I think about impact across these dimensions:

Business outcomes: Are you driving results that matter?
Team enablement: Is the team more productive because you’re on it?
Mentorship: Are others levelling up with your help?
Communication: Can stakeholders understand you — and trust your judgement?
Learning and adaptability: Are you growing faster than the problems are changing?
Problem-solving: Are you taking on the hard, messy challenges?
Team feedback: Do your peers want to work with you again?
Stakeholder trust: Do people outside the team rely on you?

No single metric tells the full story. But together, they can indicate whether you’re making a difference.

Final Thoughts

This spec isn’t meant to be definitive — just a personal take. The Staff Engineer role will always vary based on the team, company, and product. But having a compass — a sense of what good looks like — has helped me stay focused on the kind of impact I want to have.

If you’re stepping into a Staff+ role, or just thinking about it, I hope this helps spark some ideas.