Vibe-coding in Crystal

zw963 · April 8, 2026, 2:29am

From my perspective, the fact that GPT is actually very familiar with my project（which might be one of the reasons, given that we’ve been “vibe coding” together on and off for about two weeks）means that when I ask him to write tests, I only need to provide a brief description, and it knows exactly what needs to be tested.

In actual practice (after adding the tests, I tried reverting production code), to see the test failed, it’s almost always happen.

So, these tests are actually effective—even though they were written by AI and I didn’t even bother to look at them once during the whole process.

(Translated from Chinese using gemma4:26b)

zw963 · April 8, 2026, 2:41am

As I mentioned in my previous comment, AI can actually pull this off.

In fact, I think the Crystal specs written by GPT are quite good. The two files below were written entirely by AI (I didn’t change a single word):

github.com/crystal-china/procodile

spec/specs/scheduled_process_spec.cr

1b76b74df

require "../spec_helper"

private def wait_for_control_socket(sock_path : String) : Bool
  wait_until(5.seconds, 50.milliseconds) do
    begin
      UNIXSocket.new(sock_path).close
      true
    rescue Socket::Error | File::Error
      false
    end
  end
end

private def run_status_command(app_root : String) : String
  output = IO::Memory.new
  status = ::Process.run(
    "crystal",
    ["run", "src/procodile.cr", "--", "-r", app_root, "status"],
    output: output,
    error: output,

This file has been truncated. show original

github.com/crystal-china/procodile

spec/specs/runtime_issues_spec.cr

1b76b74df

require "../spec_helper"

private def wait_for_control_socket(sock_path : String) : Bool
  wait_until(5.seconds, 50.milliseconds) do
    begin
      UNIXSocket.new(sock_path).close
      true
    rescue Socket::Error | File::Error
      false
    end
  end
end

private def run_cli_command(app_root : String, *args : String) : {::Process::Status, String}
  output = IO::Memory.new
  status = ::Process.run(
    "crystal",
    ["run", "src/procodile.cr", "--", "-r", app_root] + args.to_a,
    output: output,
    error: output,

This file has been truncated. show original

If you are strictly pursuing DRY (Don’t Repeat Yourself) principles in your testing, then AI might fall short. But like I said before, back when I was writing a lot of tests in Ruby, I only cared about keeping the production code clean. I intentionally wrote redundant tests (in fact, due to a lot of copy-pasting, my test code was several times larger than my production code). So, in my view, letting AI handle this kind of redundancy nowadays is actually a great idea.

I’m not a dogmatist when it comes to TDD (Test-Driven Development) either（for example, the “test-first” approach）. Especially after moving from Ruby to Crystal, the weight I give to testing has decreased significantly.

That said, regression testing for core features is still vital: you want to make sure you don’t accidentally introduce breaking changes during a refactor without realizing it.

(Translated from Chinese using gemma4:26b)

Xen · April 8, 2026, 6:39am

I’m sure it can do an excellent job most of the time, but everybody got the story about that one time where the LLM fixed the test by removing the important part. If you don’t even review it, how can you be sure that they’re as good as you think?

zw963 · April 8, 2026, 3:39pm

If you don’t even review it, how can you be sure that they’re as good as you think?

I don’t care the spec code, only if can work.

the LLM fixed the test by removing the important part.

How this happen? I am chatting with GPT in codex, he know what is current need doing, if he change my production code, I know it immediately.

In fact, I write most of the production code myself (with the help from GPT, of course). I take responsibility for the code I write, while the AI is responsible for the specs. If there’s an error in the specs, GPT will point out the cause during our conversation.

npn · April 8, 2026, 5:24pm

GenAI cheats every time. instead of solve the problem properly, it will take shortcut, like hard code the solution for the test suit directly in the library, or adjust the unit case to the wrong output so it passes the test.

believe me, I have seen it doing it every time, no matter what model you are using, they all cheat when they can’t think of any proper solution.

ralsina · April 8, 2026, 8:31pm

That happens very rarely in my experience.

renich · June 5, 2026, 1:11pm

Check this video out:

It’s about Linus Torvald’s recent experience with AI in the kernel.

The thing I was interested the most was discussed here. How do we actually collaborate now that everyone can write their own kernel to boot. Developing personal tools (patience, tolerance, compromise, etc) that allow us to work together.

It’s always better to work together than alone.

… and we all became Akira in this story. ;D

Topic		Replies	Views
Crystal for Agents v1.20.0 release News	18	457	April 23, 2026
What are you cooking? Community	18	484	July 20, 2026
Why Isn't Every Sane Developer Obsessed with Crystal? Community	41	1452	July 25, 2026
[RFC] Surviving the AI PR Flood: The Macro X-Ray & Asymmetric TDD Crystal Contrib rfc	23	731	July 26, 2026
Collaboration in the age of AI Community	3	149	July 13, 2026

Vibe-coding in Crystal

Related topics