Add `A Claude Code command for Hypothesis` blog post #4571

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Merged

Zac-HD merged 21 commits into HypothesisWorks:master from Liam-DeVoe:claude-code-blog

Nov 1, 2025

+300 −24

Member

Liam-DeVoe commented Oct 21, 2025

Includes a new /hypothesis command, rewritten from the base /hypo command in the paper to focus on test writing for long-term maintainers and developers, not immediate bug hunting. Based on an initial draft from @mmaaz-git (thanks!).

I'm very unsure where the best place to host hypothesis.md is. I don't really want to do a claude dir, because (at least at the moment) this is a generic AI command, as long as your framework implements claude-style tools. I've put it in agents, even though it's not really an agent, because agents clearly communicates "ai". One idea is we host it as a static file on hypothesis.works, and figure out a more permanent place when the ecosystem settles down.

cc other paper authors: @Zac-HD @mmaaz-git @carlini

BTW @carlini I've kept you as an author on this blog post because it discusses the paper, and you're an author there. But I spend about half the post talking about non-paper things, so if you don't want to be listed as endorsing that, just let me know and I can remove you. Just didn't want to take paper credit away. Quite happy to keep you as an author as well of course!

Liam-DeVoe added 8 commits

October 20, 2025 01:22


          improve warning message

2df4766


          deflake test deadline

7d67c00


          rename website headers

5dca507


          improve website styling

346787d


          add /hypothesis blog post

a63d41c


          Merge remote-tracking branch 'upstream/master' into claude-code-blog

796a995


          reword

65ddb5c


          fixes

d49b2d1

Liam-DeVoe force-pushed the claude-code-blog branch from 3f28719 to d49b2d1 Compare

October 21, 2025 06:37

Liam-DeVoe added 3 commits

October 21, 2025 14:56


          format to make shed happy

d9233dd


          match github's python colors closer

528cc46


          "inverse gaussian" is not the inverse of the gaussian

33f7776

Liam-DeVoe commented

View reviewed changes

website/content/2025-10-20-claude-code-plugin.md Outdated Show resolved Hide resolved

Liam-DeVoe added 2 commits

October 21, 2025 20:11


          typo

9ffcdc2


          rewording

a54bf37

Zac-HD reviewed

View reviewed changes

Member

Zac-HD left a comment

(partial review, more later)

agents/hypothesis.md Outdated Show resolved Hide resolved

agents/hypothesis.md Outdated Show resolved Hide resolved

website/content/2025-10-20-claude-code-plugin.md Outdated Show resolved Hide resolved

agents/hypothesis.md Outdated Show resolved Hide resolved

website/content/2025-10-20-claude-code-plugin.md Outdated Show resolved Hide resolved

Liam-DeVoe added 3 commits

October 27, 2025 00:17


          address minor-ish review comments

a6ce031


          avoid use of /hypothesis as a noun

4161baf


          reword prompt to focus on valuable properties

23649a5

Zac-HD approved these changes

View reviewed changes

Member

Zac-HD left a comment

minor copyedits below, but I'm looking forward to publishing this! (and #4556)

website/content/2025-10-20-claude-code-plugin.md Outdated Show resolved Hide resolved

website/content/2025-10-20-claude-code-plugin.md

    
              ## Failure modes

              We observed a few failure modes while developing `/hypothesis`. For example, AI models like to write strategies with unnecessary restrictions, like limiting the maximum length of a list even when the property should hold for all lengths of lists. We added explicit instructions in `/hypothesis` not to do this, though that doesn't appear to have fixed the problem entirely.

Member

Zac-HD Oct 31, 2025

Seems fun to note here that many of our human users do the same thing - and docs don't stop them either 😅

website/content/2025-10-20-claude-code-plugin.md Outdated Show resolved Hide resolved

website/content/2025-10-20-claude-code-plugin.md Show resolved Hide resolved

website/content/2025-10-20-claude-code-plugin.md Outdated Show resolved Hide resolved

Liam-DeVoe added 5 commits

November 1, 2025 12:29


          reword

abb594e


          better link

2e990c8


          reword + link

31cd5c3


          mention hypofuzz

0efc9a6


          spacing

125b908

Zac-HD approved these changes

View reviewed changes

Zac-HD enabled auto-merge

November 1, 2025 20:46

Zac-HD merged commit 4cbd566 into HypothesisWorks:master

149 of 151 checks passed

Liam-DeVoe deleted the claude-code-blog branch

November 1, 2025 20:48

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet