| AI Model | Score | FCSR | Status |
|---|---|---|---|
openai/gpt-4.1 |
100 | 90% | π’ |
openai/gpt-4.1-mini |
100 | 76% | π’ |
openai/gpt-5-mini |
82.5 | 85% | π’ |
anthropic/claude-sonnet-4.5 |
65 | 69% | π‘ |
moonshotai/kimi-k2-0905-exacto |
65 | 61% | π‘ |
openai/gpt-5 |
57.5 | 92% | π‘ |
anthropic/claude-haiku-4.5 |
55 | 40% | π‘ |
qwen/qwen3-next-80b-a3b-instruct |
55 | 70% | π‘ |
deepseek/deepseek-v3.1-terminus-exacto |
47.5 | 79% | π‘ |
qwen/qwen3-30b-a3b-thinking-2507 |
33.75 | 66% | β |
meta-llama/llama-4-maverick |
12.5 | 94% | β |
google/gemini-2.5-pro |
10 | 97% | β |
- FCSR: Function Calling Success Rate
- Status:
- π’: All projects completed successfully
- π‘: Some projects failed
- β: All projects failed or not executed
| Project | Score | Analyze | Prisma | Interface | Test | Realize |
|---|---|---|---|---|---|---|
todo |
100 | π’ | π’ | π’ | π’ | π’ |
bbs |
100 | π’ | π’ | π’ | π’ | π’ |
reddit |
100 | π’ | π’ | π’ | π’ | π’ |
shopping |
100 | π’ | π’ | π’ | π’ | π’ |
- Source Code:
openai/gpt-4.1/todo - Score: 100
- Elapsed Time: 23m 2s
- Token Usage: 7.09M
- Function Calling Success Rate: 96.23%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 1, documents: 11 |
490.1K | 2m 55s | 100% |
| π’ Prisma | namespaces: 2, models: 3 |
177.0K | 1m 37s | 100% |
| π’ Interface | operations: 15, schemas: 19 |
3.46M | 7m 41s | 91% |
| π’ Test | functions: 16 |
1.79M | 4m 36s | 100% |
| π’ Realize | functions: 15 |
1.17M | 6m 11s | 100% |
- Source Code:
openai/gpt-4.1/bbs - Score: 100
- Elapsed Time: 52m 29s
- Token Usage: 28.72M
- Function Calling Success Rate: 91.72%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 2, documents: 11 |
497.8K | 6m 8s | 85% |
| π’ Prisma | namespaces: 3, models: 10 |
318.1K | 3m 52s | 100% |
| π’ Interface | operations: 57, schemas: 62 |
14.93M | 23m 43s | 86% |
| π’ Test | functions: 40 |
8.70M | 7m 35s | 96% |
| π’ Realize | functions: 57 |
4.28M | 11m 9s | 100% |
- Source Code:
openai/gpt-4.1/reddit - Score: 100
- Elapsed Time: 1h 42m 33s
- Token Usage: 111.61M
- Function Calling Success Rate: 92.87%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 2, documents: 11 |
593.4K | 5m 32s | 95% |
| π’ Prisma | namespaces: 10, models: 40 |
1.22M | 3m 25s | 100% |
| π’ Interface | operations: 161, schemas: 175 |
61.07M | 55m 12s | 87% |
| π’ Test | functions: 159 |
32.49M | 14m 30s | 98% |
| π’ Realize | functions: 161 |
16.23M | 23m 52s | 97% |
- Source Code:
openai/gpt-4.1/shopping - Score: 100
- Elapsed Time: 2h 40m 18s
- Token Usage: 384.86M
- Function Calling Success Rate: 89.00%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 3, documents: 15 |
946.3K | 4m 8s | 100% |
| π’ Prisma | namespaces: 11, models: 69 |
2.05M | 7m 55s | 92% |
| π’ Interface | operations: 366, schemas: 328 |
213.38M | 1h 17m 27s | 79% |
| π’ Test | functions: 339 |
113.95M | 28m 3s | 96% |
| π’ Realize | functions: 366 |
54.53M | 42m 43s | 99% |
| Project | Score | Analyze | Prisma | Interface | Test | Realize |
|---|---|---|---|---|---|---|
todo |
100 | π’ | π’ | π’ | π’ | π’ |
bbs |
100 | π’ | π’ | π’ | π’ | π’ |
reddit |
100 | π’ | π’ | π’ | π’ | π’ |
shopping |
100 | π’ | π’ | π’ | π’ | π’ |
- Source Code:
openai/gpt-4.1-mini/todo - Score: 100
- Elapsed Time: 1h 6m 32s
- Token Usage: 8.97M
- Function Calling Success Rate: 84.98%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 1, documents: 4 |
165.8K | 2m 15s | 90% |
| π’ Prisma | namespaces: 2, models: 3 |
115.6K | 1m 3s | 100% |
| π’ Interface | operations: 18, schemas: 23 |
3.90M | 31m 27s | 77% |
| π’ Test | functions: 25 |
3.35M | 16m 21s | 88% |
| π’ Realize | functions: 18 |
1.43M | 15m 24s | 97% |
- Source Code:
openai/gpt-4.1-mini/bbs - Score: 100
- Elapsed Time: 1h 35m 40s
- Token Usage: 15.06M
- Function Calling Success Rate: 82.20%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 3, documents: 4 |
152.6K | 2m 25s | 90% |
| π’ Prisma | namespaces: 3, models: 7 |
243.3K | 1m 25s | 77% |
| π’ Interface | operations: 30, schemas: 37 |
6.85M | 1h 10m 50s | 69% |
| π’ Test | functions: 38 |
5.25M | 12m 40s | 94% |
| π’ Realize | functions: 30 |
2.56M | 8m 18s | 98% |
- Source Code:
openai/gpt-4.1-mini/reddit - Score: 100
- Elapsed Time: 2h 18m 34s
- Token Usage: 92.96M
- Function Calling Success Rate: 78.92%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 4, documents: 11 |
607.5K | 12m 13s | 82% |
| π’ Prisma | namespaces: 8, models: 23 |
920.3K | 2m 40s | 85% |
| π’ Interface | operations: 137, schemas: 137 |
44.04M | 1h 11m 53s | 67% |
| π’ Test | functions: 149 |
33.37M | 30m 49s | 83% |
| π’ Realize | functions: 137 |
14.02M | 20m 56s | 98% |
- Source Code:
openai/gpt-4.1-mini/shopping - Score: 100
- Elapsed Time: 3h 22m 26s
- Token Usage: 179.97M
- Function Calling Success Rate: 72.80%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 4, documents: 10 |
474.3K | 12m 52s | 72% |
| π’ Prisma | namespaces: 10, models: 39 |
1.29M | 2m 39s | 84% |
| π’ Interface | operations: 205, schemas: 199 |
85.70M | 1h 32m 57s | 58% |
| π’ Test | functions: 240 |
62.16M | 30m 0s | 77% |
| π’ Realize | functions: 205 |
30.36M | 1h 3m 56s | 97% |
| Project | Score | Analyze | Prisma | Interface | Test | Realize |
|---|---|---|---|---|---|---|
todo |
100 | π’ | π’ | π’ | π’ | π’ |
bbs |
100 | π’ | π’ | π’ | π’ | π’ |
reddit |
100 | π’ | π’ | π’ | π’ | π’ |
shopping |
30 | π’ | π’ | β | β | β |
- Source Code:
openai/gpt-5-mini/todo - Score: 100
- Elapsed Time: 3h 5m 54s
- Token Usage: 79.34M
- Function Calling Success Rate: 86.84%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 3, documents: 11 |
959.0K | 27m 47s | 95% |
| π’ Prisma | namespaces: 5, models: 20 |
789.2K | 7m 10s | 100% |
| π’ Interface | operations: 92, schemas: 124 |
50.05M | 51m 11s | 77% |
| π’ Test | functions: 105 |
18.03M | 59m 14s | 98% |
| π’ Realize | functions: 92 |
9.51M | 40m 29s | 96% |
- Source Code:
openai/gpt-5-mini/bbs - Score: 100
- Elapsed Time: 2h 9m 19s
- Token Usage: 102.47M
- Function Calling Success Rate: 86.59%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 3, documents: 11 |
871.4K | 9m 52s | 100% |
| π’ Prisma | namespaces: 7, models: 25 |
1.17M | 9m 20s | 94% |
| π’ Interface | operations: 105, schemas: 153 |
65.04M | 55m 13s | 77% |
| π’ Test | functions: 126 |
23.45M | 20m 41s | 98% |
| π’ Realize | functions: 105 |
11.95M | 34m 11s | 98% |
- Source Code:
openai/gpt-5-mini/reddit - Score: 100
- Elapsed Time: 3h 10m 27s
- Token Usage: 102.76M
- Function Calling Success Rate: 83.99%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 3, documents: 12 |
1.07M | 28m 27s | 96% |
| π’ Prisma | namespaces: 10, models: 35 |
2.07M | 9m 47s | 95% |
| π’ Interface | operations: 87, schemas: 132 |
69.07M | 1h 38m 35s | 72% |
| π’ Test | functions: 107 |
20.21M | 22m 6s | 97% |
| π’ Realize | functions: 87 |
10.34M | 31m 29s | 99% |
- Source Code:
openai/gpt-5-mini/shopping - Score: 30
- Elapsed Time: 25m 7s
- Token Usage: 3.22M
- Function Calling Success Rate: 91.84%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 3, documents: 11 |
1.03M | 11m 43s | 95% |
| π’ Prisma | namespaces: 10, models: 50 |
2.19M | 13m 23s | 88% |
| βͺ Interface | ||||
| βͺ Test | ||||
| βͺ Realize |
| Project | Score | Analyze | Prisma | Interface | Test | Realize |
|---|---|---|---|---|---|---|
todo |
100 | π’ | π’ | π’ | π’ | π’ |
bbs |
100 | π’ | π’ | π’ | π’ | π’ |
reddit |
30 | π’ | π’ | β | β | β |
shopping |
30 | π’ | π’ | β | β | β |
- Source Code:
anthropic/claude-sonnet-4.5/todo - Score: 100
- Elapsed Time: 1h 5m 15s
- Token Usage: 29.67M
- Function Calling Success Rate: 75.46%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 2, documents: 11 |
1.19M | 11m 44s | 82% |
| π’ Prisma | namespaces: 2, models: 5 |
432.4K | 3m 28s | 100% |
| π’ Interface | operations: 36, schemas: 35 |
13.94M | 32m 53s | 86% |
| π’ Test | functions: 52 |
7.36M | 7m 5s | 94% |
| π’ Realize | functions: 36 |
6.75M | 10m 3s | 42% |
- Source Code:
anthropic/claude-sonnet-4.5/bbs - Score: 100
- Elapsed Time: 2h 3m 5s
- Token Usage: 159.74M
- Function Calling Success Rate: 68.03%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 3, documents: 11 |
1.66M | 24m 41s | 63% |
| π’ Prisma | namespaces: 6, models: 19 |
2.72M | 9m 22s | 66% |
| π’ Interface | operations: 96, schemas: 117 |
86.48M | 36m 36s | 80% |
| π’ Test | functions: 226 |
40.23M | 23m 14s | 86% |
| π’ Realize | functions: 96 |
28.64M | 29m 9s | 32% |
- Source Code:
anthropic/claude-sonnet-4.5/reddit - Score: 30
- Elapsed Time: 25m 55s
- Token Usage: 4.78M
- Function Calling Success Rate: 70.91%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 3, documents: 11 |
1.66M | 14m 10s | 82% |
| π’ Prisma | namespaces: 7, models: 30 |
3.12M | 11m 45s | 59% |
| βͺ Interface | ||||
| βͺ Test | ||||
| βͺ Realize |
- Source Code:
anthropic/claude-sonnet-4.5/shopping - Score: 30
- Elapsed Time: 40m 22s
- Token Usage: 8.36M
- Function Calling Success Rate: 73.33%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 3, documents: 16 |
3.45M | 28m 11s | 75% |
| π’ Prisma | namespaces: 10, models: 69 |
4.91M | 12m 11s | 70% |
| βͺ Interface | ||||
| βͺ Test | ||||
| βͺ Realize |
| Project | Score | Analyze | Prisma | Interface | Test | Realize |
|---|---|---|---|---|---|---|
todo |
100 | π’ | π’ | π’ | π’ | π’ |
bbs |
100 | π’ | π’ | π’ | π’ | π’ |
reddit |
30 | π’ | π’ | β | β | β |
shopping |
30 | π’ | π’ | β | β | β |
- Source Code:
moonshotai/kimi-k2-0905-exacto/todo - Score: 100
- Elapsed Time: 1h 55m 7s
- Token Usage: 27.07M
- Function Calling Success Rate: 67.25%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 2, documents: 4 |
152.3K | 2m 49s | 100% |
| π’ Prisma | namespaces: 3, models: 6 |
222.9K | 2m 5s | 100% |
| π’ Interface | operations: 47, schemas: 40 |
14.85M | 1h 9m 32s | 50% |
| π’ Test | functions: 53 |
8.10M | 24m 27s | 85% |
| π’ Realize | functions: 47 |
3.74M | 16m 12s | 94% |
- Source Code:
moonshotai/kimi-k2-0905-exacto/bbs - Score: 100
- Elapsed Time: 3h 21m 30s
- Token Usage: 48.13M
- Function Calling Success Rate: 55.89%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 3, documents: 8 |
378.5K | 5m 31s | 100% |
| π’ Prisma | namespaces: 5, models: 18 |
462.2K | 5m 33s | 92% |
| π’ Interface | operations: 39, schemas: 69 |
30.54M | 1h 30m 29s | 42% |
| π’ Test | functions: 47 |
9.37M | 29m 17s | 55% |
| π’ Realize | functions: 39 |
7.38M | 1h 10m 39s | 90% |
- Source Code:
moonshotai/kimi-k2-0905-exacto/reddit - Score: 30
- Elapsed Time: 23m 3s
- Token Usage: 1.98M
- Function Calling Success Rate: 87.76%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 4, documents: 12 |
700.7K | 12m 25s | 89% |
| π’ Prisma | namespaces: 8, models: 44 |
1.28M | 10m 37s | 85% |
| βͺ Interface | ||||
| βͺ Test | ||||
| βͺ Realize |
- Source Code:
moonshotai/kimi-k2-0905-exacto/shopping - Score: 30
- Elapsed Time: 42m 23s
- Token Usage: 2.01M
- Function Calling Success Rate: 74.07%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 4, documents: 10 |
587.8K | 12m 40s | 87% |
| π’ Prisma | namespaces: 8, models: 47 |
1.42M | 29m 43s | 63% |
| βͺ Interface | ||||
| βͺ Test | ||||
| βͺ Realize |
| Project | Score | Analyze | Prisma | Interface | Test | Realize |
|---|---|---|---|---|---|---|
todo |
100 | π’ | π’ | π’ | π’ | π’ |
bbs |
70 | π’ | π’ | π’ | π‘ | β |
reddit |
30 | π’ | π’ | β | β | β |
shopping |
30 | π’ | π’ | β | β | β |
- Source Code:
openai/gpt-5/todo - Score: 100
- Elapsed Time: 1h 8m 24s
- Token Usage: 14.87M
- Function Calling Success Rate: 93.95%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 1, documents: 11 |
854.2K | 7m 10s | 100% |
| π’ Prisma | namespaces: 3, models: 4 |
448.4K | 5m 20s | 100% |
| π’ Interface | operations: 17, schemas: 32 |
7.96M | 29m 39s | 87% |
| π’ Test | functions: 17 |
4.48M | 16m 24s | 100% |
| π’ Realize | functions: 17 |
1.13M | 9m 48s | 100% |
- Source Code:
openai/gpt-5/bbs - Score: 70
- Elapsed Time: 1h 24m 16s
- Token Usage: 74.64M
- Function Calling Success Rate: 90.41%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 2, documents: 11 |
886.3K | 7m 13s | 95% |
| π’ Prisma | namespaces: 5, models: 24 |
809.1K | 9m 57s | 100% |
| π’ Interface | operations: 77, schemas: 148 |
49.91M | 48m 9s | 84% |
| π΄ Test | functions: 13, errors: 1 |
23.04M | 18m 55s | 100% |
| βͺ Realize |
- Source Code:
openai/gpt-5/reddit - Score: 30
- Elapsed Time: 25m 32s
- Token Usage: 4.62M
- Function Calling Success Rate: 100.00%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 3, documents: 16 |
1.79M | 10m 46s | 100% |
| π’ Prisma | namespaces: 10, models: 83 |
2.83M | 14m 46s | 100% |
| βͺ Interface | ||||
| βͺ Test | ||||
| βͺ Realize |
- Source Code:
openai/gpt-5/shopping - Score: 30
- Elapsed Time: 28m 10s
- Token Usage: 6.51M
- Function Calling Success Rate: 100.00%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 3, documents: 17 |
2.02M | 9m 28s | 100% |
| π’ Prisma | namespaces: 13, models: 107 |
4.49M | 18m 42s | 100% |
| βͺ Interface | ||||
| βͺ Test | ||||
| βͺ Realize |
| Project | Score | Analyze | Prisma | Interface | Test | Realize |
|---|---|---|---|---|---|---|
todo |
80 | π’ | π’ | π’ | π’ | β |
bbs |
80 | π’ | π’ | π’ | π’ | β |
reddit |
30 | π’ | π’ | β | β | β |
shopping |
30 | π’ | π’ | β | β | β |
- Source Code:
anthropic/claude-haiku-4.5/todo - Score: 80
- Elapsed Time: 1h 5m 19s
- Token Usage: 59.94M
- Function Calling Success Rate: 39.61%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 2, documents: 10 |
890.8K | 5m 14s | 100% |
| π’ Prisma | namespaces: 2, models: 6 |
692.5K | 4m 38s | 85% |
| π’ Interface | operations: 37, schemas: 44 |
42.42M | 40m 31s | 30% |
| π’ Test | functions: 96 |
15.94M | 14m 55s | 46% |
| βͺ Realize |
- Source Code:
anthropic/claude-haiku-4.5/bbs - Score: 80
- Elapsed Time: 1h 5m 37s
- Token Usage: 93.39M
- Function Calling Success Rate: 37.83%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 3, documents: 11 |
1.08M | 6m 59s | 76% |
| π’ Prisma | namespaces: 7, models: 12 |
1.73M | 5m 54s | 80% |
| π’ Interface | operations: 64, schemas: 79 |
55.35M | 30m 54s | 29% |
| π’ Test | functions: 188 |
35.24M | 21m 49s | 42% |
| βͺ Realize |
- Source Code:
anthropic/claude-haiku-4.5/reddit - Score: 30
- Elapsed Time: 18m 15s
- Token Usage: 3.28M
- Function Calling Success Rate: 73.47%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 4, documents: 12 |
1.82M | 13m 46s | 69% |
| π’ Prisma | namespaces: 9, models: 52 |
1.46M | 4m 29s | 84% |
| βͺ Interface | ||||
| βͺ Test | ||||
| βͺ Realize |
- Source Code:
anthropic/claude-haiku-4.5/shopping - Score: 30
- Elapsed Time: 20m 0s
- Token Usage: 3.71M
- Function Calling Success Rate: 77.08%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 3, documents: 12 |
1.58M | 14m 31s | 75% |
| π’ Prisma | namespaces: 11, models: 79 |
2.13M | 5m 29s | 81% |
| βͺ Interface | ||||
| βͺ Test | ||||
| βͺ Realize |
| Project | Score | Analyze | Prisma | Interface | Test | Realize |
|---|---|---|---|---|---|---|
todo |
100 | π’ | π’ | π’ | π’ | π’ |
bbs |
80 | π’ | π’ | π’ | π’ | β |
reddit |
30 | π’ | π’ | β | β | β |
shopping |
10 | π’ | β | β | β | β |
- Source Code:
qwen/qwen3-next-80b-a3b-instruct/todo - Score: 100
- Elapsed Time: 1h 8m 53s
- Token Usage: 8.34M
- Function Calling Success Rate: 72.86%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 1, documents: 3 |
111.6K | 32s | 100% |
| π’ Prisma | namespaces: 2, models: 3 |
114.5K | 8m 34s | 29% |
| π’ Interface | operations: 8, schemas: 12 |
3.95M | 45m 3s | 65% |
| π’ Test | functions: 2 |
2.39M | 7m 31s | 77% |
| π’ Realize | functions: 8 |
1.78M | 7m 11s | 97% |
- Source Code:
qwen/qwen3-next-80b-a3b-instruct/bbs - Score: 80
- Elapsed Time: 1h 3m 52s
- Token Usage: 30.98M
- Function Calling Success Rate: 68.36%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 2, documents: 5 |
206.0K | 51s | 100% |
| π’ Prisma | namespaces: 7, models: 15 |
558.9K | 2m 34s | 94% |
| π’ Interface | operations: 57, schemas: 48 |
18.31M | 33m 58s | 62% |
| π’ Test | functions: 22 |
11.91M | 26m 29s | 73% |
| βͺ Realize |
- Source Code:
qwen/qwen3-next-80b-a3b-instruct/reddit - Score: 30
- Elapsed Time: 13m 16s
- Token Usage: 2.79M
- Function Calling Success Rate: 80.00%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 3, documents: 10 |
639.6K | 2m 42s | 100% |
| π’ Prisma | namespaces: 10, models: 40 |
2.15M | 10m 33s | 67% |
| βͺ Interface | ||||
| βͺ Test | ||||
| βͺ Realize |
- Source Code:
qwen/qwen3-next-80b-a3b-instruct/shopping - Score: 10
- Elapsed Time: 5m 29s
- Token Usage: 576.5K
- Function Calling Success Rate: 87.50%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 3, documents: 10 |
576.5K | 5m 29s | 87% |
| βͺ Prisma | ||||
| βͺ Interface | ||||
| βͺ Test | ||||
| βͺ Realize |
| Project | Score | Analyze | Prisma | Interface | Test | Realize |
|---|---|---|---|---|---|---|
todo |
100 | π’ | π’ | π’ | π’ | π’ |
bbs |
30 | π’ | π’ | β | β | β |
reddit |
30 | π’ | π’ | β | β | β |
shopping |
30 | π’ | π’ | β | β | β |
- Source Code:
deepseek/deepseek-v3.1-terminus-exacto/todo - Score: 100
- Elapsed Time: 1h 43m 24s
- Token Usage: 15.39M
- Function Calling Success Rate: 76.71%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 1, documents: 11 |
631.3K | 6m 0s | 92% |
| π’ Prisma | namespaces: 3, models: 4 |
356.7K | 2m 37s | 100% |
| π’ Interface | operations: 24, schemas: 31 |
9.58M | 1h 7m 44s | 66% |
| π’ Test | functions: 20 |
2.60M | 9m 31s | 84% |
| π’ Realize | functions: 24 |
2.22M | 17m 29s | 91% |
- Source Code:
deepseek/deepseek-v3.1-terminus-exacto/bbs - Score: 30
- Elapsed Time: 19m 42s
- Token Usage: 1.79M
- Function Calling Success Rate: 91.11%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 3, documents: 11 |
562.7K | 5m 43s | 95% |
| π’ Prisma | namespaces: 8, models: 35 |
1.23M | 13m 59s | 85% |
| βͺ Interface | ||||
| βͺ Test | ||||
| βͺ Realize |
- Source Code:
deepseek/deepseek-v3.1-terminus-exacto/reddit - Score: 30
- Elapsed Time: 29m 34s
- Token Usage: 2.43M
- Function Calling Success Rate: 85.19%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 4, documents: 11 |
621.7K | 6m 8s | 88% |
| π’ Prisma | namespaces: 10, models: 46 |
1.80M | 23m 26s | 82% |
| βͺ Interface | ||||
| βͺ Test | ||||
| βͺ Realize |
- Source Code:
deepseek/deepseek-v3.1-terminus-exacto/shopping - Score: 30
- Elapsed Time: 35m 21s
- Token Usage: 2.34M
- Function Calling Success Rate: 79.63%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 3, documents: 11 |
651.7K | 7m 34s | 92% |
| π’ Prisma | namespaces: 9, models: 40 |
1.69M | 27m 46s | 68% |
| βͺ Interface | ||||
| βͺ Test | ||||
| βͺ Realize |
| Project | Score | Analyze | Prisma | Interface | Test | Realize |
|---|---|---|---|---|---|---|
todo |
45 | π’ | π’ | π‘ | β | β |
bbs |
30 | π’ | π’ | β | β | β |
reddit |
30 | π’ | π’ | β | β | β |
shopping |
30 | π’ | π’ | β | β | β |
- Source Code:
qwen/qwen3-30b-a3b-thinking-2507/todo - Score: 45
- Elapsed Time: 1h 17m 21s
- Token Usage: 7.13M
- Function Calling Success Rate: 60.28%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 1, documents: 10 |
461.2K | 9m 6s | 91% |
| π’ Prisma | namespaces: 1, models: 1 |
105.1K | 2m 4s | 100% |
| π΄ Interface | operations: 8, schemas: 11 |
6.57M | 1h 6m 10s | 53% |
| βͺ Test | ||||
| βͺ Realize |
- Source Code:
qwen/qwen3-30b-a3b-thinking-2507/bbs - Score: 30
- Elapsed Time: 11m 14s
- Token Usage: 550.8K
- Function Calling Success Rate: 75.00%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 1, documents: 11 |
442.2K | 8m 14s | 75% |
| π’ Prisma | namespaces: 1, models: 2 |
108.6K | 2m 59s | NaN% |
| βͺ Interface | ||||
| βͺ Test | ||||
| βͺ Realize |
- Source Code:
qwen/qwen3-30b-a3b-thinking-2507/reddit - Score: 30
- Elapsed Time: 17m 7s
- Token Usage: 1.48M
- Function Calling Success Rate: 78.85%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 3, documents: 10 |
479.0K | 8m 30s | 80% |
| π’ Prisma | namespaces: 9, models: 24 |
1.00M | 8m 36s | 76% |
| βͺ Interface | ||||
| βͺ Test | ||||
| βͺ Realize |
- Source Code:
qwen/qwen3-30b-a3b-thinking-2507/shopping - Score: 30
- Elapsed Time: 43m 40s
- Token Usage: 1.79M
- Function Calling Success Rate: NaN%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 3, documents: 11 |
582.5K | 7m 4s | NaN% |
| π’ Prisma | namespaces: 8, models: 37 |
1.21M | 36m 36s | NaN% |
| βͺ Interface | ||||
| βͺ Test | ||||
| βͺ Realize |
| Project | Score | Analyze | Prisma | Interface | Test | Realize |
|---|---|---|---|---|---|---|
todo |
30 | π’ | π’ | β | β | β |
bbs |
10 | π’ | β | β | β | β |
reddit |
10 | π’ | β | β | β | β |
shopping |
0 | β | β | β | β | β |
- Source Code:
meta-llama/llama-4-maverick/todo - Score: 30
- Elapsed Time: 2m 7s
- Token Usage: 235.5K
- Function Calling Success Rate: 82.35%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 1, documents: 4 |
134.6K | 11s | 100% |
| π’ Prisma | namespaces: 3, models: 4 |
100.9K | 1m 56s | 62% |
| βͺ Interface | ||||
| βͺ Test | ||||
| βͺ Realize |
- Source Code:
meta-llama/llama-4-maverick/bbs - Score: 10
- Elapsed Time: 25s
- Token Usage: 263.8K
- Function Calling Success Rate: 100.00%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 3, documents: 8 |
263.8K | 25s | 100% |
| βͺ Prisma | ||||
| βͺ Interface | ||||
| βͺ Test | ||||
| βͺ Realize |
- Source Code:
meta-llama/llama-4-maverick/reddit - Score: 10
- Elapsed Time: 31s
- Token Usage: 273.1K
- Function Calling Success Rate: 100.00%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 4, documents: 8 |
273.1K | 31s | 100% |
| βͺ Prisma | ||||
| βͺ Interface | ||||
| βͺ Test | ||||
| βͺ Realize |
| Project | Score | Analyze | Prisma | Interface | Test | Realize |
|---|---|---|---|---|---|---|
todo |
10 | π’ | β | β | β | β |
bbs |
10 | π’ | β | β | β | β |
reddit |
10 | π’ | β | β | β | β |
shopping |
10 | π’ | β | β | β | β |
- Source Code:
google/gemini-2.5-pro/todo - Score: 10
- Elapsed Time: 3m 40s
- Token Usage: 524.1K
- Function Calling Success Rate: 100.00%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 1, documents: 11 |
524.1K | 3m 40s | 100% |
| βͺ Prisma | ||||
| βͺ Interface | ||||
| βͺ Test | ||||
| βͺ Realize |
- Source Code:
google/gemini-2.5-pro/bbs - Score: 10
- Elapsed Time: 3m 46s
- Token Usage: 388.2K
- Function Calling Success Rate: 100.00%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 3, documents: 8 |
388.2K | 3m 46s | 100% |
| βͺ Prisma | ||||
| βͺ Interface | ||||
| βͺ Test | ||||
| βͺ Realize |
- Source Code:
google/gemini-2.5-pro/reddit - Score: 10
- Elapsed Time: 3m 37s
- Token Usage: 691.6K
- Function Calling Success Rate: 100.00%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 2, documents: 12 |
691.6K | 3m 37s | 100% |
| βͺ Prisma | ||||
| βͺ Interface | ||||
| βͺ Test | ||||
| βͺ Realize |
- Source Code:
google/gemini-2.5-pro/shopping - Score: 10
- Elapsed Time: 6m 4s
- Token Usage: 975.3K
- Function Calling Success Rate: 93.94%
| Phase | Generated | Token Usage | Elapsed Time | FCSR |
|---|---|---|---|---|
| π’ Analyze | actors: 3, documents: 15 |
975.3K | 6m 4s | 93% |
| βͺ Prisma | ||||
| βͺ Interface | ||||
| βͺ Test | ||||
| βͺ Realize |