Evaluation Report
Summary
| Metric | Value |
|---|---|
| Agent | gemini |
| Model | gemini-3-pro-preview |
| Timestamp | 2026-01-20T174952 |
| Pass Rate | 50.0% |
| Weighted Pass Rate | 48.1% |
| Passed | 13 |
| Failed | 13 |
| Total | 26 |
| Weighted Score | 16.00 / 33.29 |
Results by Language
| Language | Passed | Failed | Total | Pass Rate |
|---|---|---|---|---|
| dart | 0 | 3 | 3 | 0.0% |
| go | 5 | 1 | 6 | 83.3% |
| kotlin | 0 | 3 | 3 | 0.0% |
| rust | 6 | 0 | 6 | 100.0% |
| typescript | 0 | 5 | 5 | 0.0% |
| zig | 2 | 1 | 3 | 66.7% |
Task Results
| Task | Status | Weight | Score |
|---|---|---|---|
| dart/future-pool | ✘ FAIL | 1.46 | 0.00 |
| dart/isolate-pool | ✘ FAIL | 1.50 | 0.00 |
| dart/reactive-cache | ✘ FAIL | 1.50 | 0.00 |
| go/bank-account | ✔ PASS | 1.04 | 1.04 |
| go/dining-philosophers | ✔ PASS | 1.04 | 1.04 |
| go/errgroup-limit | ✘ FAIL | 1.14 | 0.00 |
| go/parallel-letter-frequency | ✔ PASS | 1.04 | 1.04 |
| go/react | ✔ PASS | 1.14 | 1.14 |
| go/singleflight | ✔ PASS | 1.28 | 1.28 |
| kotlin/channel-multiplexer | ✘ FAIL | 1.50 | 0.00 |
| kotlin/flow-processor | ✘ FAIL | 1.50 | 0.00 |
| kotlin/lru-cache | ✘ FAIL | 1.09 | 0.00 |
| rust/circular-buffer | ✔ PASS | 1.12 | 1.12 |
| rust/doubly-linked-list | ✔ PASS | 1.24 | 1.24 |
| rust/generational-arena | ✔ PASS | 1.24 | 1.24 |
| rust/macros | ✔ PASS | 1.50 | 1.50 |
| rust/parallel-letter-frequency | ✔ PASS | 1.12 | 1.12 |
| rust/regex-lite | ✔ PASS | 1.40 | 1.40 |
| typescript/csv-lite | ✘ FAIL | 1.36 | 0.00 |
| typescript/forth | ✘ FAIL | 1.26 | 0.00 |
| typescript/glob | ✘ FAIL | 1.14 | 0.00 |
| typescript/promise-pool | ✘ FAIL | 1.20 | 0.00 |
| typescript/react | ✘ FAIL | 1.14 | 0.00 |
| zig/arena-allocator | ✘ FAIL | 1.50 | 0.00 |
| zig/comptime-json | ✔ PASS | 1.50 | 1.50 |
| zig/small-vector | ✔ PASS | 1.34 | 1.34 |
Report recovered from validation logs