Gemini CLI v0.24.5

Gemini 3 Pro Preview 2026-01-20
Score
16.00
FAILED
Pass Rate: 50.0%
rust: 100.0% (6 tasks)
rus
go: 83.3% (6 tasks)
go
zig: 66.7% (3 tasks)
zig
dart: 0.0% (3 tasks)
dar
kotlin: 0.0% (3 tasks)
kot
typescript: 0.0% (5 tasks)
typ

Evaluation Report

Summary

Metric Value
Agent gemini
Model gemini-3-pro-preview
Timestamp 2026-01-20T174952
Pass Rate 50.0%
Weighted Pass Rate 48.1%
Passed 13
Failed 13
Total 26
Weighted Score 16.00 / 33.29

Results by Language

Language Passed Failed Total Pass Rate
dart 0 3 3 0.0%
go 5 1 6 83.3%
kotlin 0 3 3 0.0%
rust 6 0 6 100.0%
typescript 0 5 5 0.0%
zig 2 1 3 66.7%

Task Results

Task Status Weight Score
dart/future-pool ✘ FAIL 1.46 0.00
dart/isolate-pool ✘ FAIL 1.50 0.00
dart/reactive-cache ✘ FAIL 1.50 0.00
go/bank-account ✔ PASS 1.04 1.04
go/dining-philosophers ✔ PASS 1.04 1.04
go/errgroup-limit ✘ FAIL 1.14 0.00
go/parallel-letter-frequency ✔ PASS 1.04 1.04
go/react ✔ PASS 1.14 1.14
go/singleflight ✔ PASS 1.28 1.28
kotlin/channel-multiplexer ✘ FAIL 1.50 0.00
kotlin/flow-processor ✘ FAIL 1.50 0.00
kotlin/lru-cache ✘ FAIL 1.09 0.00
rust/circular-buffer ✔ PASS 1.12 1.12
rust/doubly-linked-list ✔ PASS 1.24 1.24
rust/generational-arena ✔ PASS 1.24 1.24
rust/macros ✔ PASS 1.50 1.50
rust/parallel-letter-frequency ✔ PASS 1.12 1.12
rust/regex-lite ✔ PASS 1.40 1.40
typescript/csv-lite ✘ FAIL 1.36 0.00
typescript/forth ✘ FAIL 1.26 0.00
typescript/glob ✘ FAIL 1.14 0.00
typescript/promise-pool ✘ FAIL 1.20 0.00
typescript/react ✘ FAIL 1.14 0.00
zig/arena-allocator ✘ FAIL 1.50 0.00
zig/comptime-json ✔ PASS 1.50 1.50
zig/small-vector ✔ PASS 1.34 1.34

Report recovered from validation logs

Run Statistics

Tasks Passed 13 / 26
Duration 0m
Cost (Est) -
By Language
dart 0%
go 83%
kotlin 0%
rust 100%
typescript 0%
zig 67%