Productization Status
VKAE-Gemma is moving from challenge proof to controlled customer delivery. The first commercial surface is E4B private preview; other models are sold as explicit porting tracks.
VKAE-Gemma-E4B
Engine Roadmap
Claim-Scoped Performance
Only claim-ready model profiles show before/after throughput. Packaged preview models remain visible, but their raw harness numbers are blocked from product-speed claims until a model-specific VKAE recipe is promoted.
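The gating rule above can be sketched as follows. This is a minimal illustration, not the product's implementation: `ModelProfile`, `claim_ready`, and `public_throughput` are hypothetical names, and the assumption is that each profile carries a single promotion flag alongside its raw harness numbers.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ModelProfile:
    """Hypothetical profile record; field names are illustrative only."""
    name: str
    claim_ready: bool                   # assumed flag: set once a model-specific recipe is promoted
    before_tps: Optional[float] = None  # raw harness throughput, if measured
    after_tps: Optional[float] = None


def public_throughput(profile: ModelProfile) -> Optional[dict]:
    """Return before/after throughput only for claim-ready profiles."""
    if not profile.claim_ready:
        # The model stays visible, but its raw numbers are not published.
        return None
    if profile.before_tps is None or profile.after_tps is None:
        # Claim-ready but incomplete measurements: nothing to show yet.
        return None
    return {"before_tps": profile.before_tps, "after_tps": profile.after_tps}
```

Under these assumptions, a packaged preview profile with `claim_ready=False` returns `None` even when harness numbers exist, so raw measurements never reach a product-speed claim.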
Gemma 4 E4B
Gemma 4 Model Catalog
The acceleration plan tracks the official Google `gemma-4-*` BF16 serving surface: base, instruct, and assistant variants. QAT, GGUF, and mobile-specific packages are excluded from this benchmark scope.
GPU Sizing And Family Speed
Each Gemma 4 family gets a minimum and recommended BF16 serving lane. Measured TPS is shown only when available; otherwise the chart uses a planning speed index until hardware-specific runs replace it.
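The fallback described above, measured TPS when available and a planning speed index otherwise, could look like the sketch below. `SpeedEntry` and `chart_value` are hypothetical names, and returning a label with the value is an assumption made so that a chart can never present an index as a measurement.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class SpeedEntry:
    """Hypothetical chart row for one model family on one GPU lane."""
    family: str
    measured_tps: Optional[float] = None    # filled in by hardware-specific runs
    planning_index: Optional[float] = None  # unitless planning estimate


def chart_value(entry: SpeedEntry) -> Tuple[str, Optional[float]]:
    """Pick the value to plot and say which kind of value it is."""
    if entry.measured_tps is not None:
        # A measurement always replaces the planning index.
        return ("measured_tps", entry.measured_tps)
    if entry.planning_index is not None:
        return ("planning_index", entry.planning_index)
    return ("unavailable", None)
```

Keeping the label next to the value lets the renderer style measured and planned bars differently.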
Full-Family Speed View
Minimum / Recommended GPU
Measured Benchmark Evidence
Measured rows are separated by claim level. E4B uses the challenge reference path as its "before" baseline. The E2B, 12B, 26B-A4B, and 31B package rows are raw harness checks only, not public VKAE before/after speedups.
Kernel-Led Engine Stack
VKAE is positioned as a model-specialized engine: kernels, recipes, serving, and verifier gates are treated as one product surface.
Model Profile
Capture Gemma-specific serving assumptions, context windows, quality guardrails, and hardware lanes.
Kernel Path
Route hot decode work through custom kernels and runtime patches instead of generic defaults.
Serving Recipe
Bind model, memory, batching, speculative-decoding, and verifier settings into repeatable deployment recipes.
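As a rough sketch of what such a recipe might look like, the settings above can be bound into one immutable record with a stable fingerprint, so that two deployments can be checked for an identical configuration. Every field name here (`max_memory_gb`, `max_batch_size`, `verifier_gate`, and so on) is a hypothetical placeholder, not an actual VKAE setting.

```python
import hashlib
import json
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class ServingRecipe:
    """Hypothetical recipe: one frozen record per deployment configuration."""
    model: str
    max_memory_gb: int
    max_batch_size: int
    speculative_decoding: bool
    verifier_gate: str


def recipe_fingerprint(recipe: ServingRecipe) -> str:
    """Hash the recipe's fields in a stable order to identify it."""
    # sort_keys makes the JSON text independent of field declaration order.
    payload = json.dumps(asdict(recipe), sort_keys=True).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()
```

Freezing the dataclass prevents settings from drifting after a recipe is created, and the SHA-256 fingerprint gives a compact identifier to record next to benchmark results.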
Verification
Keep artifact, private benchmark, perplexity (PPL), and dashboard-valid status in separate lanes.
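One way to keep the lanes separate is to store a status per lane and deliberately avoid collapsing them into a single pass/fail flag. The sketch below assumes three statuses per lane; `LaneStatus`, `VerificationRecord`, and the lane identifiers are hypothetical names chosen for illustration.

```python
from dataclasses import dataclass
from enum import Enum


class LaneStatus(Enum):
    PENDING = "pending"
    PASSED = "passed"
    FAILED = "failed"


# Lane identifiers mirror the four lanes named in the text.
LANES = ("artifact", "private_benchmark", "ppl", "dashboard_valid")


@dataclass
class VerificationRecord:
    """Hypothetical record: each lane has its own independent status."""
    artifact: LaneStatus = LaneStatus.PENDING
    private_benchmark: LaneStatus = LaneStatus.PENDING
    ppl: LaneStatus = LaneStatus.PENDING
    dashboard_valid: LaneStatus = LaneStatus.PENDING

    def report(self) -> dict:
        """Return per-lane statuses; no combined verdict is computed."""
        return {lane: getattr(self, lane).value for lane in LANES}
```

Because `report` returns one entry per lane, a passing private benchmark cannot be mistaken for a dashboard-valid result.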
Request Private Preview
Choose the service path that matches your deployment: report, managed endpoint, private Docker evaluation, enterprise on-prem, or custom engine build.
Early access is staged to protect kernel IP while giving customers measurable before/after results.