Simple version
The same careful map of how a model works, this time for a model so large that we had to stream it from a network drive.
In plain terms
Grok-1 has 314 billion parameters and uses a mixture-of-experts (MoE) routing design that sends each input to a different subset of sub-networks, called experts. We ran it from network-attached storage, captured a bounded slice of its routing and attention structure, and ran adversarial checks against the result.
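As a rough illustration of the routing idea described above, here is a minimal top-k routing sketch in plain Python. It is not taken from Grok-1's code or from this artifact. The function names (`route_top_k`, `moe_forward`), the toy experts, the choice of k=2 over 8 experts, and the renormalization of weights over the selected experts are all assumptions made for the example.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def route_top_k(router_logits, k=2):
    """Pick the k highest-scoring experts for one input.

    Returns a list of (expert_index, weight) pairs. Weights are renormalized
    over the selected experts so they sum to 1 (an assumed convention).
    """
    if not 1 <= k <= len(router_logits):
        raise ValueError("k must be between 1 and the number of experts")
    probs = softmax(router_logits)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    return [(i, probs[i] / norm) for i in top]


def moe_forward(x, router_logits, experts, k=2):
    """Combine the outputs of the selected experts, weighted by the router."""
    return sum(w * experts[i](x) for i, w in route_top_k(router_logits, k))


# Hypothetical toy setup: 8 experts, each scaling its input by its own index.
experts = [lambda x, s=s: s * x for s in range(8)]
logits = [0.0, 2.0, 0.0, 0.0, 2.0, 0.0, 0.0, 0.0]

print(route_top_k(logits, k=2))
print(moe_forward(3.0, logits, experts, k=2))
```

Only the selected experts run for a given input, which is why different inputs exercise different parts of the model and why a routing map is worth capturing at all.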
Technical artifact
NAS-backed local execution and a bounded mixed-edge topology (8 bounded edges) for a 314B MoE model, ratified under a page-cache-purge storage contract with adversarial verification.