
Holo3.1 Improves Local Inference for Computer Use Agents

Holo3.1, an update to the Holo3 computer-use model, focuses on robustness and on local inference across a range of environments. The release introduces quantized checkpoints (FP8, Q4 GGUF, NVFP4) intended to run agents efficiently on consumer hardware, in model sizes from 0.8B to 35B-A3B. It also extends the model to mobile environments and supports function-calling protocols, which makes it easier to integrate with third-party agent stacks.
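To give a rough sense of why quantization matters for consumer hardware, the sketch below estimates the memory needed for model weights alone at different precisions. It is a back-of-the-envelope calculation, not a measurement of the actual Holo3.1 checkpoints. The assumptions are: nominal bit widths (real Q4 GGUF and NVFP4 files also store scaling factors, so they come out somewhat larger), a BF16 baseline added only for comparison, and a reading of "35B-A3B" as roughly 35B total parameters that all have to be stored. The function name `weight_gib` is hypothetical.

```python
# Nominal bits per weight (assumption). Real quantized files also carry
# scales and metadata, so actual sizes run somewhat larger than this.
NOMINAL_BITS = {"BF16": 16, "FP8": 8, "4-bit": 4}


def weight_gib(total_params: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in GiB."""
    return total_params * bits_per_weight / 8 / 2**30


# Parameter counts are assumed from the size names in the release summary.
for name, params in [("0.8B", 0.8e9), ("35B-A3B", 35e9)]:
    for fmt, bits in NOMINAL_BITS.items():
        print(f"{name:8s} {fmt:6s} ~{weight_gib(params, bits):.1f} GiB")
```

The estimate leaves out activations, the KV cache, and runtime overhead, so it is a lower bound on what a device would actually need.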

Why it matters

Holo3.1 makes capable computer-use agents easier to deploy on edge devices and consumer hardware. Running inference locally keeps data on the device, which can improve privacy, cut latency, and lower operating costs for developers and enterprises.
