Search

Edoardo M. Ponti

Publications
FAQ
Resume
AToM ⚛︎
TEAS ☕︎

atom

nvidia/Qwen3-8B-DMS-8x

8x KV cache compression without quality degradation. Ideal for inference-time scaling.

LICENSE: CC-BY-SA

Edoardo M. Ponti, 2026 · Adapted from Alison Presmanes Hill's , the blogdown package, and the Academic theme for Hugo.

Cite