Replaces the single global refcount with per-thread reference-counting tables, eliminating atomic contention. A deferred deallocation queue handles cross-thread object cleanup safely. ThreadState tracks per-thread objects, avoiding GIL serialization. Reported 3x speedup on CPU-bound multithreaded workloads. Install via `uv python install 3.13t`.
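
A minimal sketch for checking whether an interpreter actually runs without the GIL and for timing a CPU-bound workload across threads. The helper names (`gil_enabled`, `free_threaded_build`, `count_primes`, `timed_run`) and the prime-counting workload are illustrative assumptions, not part of the summary above. `sys._is_gil_enabled()` only exists on Python 3.13 and later, so the sketch falls back to assuming a GIL on older versions. It makes no claim about what speedup you will see.

```python
import sys
import sysconfig
import time
from concurrent.futures import ThreadPoolExecutor


def gil_enabled() -> bool:
    """Return True if the GIL is active in the running interpreter."""
    # sys._is_gil_enabled() was added in Python 3.13; older versions always have a GIL.
    check = getattr(sys, "_is_gil_enabled", None)
    return True if check is None else bool(check())


def free_threaded_build() -> bool:
    """Return True if this interpreter was compiled with free-threading support."""
    return bool(sysconfig.get_config_var("Py_GIL_DISABLED"))


def count_primes(limit: int) -> int:
    """Hypothetical CPU-bound task: count primes below `limit` by trial division."""
    count = 0
    for n in range(2, limit):
        if all(n % d for d in range(2, int(n ** 0.5) + 1)):
            count += 1
    return count


def timed_run(workers: int, tasks: int, limit: int) -> tuple[float, list[int]]:
    """Run `tasks` copies of count_primes on `workers` threads; return (seconds, results)."""
    start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = list(pool.map(count_primes, [limit] * tasks))
    return time.perf_counter() - start, results


if __name__ == "__main__":
    print(f"free-threaded build: {free_threaded_build()}, GIL enabled: {gil_enabled()}")
    serial, _ = timed_run(workers=1, tasks=4, limit=50_000)
    threaded, _ = timed_run(workers=4, tasks=4, limit=50_000)
    print(f"1 worker: {serial:.2f}s, 4 workers: {threaded:.2f}s")
```

On a standard (GIL) build the two timings should be roughly equal, since only one thread executes Python bytecode at a time. On a free-threaded build with the GIL disabled, the four-worker run can use multiple cores, and the gap depends on the machine and workload.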


