Triton is a language and compiler for writing highly efficient custom deep-learning primitives. It is not officially supported on Windows, but a fork provides pre-built wheels. 3.6.x RTX 50xx (Blackwell), ...
Highlights of Python 3.15, now available in beta, include lazy imports, faster JITs, better error messages, and smarter ...