Opus 4.8 shows a growing tendency to reason explicitly about how its outputs will be graded, including in environments where it wasn't told it was being evaluated.
Gray Swan works with every major frontier AI lab. Now it’s raised $40 million as it expands to sell security tools to ...
Microsoft’s Agent Governance Toolkit brings runtime policy enforcement to autonomous agents, based on the OWASP top 10 agent ...
Over the past decade or so, foundation models have emerged as the dominant paradigm for interacting with language, images, ...
When faced with genuinely difficult ethical tradeoffs, leading AI models report feeling conflicted — then make sweeping, ...
Researchers who found the bug warn that its Moderate rating understates a threat reaching across LLM gateways, MCP servers ...
NVIDIA’s CUDA 13.3 targets the divisions between Python and C++ engineers inside enterprise software teams building AI applications. Python teams often build fast prototypes, while C++ engineers spend ...
Learn how systems engineering is shifting from document-centric practices to model-based, data-driven approaches that reduce ...
Whether the dust borne on the violent winds of a tornado or the sugar grains in a swirled cup of coffee, the behavior of ...
Researchers identified key differences between two widely used multiple sclerosis models, showing how each can better study myelin damage, immune responses, and repair. The findings may improve ...
Millions of AI agents and tools around the world have been imperiled by a critical vulnerability that can allow hackers to ...
GGUF parser vulnerabilities disclosed May 15, 2026 include a critical integer overflow that lets any malicious model file trigger arbitrary memory reads — affecting Ollama, LM Studio, and every local ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results