Debates over AI benchmarks — and how they’re reported by AI labs — are spilling out into public view. This week, an OpenAI employee accused Elon Musk’s AI company, xAI, of publishing misleading ...
XAI Grok 4 Benchmarks are showing it is the leading model. Humanity Last Exam at 35 and 45 for reasoning is a big improvement from about 21 for other top models. If these leaked Grok 4 benchmarks are ...
The artificial intelligence community is in the midst of a heated debate over xAI’s Grok 3 model. OpenAI’s Boris Power has accused xAI of manipulating benchmark evaluations to artificially enhance ...
In just two years, Elon Musk’s xAI has become one of a dozen or so labs capable of developing state-of-the-art AI models. Now xAI is out with its Grok 3 large language model, which beats ...
Elon Musk's xAI has launched its new flagship AI model, Grok-4, which demonstrates leading performance in various academic, reasoning, and coding benchmarks. Elon Musk's xAI today announced Grok 4, ...
Yesterday, just as OpenAI celebrated its 10-year anniversary, the AI company launched GPT-5.2, its latest series of AI models to power ChatGPT. The latest release is allegedly in response to OpenAI’s ...
Grok-3, the latest AI model from xAI, outperformed ChatGPT, Gemini and DeepSeek in a blind AI evaluation, achieving a record-breaking score, according to xAI’s internal analysis. Update (Feb. 19, 8:34 ...
xAI, the artificial intelligence company founded by Elon Musk, has recently unveiled grok-code-fast-1, a groundbreaking agentic coding model designed to revolutionize how developers approach software ...