Large Language Models LLMs in Chatbots

"Humanity's Last Exam" Reveals How Accurate AI Actually Is. Chatbots Might Want To Look Away Now.

In updated tests published to the Humanity's Last Exam website, Gemini's 3.1 Pro model achieved 45.9 percent accuracy, with a ...

Earth.com

AI can feign moral reasoning by repeating online language patterns

Scientists warn that current AI tests reward polite responses rather than real moral reasoning in large language models.

24d

Genesys Shifts Enterprise CX Strategy From LLMs To Large Action Models

CX software provider Genesys unveiled Genesys Cloud Agentic Virtual Agent, positioning it as the industry’s first agent built ...

Nature

Hey ChatGPT, write me a fictional paper: these LLMs are willing to commit academic fraud

Mainstream chatbots presented varying levels of resistance to deliberate requests for fabrication, study finds.

Analytics Insight

Master Large Language Models in 2026: 10 Must-Vist GitHub Repositories

Overview: Modern Large Language Models are faster and more efficient thanks to open-source innovation.GitHub repositories remain the main hub for building, test ...

Diginomica

Tencent Summit – why general Large Language Models and chatbots "no longer meet business needs" for enterprise Artificial Intelligence

The pizazz feels welcoming and familiar: the expectant crowd filling a hangar-sized convention hall; a stage the width of a football field; the pounding music and widescreen visuals; the discreet ...

Communications of the ACM

Measuring What Matters in Large Language Model Performance

As large language models (LLMs) gain momentum worldwide, there’s a growing need for reliable ways to measure their performance. Benchmarks that evaluate LLM outputs allow developers to track ...

Digital Information World

Chatbots overemphasize sociodemographic stereotypes, researchers report

Study shows large language models flatten minority identities using culturally coded, stereotypical language patterns.

14don MSN

Is AI Discriminatory? MIT Study Finds Chatbots Refuse To Answer Less Educated, Non-US Users

MIT research reveals that leading AI chatbots provide lower-quality responses and refuse more queries from less educated non-native English speakers.

Opinion

WLRNOpinion

A college student's perspective on using AI in class

Instead of banning AI, why don't schools teach students to use it critically? College freshman Maximilian Milovidov shares what he has learned in an "AI writing" course at Columbia University.

International Monetary Fund

How Effectively Can Current LLMs Analyze Macrofinancial Issues?

This paper empirically evaluates the ability of current Large Language Models (LLMs) to analyze macrofinancial coverage in IMF Article IV staff reports, using human economists' assessments as a ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results