News

Scientists Develop Tool to Benchmark Language Models as Agents

A team of nearly two dozen researchers from Tsinghua University, Ohio State University, and the University of California, Berkeley has developed a tool called “AgentBench” to measure the capabilities of large language models (LLMs) as real-world agents. LLMs, such …

The post Scientists Develop Tool to Benchmark Language Models as Agents appeared first on isp.page.
