OpenAI employees have publicly accused xAI of misleading benchmark results for its latest AI model, Grok3

2025-02-23 02:23:53

Recently openai a staff member publicly accused elon musk flag xai company desk3 区块链加密新闻

Recently, an OpenAI employee publicly criticized Elon Musk's xAI company, saying that the benchmark results of its latest AI model Grok3 were misleading. In response, xAI co-founder Igor Babushkin insisted that the company was not improper. The xAI chart shows that two versions of Grok3 - Grok3 Reasoning Beta and Grok3 mini Reasoning - outperformed OpenAI's current strongest available model, o3-mini-high, on AIME 2025. However, OpenAI employees were quick to point out on the X platform that the xAI chart does not include the o3-mini-high's AIME 2025 score under "cons@64" conditions. Babushkin argued on the X platform that OpenAI had published similar misleading benchmark charts in the past, even though they were used to compare the performance of its own models.

Web3 桌面交易工具

了解币圈信息快人一步

OpenAI员工公开指责xAI最新AI模型Grok3的基准测试结果具有误导性

稳定币总市值突破2265亿美元，过去一周增长0.54%

7x24 快讯

关于DESK3

关于我们服务条款隐私保护免责声明

产品

资讯 Swap 跨链桥区块链云图铭文钱包热门工具

服务

帮助中心公告商业支持

Sociality

OpenAI employees have publicly accused xAI of misleading benchmark results for its latest AI model, Grok3

7x24 快讯

热门资讯

相关推荐