Anthropic Identifies Three Product-Layer Changes Behind Claude Code Quality Decline, Not Model Issue

Gate News, April 23: Anthropic’s engineering team confirmed that the decline in Claude Code quality reported by users over the past month stemmed from three independent product-layer changes, not from the API or the underlying model. The three problems were fixed on April 7, April 10, and April 20, respectively; the version containing the final fix is v2.1.116.

The first change occurred on March 4, when the team lowered the default reasoning effort level for Claude Code from “high” to “medium” to address occasional extreme latency spikes in Opus 4.6 at high reasoning effort. After widespread user complaints about reduced performance, the team reverted the change on April 7. The default is now “xhigh” for Opus 4.7 and “high” for other models.

The second issue was a bug introduced on March 26. The system was designed to clear old reasoning records once a conversation had been inactive for more than one hour, in order to reduce session recovery costs. However, an implementation flaw caused the clearing to run on every subsequent turn rather than once, so the model progressively lost its prior reasoning context. This showed up as increasing forgetfulness, repeated operations, and abnormal tool invocations. The bug also caused a cache miss on every request, which accelerated users' quota consumption. Two unrelated internal experiments masked the reproduction conditions, stretching the debugging process to more than a week. After fixing the bug on April 10, the team reviewed the problematic code with Opus 4.7 and found that Opus 4.7 could identify the bug while Opus 4.6 could not.
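The difference between clearing once and clearing on every turn can be illustrated with a small simulation. This is only a sketch under stated assumptions: the `Session` class, the `stale` flag, and the `take_turn` and `simulate` functions are hypothetical names invented for illustration, and the real Claude Code implementation has not been published in this report.

```python
from dataclasses import dataclass, field

IDLE_THRESHOLD_S = 3600  # one hour of inactivity, as described in the report


@dataclass
class Session:
    # Hypothetical session model; not Anthropic's actual data structure.
    reasoning: list[str] = field(default_factory=list)
    last_active: float = 0.0
    stale: bool = False  # set when an idle gap over the threshold is seen


def take_turn(session: Session, now: float, thought: str, *, buggy: bool) -> None:
    """Process one turn, clearing old reasoning if the session went idle."""
    if now - session.last_active > IDLE_THRESHOLD_S:
        session.stale = True
    if session.stale:
        session.reasoning.clear()
        if not buggy:
            # Intended behavior: clear once, then resume normal accumulation.
            session.stale = False
        # Assumed bug shape: the flag is never reset, so every later
        # turn clears the reasoning history again.
    session.reasoning.append(thought)
    session.last_active = now


def simulate(buggy: bool) -> list[str]:
    """Two turns, a two-hour pause, then three more turns one minute apart."""
    s = Session()
    resume = 60 + 2 * 3600
    for now, thought in [(0, "t0"), (60, "t1"), (resume, "t2"),
                         (resume + 60, "t3"), (resume + 120, "t4")]:
        take_turn(s, now, thought, buggy=buggy)
    return s.reasoning


if __name__ == "__main__":
    print("intended:", simulate(buggy=False))
    print("buggy:   ", simulate(buggy=True))
```

In the intended path the session keeps everything produced after the resume, while in the buggy path only the latest turn survives. That matches the progressive loss of context the report describes. The sketch does not model prompt caching, which the report says was also affected.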

The third change launched on April 16 alongside Opus 4.7. The team added instructions to the system prompt to reduce redundant output. Several weeks of internal testing showed no regression, but after launch the new instructions interacted with other prompts and degraded coding quality. Extended evaluation revealed a 3% performance drop in both Opus 4.6 and Opus 4.7, leading to a rollback on April 20.

These three changes affected different user groups at different times, and their combined effect produced a widespread but inconsistent decline in quality, which complicated diagnosis. Anthropic stated that it will now require more internal employees to use the same public build as users, run full model evaluation suites for every system prompt modification, and implement staged rollout periods. As compensation, Anthropic has reset usage quotas for all subscription users.

