Gate News message, April 20 — Top AI models excel at solving complex problems like Olympiad mathematics but struggle with routine enterprise work, according to David Meyer of Databricks. Some models may correct an incorrect invoice number instead of flagging it as an error, while coding tools like Claude can also underperform on data engineering tasks.
The gap stems from fundamental differences between enterprise data and the public web text used to train large models. Enterprise data often features vague column labels, numerous blank fields, and codes stored as plain text. In one academic study, an AI model’s F1 score, which balances precision and recall, dropped from 0.94 on public data to 0.07 on enterprise data for a data engineering task. Additionally, large models tend to default to familiar patterns from training; some defaulted to Structured Query Language (SQL) even after receiving instructions and documentation for a company’s proprietary query language.
Smaller open source models tuned with reinforcement learning can handle specific jobs more efficiently at significantly lower training costs than large general-purpose models. Databricks is building smaller AI agents for specific workflows, such as KARL, which uses reinforcement learning for multi-step reasoning with company documents. The industry is shifting from reliance on giant models to hybrid architectures where small efficient models handle routine volume, then escalate only unclear or complex cases to larger, costlier systems.
Databricks recently acquired Quotient AI to help large enterprises run AI agents more reliably. Competition in the AI business now centers on running the full AI lifecycle, including feedback systems for tracking errors and continuously improving models over time, making evaluation and tuning tools increasingly valuable after deployment.
Disclaimer: The information on this page may come from third parties and does not represent the views or opinions of Gate. The content displayed on this page is for reference only and does not constitute any financial, investment, or legal advice. Gate does not guarantee the accuracy or completeness of the information and shall not be liable for any losses arising from the use of this information. Virtual asset investments carry high risks and are subject to significant price volatility. You may lose all of your invested principal. Please fully understand the relevant risks and make prudent decisions based on your own financial situation and risk tolerance. For details, please refer to
Disclaimer.
Related Articles
UAE Announces Shift Toward AI Government Model in the Next Two Years
His Highness Sheikh Mohammed bin Rashid Al Maktoum stated that the goal was for 50% of government sectors to operate through autonomous agentic AI. The transition will also include the training of federal employees to “master AI” and will be overseen by Sheikh Mansour bin Zayed.
Key Takeaways:
Coinpedia5h ago
AI Trading Platform Fere AI Raises $1.3M in Funding Led by Ethereal Ventures
Gate News message, April 25 — Fere AI, an AI-powered digital asset trading platform, announced the completion of a $1.3 million funding round led by Ethereal Ventures, with participation from Galaxy Vision Hill and Kosmos Ventures, according to Globenewswire.
The platform supports cross-chain
GateNews6h ago
Nvidia Deploys OpenAI Codex AI Agent Across Entire Workforce on Blackwell Infrastructure
Gate News message, April 25 — Nvidia has rolled out OpenAI's Codex, an AI agent powered by GPT-5.5, to its entire workforce following a successful trial with approximately 10,000 employees, according to internal communications from CEO Jensen Huang and OpenAI CEO Sam Altman.
Codex is designed to as
GateNews11h ago
AI Coding Startup Cognition in Talks for $25B Valuation Funding Round
Gate News message, April 25 — AI coding startup Cognition is in early talks to raise hundreds of millions of dollars or more at approximately a $25 billion valuation, according to people familiar with the matter. Interest has increased following SpaceX's acquisition of a rival AI coding startup.
Co
GateNews11h ago
AI Trading Agent Platform Fere AI Raises $1.3M, Led by Ethereal Ventures
Gate News message, April 25 — AI-powered digital asset trading agent platform Fere AI announced the completion of a $1.3 million funding round, led by Ethereal Ventures, with Galaxy Vision Hill and Kosmos Ventures participating. The platform supports cross-chain networks including Ethereum,
GateNews12h ago
OpenClaw v2026.4.23 Adds gpt-image-2 Direct OAuth Support, Introduces Forked Context Mode for Sub-agents
Gate News message, April 25 — OpenClaw, an open-source AI agent framework, released v2026.4.23 on April 23, introducing updates across image generation, sub-agent mechanisms, and security hardening.
Image generation enhancements allow gpt-image-2 to be called directly via Codex OAuth without
GateNews12h ago