Anthropic's Claude Opus 4.7 scores 64.3% on SWE-bench Pro, adds multi-agent coordination and 3x vision resolution, at the ...
OpenAI is rolling out a pair of new artificial intelligence models that mimic the process of human reasoning to field more complicated coding questions and visual tasks, the latest in a flurry of ...
3月26日晚间,前阿里千问技术负责人林俊旸在其社交媒体上发表了一篇题为《从推理思维迈向智能体思维》(From "Reasoning" Thinking to "Agentic" Thinking)的文章。这也是他离开阿里巴巴之后,第一次以长文的形式公开分享他对大模型技术演进方向的思考,以及对AI下一 ...
OpenAI has released two AI “reasoning” models that it says are its most capable yet as well as an open-source AI agent that helps computer programmers code, as the company seeks to gain a lead over ...
Grok 4 and its reasoning-focused counterpart, Grok 4 Heavy, arrived with an immediate sense of ambition, offering multimodal AI designed to handle coding, logic, and perception tasks. In the initial ...
Qwen 3.6 Plus is a new advanced AI model built for agentic coding, offering multimodal reasoning and a 1-million-token context window.
A startup called Imandra Inc. says it’s taking artificial intelligence-driven code completion to the next level with the launch of an entirely new and automated reasoning system called CodeLogician.
Anthropic recently unveiled Claude 3.7 Sonnet, an advanced AI model that builds upon its predecessors to deliver improved reasoning and coding capabilities. While not the anticipated Claude 4, this ...
Nvidia's Nemotron-Cascade 2 is a 30B MoE model that activates only 3B parameters at inference time, yet achieved gold medal-level performance at the 2025 IMO, IOI, and ICPC World Finals. Nvidia has ...
Google has announced the release of Gemini 2.5, its latest AI model. The company says this new version is its most intelligent AI to date, with the first release being an experimental version of 2.5 ...
当前正在显示可能无法访问的结果。
隐藏无法访问的结果