According to monitoring by MetaAI Watch, Epoch AI has released a new analysis of its Domain-specific Capability Index (domain-specific ECI), showing that Anthropic's Claude models have consistently been stronger in coding and weaker in mathematics relative to their overall capability. The latest data, however, indicates that this skew is rapidly narrowing.
Across previous model generations, Claude has consistently scored higher on the software-engineering benchmark (SWE-ECI) than on its overall ECI, while a persistent gap separated its mathematics benchmark score (Math-ECI) from the overall figure. The newly released Opus 4.6 and 4.7 models narrow the gap between mathematics and overall scores to within 1 point, closing the long-standing shortfall.
Because ECI is computed by comparing models' relative performance against one another, it directly reflects how difficult specific tasks are on average for AI systems, rather than for humans.
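To make the idea of a model-relative index concrete, the sketch below shows one simplified way such a measure could be computed. The benchmark names, scores, and normalization scheme here are illustrative assumptions only, not Epoch AI's actual ECI methodology: task difficulty is taken as the score deficit averaged across models, and a model's domain index is its score relative to the cross-model mean.

```python
# Illustrative sketch of a simplified relative capability index.
# NOTE: the benchmark names, scores, and formulas below are assumptions
# for illustration, not Epoch AI's actual ECI calculation.

from statistics import mean

# Hypothetical raw benchmark scores (fraction of tasks solved).
scores = {
    "model_a": {"swe": 0.62, "math": 0.48},
    "model_b": {"swe": 0.55, "math": 0.71},
    "model_c": {"swe": 0.40, "math": 0.52},
}

def task_difficulty(all_scores, task):
    """One minus the mean score across models: a value closer to 1
    marks a task that is harder for AI in general, regardless of
    how hard humans find it."""
    return 1.0 - mean(m[task] for m in all_scores.values())

def relative_index(all_scores, model, task):
    """A model's score on a task divided by the cross-model mean,
    so the index reflects standing among models, not raw accuracy.
    Values above 1 mean the model is above the model average."""
    cross_model_mean = mean(m[task] for m in all_scores.values())
    return all_scores[model][task] / cross_model_mean

print(task_difficulty(scores, "math"))
print(relative_index(scores, "model_b", "math"))
```

Under a scheme like this, a model can post a high relative index on coding while its raw accuracy stays modest, which is why a domain index can diverge from the overall score in the way the article describes for Claude.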
