header-langage
简体中文
繁體中文
English
Tiếng Việt
한국어
日本語
ภาษาไทย
Türkçe
Scan to Download the APP

Prime Intellect Open Sources Self-Improving AI Agent Environment: Enables AI to Engage in "Self-Play" to Generate Over 8000 Testing Tools

According to Dynamic Beating monitoring, Prime Intellect has announced the open-sourcing of the intelligent agent training environment general-agent, which is a self-evolving fully synthetic environment. The core of this release is to set task generation as a two-player game: where a synthesizer and a solver take turns in competition. It has automatically constructed a large state database containing 4504 tasks and over 8000 unique tools.

The framework starts from simple seed tasks and, through 9 strategies such as condition constraints, noise instructions, and cross-entity coupling, divides tasks into five difficulty levels from t0 to t4. The synthesizer is responsible for designing tasks with a database, interactive tools, and verification functions, while the solver attempts to complete them. Only tasks with pass rates within a specific difficulty range are retained, with the most difficult levels serving as seeds for the next wave of evolution.

Official tests have shown that by fine-tuning a 30B parameter model with just over 4400 trajectories synthesized in this environment, the tool invocation accuracy in the BFCL benchmark test has increased from 18.9% to 52.3%.

This mechanism allows the model to break free from relying on manually annotated static datasets. Through direct gameplay between models, the system can continuously generate difficulty-controlled training corpora with semantic validation.

举报 Correction/Report
Correction/Report
Submit
Add Library
Visible to myself only
Public
Save
Choose Library
Add Library
Cancel
Finish