According to Dynamic Beating monitoring, Prime Intellect has announced the open-sourcing of the intelligent agent training environment general-agent, which is a self-evolving fully synthetic environment. The core of this release is to set task generation as a two-player game: where a synthesizer and a solver take turns in competition. It has automatically constructed a large state database containing 4504 tasks and over 8000 unique tools.
The framework starts from simple seed tasks and, through 9 strategies such as condition constraints, noise instructions, and cross-entity coupling, divides tasks into five difficulty levels from t0 to t4. The synthesizer is responsible for designing tasks with a database, interactive tools, and verification functions, while the solver attempts to complete them. Only tasks with pass rates within a specific difficulty range are retained, with the most difficult levels serving as seeds for the next wave of evolution.
Official tests have shown that by fine-tuning a 30B parameter model with just over 4400 trajectories synthesized in this environment, the tool invocation accuracy in the BFCL benchmark test has increased from 18.9% to 52.3%.
This mechanism allows the model to break free from relying on manually annotated static datasets. Through direct gameplay between models, the system can continuously generate difficulty-controlled training corpora with semantic validation.
