According to 1M AI News monitoring, DeepSeek has recently opened up 17 recruitment positions, shifting its core R&D focus from fundamental model research to Agent productization. The three exclusive Agent positions cover algorithm research, data evaluation, and end-to-end infrastructure: the Algorithm Researcher focuses on the application of reinforcement learning in large-scale model alignment (RLHF/RLAIF, process rewards, preference learning, among other directions); the Data Evaluation Expert is responsible for constructing evaluation datasets, designing test cases for Agent capabilities such as planning, tool invocation, multi-round interaction, and long-term memory; the Infrastructure Engineer is in charge of setting up the Agent runtime base, including integrating external tools into internal reinforcement learning infrastructure and establishing an evaluation platform.
Two notable signals are present in the job requirements. Several positions explicitly state in the bonus section that "heavy use of Claude Code, Cursor, Copilot, and other AI coding tools" is a priority. An uncommon description appears in the Full-stack Development Engineer job responsibilities: "As a heavy user of Vibe Coding, continuously explore the innovative application of model capabilities in the product," with the core focus of the same position being the construction of a "next-generation container scheduling and isolation platform supporting massive AI Agent operations."
The Model Strategy Product Manager position has separately established an Agent direction, requiring candidates to be familiar with Agent products such as Claude Code, OpenClaw, Manus, insight into high-value application scenarios (including personal assistants, deep research, workflow automation, multi-modal device control), and to lead Agent evaluation system and training data scheme design. Contrasting with January this year when the core open positions were focused on general directions like "Deep Learning Researcher-AGI," this recruitment has significantly shifted its emphasis towards Agent productization.
