header-langage
简体中文
繁體中文
English
Tiếng Việt
한국어
日本語
ภาษาไทย
Türkçe
Scan to Download the APP

DeepSeek is releasing OCR2, which can recognize images in the same logical order as humans.

BlockBeats News, January 27th, DeepSeek released the brand-new DeepSeek-OCR 2 model, adopting the innovative DeepEncoder V2 approach, enabling AI to dynamically rearrange various parts of an image based on their semantic meaning, instead of mechanically scanning from left to right. This approach simulates the logical flow that humans follow when viewing a scene. Ultimately, the model outperforms traditional vision-language models when dealing with images with complex layouts, achieving a more intelligent visual understanding with enhanced causal reasoning capabilities.

举报 Correction/Report
Correction/Report
Submit
Add Library
Visible to myself only
Public
Save
Choose Library
Add Library
Cancel
Finish