
Trained From Scratch on 128 A100 GPUs: ByteDance Releases 3B Unified Multimodal Model Lance

According to DolphinBeat monitoring, ByteDance (ByteDance Research) has open-sourced Lance, a native unified multimodal large model. The lightweight model has only 3B active parameters and handles image and video understanding, generation, and editing within a single framework.

Mainstream unified models currently rely heavily on scaling up parameters or adopting the ViT architecture. Lance instead explores a low-compute path built on multi-task collaboration. The research team trained the model entirely from scratch and kept the compute budget for the whole training run to 128 A100 GPUs.

To reduce conflicts between modalities and tasks, Lance enforces two hard separations in its architecture:
- It employs a dual-stream Mixture-of-Experts (MoE) architecture to handle interleaved multimodal sequences, sharing the underlying context while decoupling the computation paths for understanding and generation.
- It introduces modality-aware rotary position encoding, which directly reduces signal interference between visual tokens from the heterogeneous image and video modalities.
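The report does not describe how either mechanism is implemented, so the following is only a toy sketch of the two general ideas, not Lance's actual design. It assumes, purely for illustration, that modality awareness is achieved by adding a per-modality offset to the position before a standard rotary rotation, and that the two streams are plain feed-forward functions applied after a shared context step. All names (`MODALITY_OFFSET`, `dual_stream_layer`, and so on) are hypothetical.

```python
import math

# Hypothetical per-modality position offsets (an assumption, not from the report).
MODALITY_OFFSET = {"text": 0, "image": 1000, "video": 2000}


def rotary(vec, position, base=10000.0):
    """Apply standard rotary position encoding to an even-length vector."""
    dim = len(vec)
    out = []
    for i in range(0, dim, 2):
        # Each consecutive pair (i, i + 1) is rotated by its own frequency.
        theta = position * base ** (-i / dim)
        c, s = math.cos(theta), math.sin(theta)
        x, y = vec[i], vec[i + 1]
        out += [x * c - y * s, x * s + y * c]
    return out


def modality_aware_rotary(vec, position, modality):
    """Shift the position by a modality-specific offset before rotating."""
    return rotary(vec, position + MODALITY_OFFSET[modality])


def understanding_ffn(h):
    """Placeholder for the understanding stream's computation."""
    return [2.0 * x for x in h]


def generation_ffn(h):
    """Placeholder for the generation stream's computation."""
    return [x + 1.0 for x in h]


STREAMS = {"understand": understanding_ffn, "generate": generation_ffn}


def shared_context(tokens):
    """Stand-in for shared attention: the mean of all token vectors."""
    dim = len(tokens[0]["vec"])
    n = len(tokens)
    return [sum(t["vec"][d] for t in tokens) / n for d in range(dim)]


def dual_stream_layer(tokens):
    """Mix every token with the shared context, then route it by stream."""
    ctx = shared_context(tokens)
    outputs = []
    for t in tokens:
        mixed = [v + c for v, c in zip(t["vec"], ctx)]
        outputs.append(STREAMS[t["stream"]](mixed))
    return outputs


if __name__ == "__main__":
    tokens = [
        {"vec": [1.0, 0.0], "stream": "understand"},
        {"vec": [0.0, 1.0], "stream": "generate"},
    ]
    print(dual_stream_layer(tokens))
    print(modality_aware_rotary([1.0, 0.0], 3, "image"))
    print(modality_aware_rotary([1.0, 0.0], 3, "video"))
```

In this sketch every token sees the same context vector, but the understanding and generation tokens never pass through each other's feed-forward function, and an image token and a video token at the same index receive different rotations.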

The tight compute budget has not lowered the performance ceiling. With only 3B active parameters, Lance leads existing open-source unified models on the majority of image and video generation and editing benchmarks. Through multi-task collaboration, it demonstrates a low-cost route that balances generation and semantic understanding at a small parameter scale.

