gpt.buzz

DeepSeek-V4-Flash

DeepSeek · released 2026-04-22 · open source · updated 24 days ago

Smaller, faster sibling of DeepSeek-V4-Pro. It keeps the same 1M-token context window but uses a much lighter mixture-of-experts (MoE) design: 284B total parameters, 13B active.

Specifications

Context window: 1,000,000 tokens
Parameters: 284B (13B active)
Modality: text
License: MIT
Family: DeepSeek
Release date: 2026-04-22
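
To put the parameter figures above in proportion, the following minimal sketch computes the share of weights that is active and a rough weight-storage footprint. The parameter counts come from the specifications; the bytes-per-parameter value is an assumption chosen for illustration (the listing does not state a weight precision), and the helper names are hypothetical.

```python
# Figures taken from the specifications above.
TOTAL_PARAMS = 284e9   # 284B total parameters
ACTIVE_PARAMS = 13e9   # 13B active parameters


def active_fraction(active: float, total: float) -> float:
    """Share of the model's parameters that is active in a MoE model."""
    return active / total


def weight_memory_gb(params: float, bytes_per_param: float) -> float:
    """Approximate weight storage in GB (10^9 bytes).

    Ignores KV cache, activations, and runtime overhead.
    """
    return params * bytes_per_param / 1e9


fraction = active_fraction(ACTIVE_PARAMS, TOTAL_PARAMS)
print(f"active share: {fraction:.1%}")

# ASSUMPTION: 1 byte per parameter (8-bit weights); not stated in the listing.
print(f"weights at 1 byte/param: {weight_memory_gb(TOTAL_PARAMS, 1):.0f} GB")
```

Under these assumptions, roughly 4.6% of the weights are active, while all 284B parameters still have to be stored somewhere to serve the model.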

Provider status

DeepSeek API: All Systems Operational
Last incident: [Resolved] DeepSeek Web/API Service Not Available (about 2 months ago)
30-day uptime: 100.00%

Timeline

  1. Released

    Initial public availability.

  2. Pricing changes, lineage updates, and new benchmark results appear here as they happen. See the releases feed for the latest vendor activity.

API pricing

No API pricing recorded yet for DeepSeek-V4-Flash.

Benchmarks

No benchmark scores recorded yet for DeepSeek-V4-Flash. We surface official vendor + Artificial Analysis numbers as soon as a model ships them.

Infrastructure context

Compute, silicon, and capex events that shape DeepSeek-V4-Flash's economics.

  • Compute cluster

    DeepSeek operates a ~50k H800 GPU fleet despite export controls

    DeepSeek's training fleet, believed to comprise ~50,000 H800 GPUs at peak, was assembled before the October 2023 extension of US export controls. The DeepSeek-V4-Pro training run reportedly used fewer than 8M GPU-hours in total, less than 10% of GPT-5's estimated budget, relying on the hybrid CSA+HCA attention scheme to reduce FLOPs.

    Accelerators: 50k · H800
    Location: Hangzhou, China
    SemiAnalysis estimate
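
For a sense of scale, the reported figures for the DeepSeek-V4-Pro run can be turned into a rough wall-clock bound. This is a back-of-the-envelope sketch, not a reconstruction of the actual schedule: it takes the "<8M GPU-hours" and "~50,000 GPUs" estimates at face value, and the `utilization` parameter (the share of the fleet assumed to be dedicated to the run) is a hypothetical knob that does not appear in the source.

```python
GPU_HOURS_UPPER_BOUND = 8_000_000  # "<8M GPU-hours" reported for DeepSeek-V4-Pro
FLEET_SIZE = 50_000                # "~50,000 H800 GPUs at peak" (estimate)


def wall_clock_hours(gpu_hours: float, gpus: int, utilization: float = 1.0) -> float:
    """Wall-clock hours needed to spend `gpu_hours` on `gpus` accelerators.

    `utilization` is the assumed share of the fleet dedicated to the run
    (hypothetical; 1.0 means every GPU is busy for the whole run).
    """
    if not 0 < utilization <= 1:
        raise ValueError("utilization must be in (0, 1]")
    return gpu_hours / (gpus * utilization)


hours = wall_clock_hours(GPU_HOURS_UPPER_BOUND, FLEET_SIZE)
print(f"full fleet: {hours:.0f} hours (~{hours / 24:.1f} days)")

half = wall_clock_hours(GPU_HOURS_UPPER_BOUND, FLEET_SIZE, utilization=0.5)
print(f"half fleet: {half:.0f} hours (~{half / 24:.1f} days)")
```

With the whole fleet assumed busy, 8M GPU-hours corresponds to about 160 hours (under a week); since the reported total is below 8M, this is an upper bound for that scenario, and real runs rarely occupy an entire fleet.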

Related news

No tagged articles yet. The aggregator surfaces mentions every 15 minutes.