Skip to main content

Alibaba PPU (Incang 800 Next Generation)#

Product Overview#

Alibaba PPU is a next-generation AI inference chip developed by Pingtouge (T-Head), Alibaba's semiconductor subsidiary. It is the next-generation product of Incang 800. Tape-out completed at TSMC in 2023, originally planned for 2024 mass production, but affected by US sanctions, production was transferred to domestic enterprises. As of September 2025, the official name has not been announced ("PPU" is a temporary project-phase identifier).

PPU outperforms NVIDIA H20 in memory capacity (96GB HBM2e) and PCIe 5.0 interface. Single-card BOM cost is 40% lower than H20, which can drive Alibaba Cloud public cloud inference instance prices down by 50%.

⚠️ Naming Note: PPU is a temporary project-phase identifier. The official commercial model name has not been announced. Industry sources speculate it is the next-generation product of Incang 800.

Core Specifications (Public)#

ItemParameter
ArchitecturePingtouge self-developed NPU architecture
Process7nm (estimated, transferred to domestic production after sanctions)
HBM96 GB HBM2e
Memory Bandwidth700 GB/s
TDP400W
InterfacePCIe 5.0 ×16
Inter-chip Interconnect700 GB/s (specific protocol not disclosed)
Mass ProductionOriginally 2024, delayed due to sanctions
Commercial AvailabilityTesting in 2025, not yet announced

📝 Data Note: Core metrics such as FP32/FP16/INT8 compute and high-speed interconnect technology have not been publicly disclosed. Actual performance requires subsequent empirical verification.

PPU vs NVIDIA H20 Comparison#

MetricAlibaba PPUNVIDIA H20Comparison
HBM Capacity96GB HBM2e96GB HBM3Capacity comparable
Memory Bandwidth700 GB/s>1 TB/sH20 bandwidth lead
TDP400W400WComparable
InterfacePCIe 5.0 ×16PCIe 5.0PPU interface more advanced
Single-card Cost-40% vs H20BaselinePPU significant cost advantage
EcosystemSelf-developed HALO stackCUDAH20 ecosystem mature
Comprehensive PerformanceClose to H20BaselinePPU is important breakthrough

Deployment Status#

  • China Unicom Sanjiangyuan Green Computing Power Center: Alibaba Cloud signed 1,024 devices, 16,384 Pingtouge compute cards, contributing 1,945 P compute (largest brand in project procurement)
  • Alibaba Internal: Taobao search recommendation, Alibaba Cloud PAI platform, Cainiao logistics scheduling (Incang 800 already deployed, PPU will gradually replace)

Strategic Significance#

  • Domestic substitution: Affected by US sanctions, transferred to domestic production, important breakthrough in domestic AI chip supply chain autonomy
  • Cost advantage: Single-card BOM cost 40% lower than H20, driving Alibaba Cloud inference instance prices down 50%
  • Performance breakthrough: Comprehensive performance close to H20, significant upgrade from Incang 800 (12nm, 820 TOPS)#

Manufacturer Information#

ItemContent
ManufacturerAlibaba Pingtouge Semiconductor (T-Head)
Product Pagehttps://www.t-head.cn/
Software StackHALO (Hanguang Accelerated Linear Operator)
Ecosystem PositioningAlibaba Cloud internal deployment + future external sales