Mixture of experts (MoE) is a machine learning technique in which multiple expert networks are used to divide a problem space into homogeneous regions. MoE is a form of ensemble learning; such systems were also historically called committee machines (Wikipedia).
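To make the idea concrete, here is a minimal sketch of an MoE layer, assuming PyTorch; the expert count, hidden sizes, and top-2 routing are illustrative choices, not details from the text above. A gating network scores the experts for each input, and only the top-scoring experts process it.

```python
# Minimal mixture-of-experts sketch (illustrative sizes; assumes PyTorch).
import torch
import torch.nn as nn
import torch.nn.functional as F

class MixtureOfExperts(nn.Module):
    def __init__(self, dim: int, num_experts: int = 4, top_k: int = 2):
        super().__init__()
        # Each expert is a small feed-forward network responsible for part of the input space.
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(dim, 4 * dim), nn.ReLU(), nn.Linear(4 * dim, dim))
            for _ in range(num_experts)
        )
        # The gating network scores experts per input.
        self.gate = nn.Linear(dim, num_experts)
        self.top_k = top_k

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        scores = self.gate(x)                               # (batch, num_experts)
        weights, indices = scores.topk(self.top_k, dim=-1)  # keep only the top-k experts
        weights = F.softmax(weights, dim=-1)                # renormalise their weights
        out = torch.zeros_like(x)
        for slot in range(self.top_k):
            for e, expert in enumerate(self.experts):
                mask = indices[:, slot] == e                # inputs routed to expert e
                if mask.any():
                    out[mask] += weights[mask, slot, None] * expert(x[mask])
        return out

# Usage: a batch of 8 vectors of width 16.
layer = MixtureOfExperts(dim=16)
print(layer(torch.randn(8, 16)).shape)  # torch.Size([8, 16])
```

Because only k experts run per input, compute grows with k rather than with the total number of experts, which is what makes sparse MoE models efficient.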
Researchers validated a metric for predicting the compute efficiency of sparse models, and developed Hessian-aware low-bit inference with expert offloading that reduces on-device memory by roughly 60%.
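Expert offloading in general means keeping only a few experts resident in fast memory and loading the rest on demand as the gate routes to them. The sketch below illustrates that generic idea with a small LRU cache; it is not the Hessian-aware method referenced above, and the cache size and loader callable are assumptions for illustration.

```python
# Generic expert-offloading sketch: an LRU cache of resident expert weights.
from collections import OrderedDict

class ExpertCache:
    def __init__(self, load_expert, max_resident: int = 2):
        self.load_expert = load_expert    # callable: expert_id -> weights (e.g. read from disk)
        self.max_resident = max_resident  # how many experts stay in fast memory at once
        self.resident = OrderedDict()     # expert_id -> weights, ordered by recency of use

    def get(self, expert_id):
        if expert_id in self.resident:
            self.resident.move_to_end(expert_id)       # mark as most recently used
        else:
            if len(self.resident) >= self.max_resident:
                self.resident.popitem(last=False)      # evict the least recently used expert
            self.resident[expert_id] = self.load_expert(expert_id)
        return self.resident[expert_id]

# Usage: pretend loading returns a placeholder weight blob.
cache = ExpertCache(load_expert=lambda i: f"weights-for-expert-{i}", max_resident=2)
for eid in [0, 1, 0, 3]:        # routing decisions from the gate
    print(eid, cache.get(eid))  # expert 1 is evicted when expert 3 is loaded
```

Memory savings then depend on how many experts must stay resident to keep cache misses (and hence load latency) acceptable for the observed routing pattern.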