The Qwen3-2507 update raises benchmark scores by splitting the model line into separate Instruct and Thinking variants, each offering a 256k-token context window.