Researchers validated a metric for predicting the compute efficiency of sparse models and developed Hessian-aware low-bit inference with expert offloading, reducing on-device memory use by roughly 60%.