The 0.9B-parameter model pairs a NaViT-style visual encoder with the ERNIE-4.5-0.3B language model to parse multilingual documents in 109 languages. Its authors report state-of-the-art (SOTA) results.