Year    Type & Venue    Work    Authors    Links
2026    ICLR workshop    Early Stopping for Reasoning LMs    A. Nagle, J. Saydaliev, D. Garbaya, M. Gastpar, A. V. Makkuva, H. Kim    📄 arXiv | 🌐 Project page | 🤗 Models
2025    ACL’26    Project Apertus [8B & 70B] 🌎    Apertus team*    📄 arXiv | 💻 Code | 🤗 Artifacts
2025    NeurIPS    FOG architectures: towards pure FP8 LLM training    A. H. Cano*, D. Garbaya*, I. Schlag, M. Jaggi    📄 arXiv | 💻 Code
2024    OS release    Falcon3 family of open models [1B → 10B] 🦅    Falcon-LLM Team*    📄 Report (undisclosed) | 📜 Blog | 🤗 Collection | 💬 Demo

*Main co-author / contributor