Metadata
- Author: yandex
- Full Title: YaFSDP
- URL: https://link.alphasignal.ai/ia5Kzc
Highlights
- YaFSDP is up to 20% faster for pre-training LLMs and performs better in high memory pressure conditions. It is designed to reduce communications and memory operations overhead. (View Highlight)