[awesome\_llm\_training.md (metame-ai/awesome-llm-plaza, GitHub)](https://github.com/metame-ai/awesome-llm-plaza/blob/main/docs/awesome_llm_training.md)
- [[2024__NSDI__MegaScale - Scaling Large Language Model Training to More Than 10,000 GPUs]]
- [[2023__arXiv__Unicron - Economizing Self-Healing LLM Training at Scale]]