Preface: This post contains study notes on Fully Sharded Data Parallel (FSDP).
In this tutorial, we show how to use FSDP APIs, for simple MNIST models th
2024-07-31