-
Notifications
You must be signed in to change notification settings - Fork 115
Pull requests: aws-samples/awsome-distributed-training
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
utility to dump details of all nodes in a cluster, into a csv file
#652
opened Apr 25, 2025 by
amitosaurus
Loading…
Feature/slinkly slurm hyperpod eks
enhancement
New feature or request
#651
opened Apr 25, 2025 by
bluecrayon52
Loading…
feat: Add LoRA fine-tuning optimum-neuron example for slurm
New model
#643
opened Apr 15, 2025 by
Captainia
Loading…
feat: Add Hyperpod Optimum-neuron LoRA example
New model
#631
opened Apr 4, 2025 by
Captainia
Loading…
Update bionemo test case + propose to subdirectories per orchastrator
documentation
Improvements or additions to documentation
Update SMPv2 conda setup script with latest PT2.3.1 TSM2.4.0
#366
opened Jun 25, 2024 by
viclzhu
Loading…
End-to-End LLM Model Development with Torchtitan and Torchtune
enhancement
New feature or request
#341
opened May 20, 2024 by
KeitaW
Loading…
ProTip!
Adding no:label will show everything without a label.