- DEFAULT_RANK = 64 in train_router.py - All references use the constant, not magic numbers - ~2.5GB optimizer state instead of ~10GB Co-Authored-By: Proof of Concept <poc@bcachefs.org> |
||
|---|---|---|
| .. | ||
| __init__.py | ||
| checkpoint_sync.py | ||
| export_hook.py | ||
| optimizer.py | ||
| steering.py | ||
| train_router.py | ||
| weight_mapping.py | ||