lr, rank, betas, eps, weight_decay, warmup_steps, scale, proj_refresh, norm_growth_limit — all optional with sensible defaults. Co-Authored-By: Proof of Concept <poc@bcachefs.org> |
||
|---|---|---|
| .. | ||
| __init__.py | ||
| checkpoint_sync.py | ||
| export_hook.py | ||
| optimizer.py | ||
| steering.py | ||
| weight_mapping.py | ||
| worker.py | ||