No More Adam: Learning Rate Scaling at Initialization Is All You Need