Cautious Optimizers: Improving Training with One Line of Code