Collection of models and datasets for the paper "Beyond Binary Rewards: Training LMs to Reason About Their Uncertainty"