Rewarded soups: towards Pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards Paper • 2306.04488 • Published Jun 7, 2023 • 2
eP-ALM: Efficient Perceptual Augmentation of Language Models Paper • 2303.11403 • Published Mar 20, 2023 • 3