Indeed, fine tuning with either synthetic data (as you are proposing) or human r...

		snickmy on May 25, 2023 \| parent \| context \| favorite \| on: How to Finetune GPT-Like Large Language Models on ... Indeed, fine tuning with either synthetic data (as you are proposing) or human review works like that. you can read more here: https://huggingface.co/blog/rlhf