Improving LLM Safety and Helpfulness using SFT and DPO: A Study on OPT-350M
