Public Timelines

For companies For educational institutions For teachers For students Dashboard

For educational institutions For teachers For students Dashboard

Create a timeline

Public timelines

About & Feedback Terms Privacy FAQ

Dashboard Get premium

Donate

sep 1, 2020 - RLHF

Description:

Focus: Use a reward model which predicts human preferences to fine tune a pre-trained model

Added to timeline:

LLM timeline

ByJustin Milner

14 Aug 2023

0

0

1777

Date:

sep 1, 2020

Now

~ 5 years and 9 months ago

About & Feedback Terms Privacy FAQ

Dashboard Get premium

Donate

The service accepts bank transfer (ACH, Wire) or cards (Visa, MasterCard, etc). Processed by Stripe.

Secured with SSL