Online learning llm

The flow

  1. llm generate text

  2. user does some updates or corrections

  3. corrected text is used as finetuning data

  4. Next time similar prompt appears, LLM uses learned patterns and user needs to make less corrections over time

Please authenticate to join the conversation.

Upvoters
Status

In Review

Board
Custom icon

Feature Requests

Date

3 months ago

Author

Olya Sirkin

Subscribe to post

Get notified by email when there are changes.