Provides schema examples for structuring conversations in SFT training.
jsonl
(JSON Lines)zip
The directory should be archived in a .zip
file and stored in an object storage.cd path/to/dataset_dir && zip -r dataset_dir.zip ./*
.jsonl
file should represent a complete training example. The supported format styles are:
system
: The initial system instruction that sets the behavior or tone for the assistant.conversation:
A list of human-assistant message pairs forming the dialogue history.
human
: A user query or input in the conversation. assistant:
The assistant’s response to the corresponding human input.messages:
A sequential list of role-based messages representing a full conversation.role:
The identity of the message sender (e.g., system, user, assistant).content:
The actual text of the message corresponding to the role.