Skip to content

Add a Dataset Formats reference guide #70

Description

@adohe

Problem

Dataset shape is easy to misunderstand. Raw GSM8K (question/answer) needs different formatting per algorithm: RL math uses prompt/solutions; SFT expects messages, prompt+response, or text. "GSM8K works with AReno" does not mean it works for every algorithm.

Scope

Add a Dataset Formats page answering: "What columns must my dataset have for each AReno algorithm?"

  • Mental model: raw datasets vs AReno training schemas vs loader functions.
  • SFT schemas (prompt+response, messages, text) and their loss behavior.
  • RL math schema (prompt+solutions as reward metadata, not an SFT target).
  • DPO preference schema (prompt/chosen/rejected).
  • Dataset loader function contract and when to use one.
  • GSM8K examples for both RL and SFT; state plainly raw GSM8K is not SFT-ready.
  • Working CLI examples per shape.

Acceptance

  • Short, exact, example-heavy; no marketing language.
  • Cites existing examples/math/ files.
  • No code or CLI changes.

Metadata

Metadata

Labels

area/apiIssues or PRs related to the SDK/Trainer public APIarea/dxIssues or PRs related to developer experience (error messages, ergonomics, onboarding)good first issueDenotes an issue ready for a new contributorkind/documentationCategorizes issue or PR as related to documentation

Type

No type
No fields configured for issues without a type.

Projects

No projects

Relationships

None yet

Development

No branches or pull requests

Issue actions