How do I create a dataset?
Open Experiments, then choose Datasets from the secondary navigation. Click Add Dataset. Enter a name and optional description. Upload a CSV file. You can click the upload area or drag the CSV into it. Review the column mapping. Fetch Hive automatically maps:- normal columns to input values
expected_outputto expected outputmetadata.*columns to row metadata
How do I view dataset versions?
Open Experiments, then choose Datasets. Click a dataset row to open the dataset detail page. Use the version selector in the page header to switch between versions. Versions are shown as labels likev1, v2, or v4.
The page URL includes the selected version:
CSV file format
The first row must contain column headers. The CSV importer supports quoted values, commas inside quoted values, escaped quotes, multiline quoted values, and empty cells. Upload limits:| Limit | Value |
|---|---|
| File type | CSV |
| Maximum file size | 5 MB |
| Maximum rows | 10,000 |
| Preview rows shown in the dialog | 50 |
CSV columns
An experiment dataset has three kinds of columns:- input columns
- one optional expected output column
- optional metadata columns
Input columns
Input columns are the values Fetch Hive passes into the prompt or agent for that row. They do not have fixed names for prompt experiments.question, context, and customer_message are examples only. They are not required column names.
Use column names that match the candidate you are testing.
| Candidate type | What your CSV input columns should match | Example columns |
|---|---|---|
| Dashboard Prompt | The prompt variables in the prompt editor | question, context, input_url, input_scrape |
| Deployed Prompt | The prompt variables captured in the selected deployment version | question, context, input_url, input_scrape |
| Agent | The agent’s user message input | message |
{{question}} and {{context}}, your CSV should include question and context.
If your prompt has {{input_url}} and {{input_scrape}}, your CSV should include input_url and input_scrape instead.
For agent experiments, use message as the main input column. Agents start from a user message, so message is the clearest dataset shape.
Expected output column
Useexpected_output when you have a reference answer for the row.
| Column name | Required | Stored as | Notes |
|---|---|---|---|
expected_output | No | expected_output.value | Used for manual review today and future evaluator scoring. |
expected_output does not currently mark a result correct or incorrect automatically.
Metadata columns
Usemetadata.* columns for optional row labels.
Metadata columns do not trigger built-in behavior today. They do not change run order, model settings, locale, routing, or candidate execution. Fetch Hive stores them with the row so you can identify cases, filter results, compare groups, or connect results back to your own systems.
| Column pattern | Required | Stored as | Example |
|---|---|---|---|
metadata.case_id | No | metadata.case_id becomes case_id | geo-001 |
metadata.topic | No | metadata.topic becomes topic | geography |
metadata.priority | No | metadata.priority becomes priority | high |
metadata.source | No | metadata.source becomes source | support_faq |
metadata.language | No | metadata.language becomes language | en |
metadata.* name that helps your team review results.
Do not use meta_ prefixes for new datasets. Use metadata.* so the mapping is clear.
Common CSV shapes
Prompt with{{question}}:
{{question}} and {{context}}:
{{input_url}} and {{input_scrape}}:
Example CSV
You can download the example CSV from the Add Dataset dialog.Dataset versions
Dataset versions are immutable. If you need to change rows later, create a new dataset version instead of editing a version already used by a run. This keeps old experiment runs reproducible.Importing rows
Use Import on the dataset detail page to append rows to a dataset. Importing rows does not edit the current version. Fetch Hive creates a new immutable version that contains:- all rows from the latest dataset version
- any new rows from the uploaded CSV
- imported row count
- skipped duplicate count
- the new latest dataset version
v3 has 100 rows and you import a CSV with 20 rows where 5 are duplicates, Fetch Hive creates v4 with 115 rows.
Runs created before the import still point to their original dataset version. New runs can use the latest version.
See also: Build an experiment and Review results
