Phi 2.0 - Data Loading and Paths
Basic datasets configuration
datasets:
- path: datasets/alpaca-cleaned/alpaca_data_cleaned.json
type: alpaca
ds_type: json
data_files:
- alpaca_data_cleaned.json
dataset_prepared_path:
val_set_size: 0.20
output_dir: ./phi-outFull assessment of all of the datasets configuration options
path:
path:This parameter indicates the path to a dataset that has been prepared for training.
type:
type:ds_type:
ds_type:data_files:
data_files:dataset_prepared_path:
dataset_prepared_path:output_dir:
output_dir: Advanced: Explanation of all other data related configuration options
shards:
shards: name:
name: train_on_split:
train_on_split: conversation:
conversation: field_human and field_model: (optional)
field_human and field_model: (optional)roles:
roles: input:
input:output:
output: Custom user instruction prompt
system_prompt:
system_prompt: system_format:
system_format:field_system, field_instruction, field_input, field_output:
field_system, field_instruction, field_input, field_output: format:
format: no_input_format:
no_input_format:Datasets for Evaluation
test_datasets
test_datasetsdataset_prepared_path:
dataset_prepared_path:push_dataset_to_hub
push_dataset_to_hubhub_model_id
hub_model_idhub_strategy
hub_strategyLast updated
Was this helpful?

