◐ Shell
clean mode source ↗

Added how to use commit0 for sampling during STAR training by wenting-zhao · Pull Request #105 · commit-0/commit0

Skip to content

Navigation Menu

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sign up

Appearance settings

Conversation

@wenting-zhao

python examples/star/star.py --model_name_or_path meta-llama/Llama-3.1-8B-Instruct --dataset_name commit0/mbpp -n 10 --output_dir outputs --low_cpu_mem_usage --with_tracking --report_to wandb --iteration 5 --per_device_train_batch_size 1 --gradient_accumulation_steps 8

TODOs:

  • Extend STAR to multiple iterations (Done)
  • Align the code with the formalism of STaR

1 participant

@wenting-zhao