GPT-1900 Collection Pre-1900 LLMs for physics reasoning. RL models are physics-only; use the SFT model for general chat. Tune temperature (0.6-0.7). • 11 items • Updated about 1 month ago • 8
science-of-finetuning/diff-mining-qwen3-14b-cross-method-intersection Viewer • Updated Mar 31 • 5 • 9
science-of-finetuning/diff-mining-qwen3-14b-cross-method-intersection Viewer • Updated Mar 31 • 5 • 9
science-of-finetuning/diff-mining-qwen3-14b-union-tulu-frac-fineweb-nmf Viewer • Updated Mar 31 • 20 • 15
science-of-finetuning/diff-mining-qwen3-14b-union-tulu-frac-fineweb-nmf Viewer • Updated Mar 31 • 20 • 15
science-of-finetuning/diff-mining-qwen3-14b-tulu-fraction-positive-diff Viewer • Updated Mar 31 • 5 • 15
science-of-finetuning/diff-mining-qwen3-14b-tulu-fraction-positive-diff Viewer • Updated Mar 31 • 5 • 15