arxiv:2411.15124
Jacob Morrison
jacobmorrison
models 40
jacobmorrison/Olmo-3-7B-Instruct-DPO-do-sample • Text Generation • 7B • 80
jacobmorrison/Olmo-3-7B-Instruct-SFT-do-sample • Text Generation • 7B • 84
jacobmorrison/qwen3-32b-no-think • Text Generation • 33B • 14
jacobmorrison/toolu-tokenizer
jacobmorrison/private-test • 20B • 9
jacobmorrison/L3.1-70B-T3-70B-thoughts
jacobmorrison/tk-instruct-xxl-lora-experiments • 6
jacobmorrison/tk-instruct-xl-lora-experiments • 7
jacobmorrison/tk-instruct-large-lora-experiments • 8
jacobmorrison/tk-instruct-base-lora-experiments • 10
datasets 367
jacobmorrison/social-rl-eval-dataset-DPO-100 • Viewer • 600 • 7
jacobmorrison/social-rl-eval-dataset-SFT-100 • Viewer • 600 • 14
jacobmorrison/social-rl-eval-dataset-100 • Viewer • 600 • 34
jacobmorrison/social-rl-eval-prompts-100 • Viewer • 600 • 32
jacobmorrison/social-rl-eval-prompts • Viewer • 6k • 25
jacobmorrison/if-prompts-used • Viewer • 3.3k • 33
jacobmorrison/code-prompts-used • Viewer • 3.33k • 22
jacobmorrison/math-prompts-used • Viewer • 3.33k • 15
jacobmorrison/rlvr_math_ood • Viewer • 22.9k • 23
jacobmorrison/rlvr_math_id • Viewer • 33.3k • 34