AI & ML interests
NLP
RefalMachine/llama3_extended_darulm_20_05_24_part1-2_64000_bpe_full_lr1e4_bs256 • Text Generation • 8B • 5
RefalMachine/llama3_darulm_20_05_24_part1-2_128000_unigram_full_lr1e4_bs256 • Text Generation • 8B • 8
RefalMachine/llama3_darulm_20_05_24_part1-2_128000_bpe_full_lr1e4_bs256 • Text Generation • 8B • 4
RefalMachine/openchat-3.5-0106__eval_bs_experiments
RefalMachine/Meta-Llama-3-8B-Instruct_eval_bs_experiments
RefalMachine/saiga_llama3_8b_v7_eval
RefalMachine/Qwen2-7B-Instruct_eval
RefalMachine/openchat-3.5-0106_eval
RefalMachine/gemma-2-9b-it
RefalMachine/Meta-Llama-3-8B-Instruct_eval
RefalMachine/llama3_cut_extended_darulm_20_05_24_part1-2_64000_64000_bpe_part1_lr2e5_bs256 • Text Generation • 8B • 6
RefalMachine/llama3_cut_extended_darulm_20_05_24_part1-2_64000_64000_bpe_part1_lr5e5_bs256 • Text Generation • 8B • 5
RefalMachine/llama3_cut_extended_darulm_20_05_24_part1-2_64000_64000_bpe_part1_lr2e4_bs256 • Text Generation • 8B • 7
RefalMachine/llama3_cut_extended_darulm_20_05_24_part1-2_64000_64000_bpe_part1_lr1e4_bs256 • Text Generation • 8B • 4
RefalMachine/mistral_darulm_20_05_24_part1-2_32000_unigram_part1-2_lr1e4_bs256
RefalMachine/llama3_extended_darulm_20_05_24_part1-2_64000_bpe_part1_lr2e5_bs256 • Text Generation • 8B • 5
RefalMachine/llama3_cut_extended_darulm_20_05_24_part1-2_64000_64000_bpe_mean_init_03_07_24 • Text Generation • 8B • 5
RefalMachine/mistral_extended_darulm_20_05_24_part1-2_32000_bpe_part1-2_lr1e4_bs256 • Text Generation • 7B • 3
RefalMachine/mistral_darulm_20_05_24_part1-2_32000_bpe_part1-2_lr1e4_bs256 • Text Generation • 7B • 5
RefalMachine/llama3_extended_darulm_20_05_24_part1-2_64000_bpe_part1_lr5e5_bs256 • Text Generation • 8B • 5
RefalMachine/llama3_extended_darulm_20_05_24_part1-2_64000_bpe_part1_lr1e4_bs256 • Text Generation • 8B • 3
RefalMachine/llama3_extended_darulm_20_05_24_part1-2_64000_bpe_part1_lr2e4_bs256 • Text Generation • 8B • 5
RefalMachine/mistral_extended_darulm_20_05_24_part1-2_32000_bpe_full_lr1e4_bs256 • Text Generation • 7B • 4
RefalMachine/mistral_darulm_20_05_24_part1-2_32000_unigram_full_lr1e4_bs256 • Text Generation • 7B • 6
RefalMachine/mistral_darulm_20_05_24_part1-2_32000_bpe_full_lr1e4_bs256 • Text Generation • 7B • 6
RefalMachine/llama3_extended_darulm_20_05_24_part1-2_64000_bpe_mean_init_03_07_24 • Text Generation • 8B • 5
RefalMachine/mistral_extended_darulm_20_05_24_part1-2_32000_bpe_part1_lr2e5_bs256 • Text Generation • 7B • 5
RefalMachine/mistral_extended_darulm_20_05_24_part1-2_32000_bpe_part1_lr2e4_bs256 • Text Generation • 7B • 6
RefalMachine/llama3_darulm_20_05_24_part1-2_128000_bpe_part1_lr2e4_bs256 • Text Generation • 8B • 5
RefalMachine/llama3_darulm_20_05_24_part1-2_128000_bpe_part1_lr2e5_bs256 • Text Generation • 8B • 5