hanspeterlyngsoeraaschoujensen/Qwen3-0.6B-softmax-1-linear-hidden_states_layer_24with_question_embedding-1-0-20260522-235359 Updated about 5 hours ago
hanspeterlyngsoeraaschoujensen/Qwen3-0.6B-softmax-1-linear-hidden_states_layer_24without_question_embedding-1-0-20260523-000400 Updated about 5 hours ago
hanspeterlyngsoeraaschoujensen/Qwen3-0.6B-softmax-1-linear-hidden_states_layer_16with_question_embedding-1-0-20260522-231635 Updated about 5 hours ago
hanspeterlyngsoeraaschoujensen/Qwen3-0.6B-softmax-1-linear-hidden_states_layer_16without_question_embedding-1-0-20260522-231635 Updated about 5 hours ago • 1
hanspeterlyngsoeraaschoujensen/Qwen3-0.6B-softmax-1-linear-hidden_states_layer_8with_question_embedding-1-0-20260522-231636 Updated about 5 hours ago
hanspeterlyngsoeraaschoujensen/Qwen3-0.6B-softmax-1-linear-hidden_states_layer_8without_question_embedding-1-0-20260522-231635 Updated about 5 hours ago
hanspeterlyngsoeraaschoujensen/DeepScaleR-1.5B-lora-256-scaling_factor_5.0-mask_cosine_0.0 Updated Sep 19, 2025
hanspeterlyngsoeraaschoujensen/DeepScaleR-1.5B-lora-256-scaling_factor_5.0-mask_cosine_0.00_0.90 Updated Aug 29, 2025
hanspeterlyngsoeraaschoujensen/llm-finetune-DeepScaleR-1.5B-Preview-128-new-tokens-scaling-factor-5.0-mask-cosi Updated Aug 27, 2025
hanspeterlyngsoeraaschoujensen/deberta-v3-base-finetuned-nlp-course Question Answering • 0.2B • Updated Sep 23, 2024 • 1
hanspeterlyngsoeraaschoujensen/distilbert-base-uncased-finetuned-nlp-course Question Answering • 66.4M • Updated Sep 23, 2024 • 2
hanspeterlyngsoeraaschoujensen/mt5-base-finetuned-nlp-course Question Answering • 0.4B • Updated Sep 21, 2024
hanspeterlyngsoeraaschoujensen/deepseek-math-7b-instruct-awq-Q4 Text Generation • 7B • Updated Feb 8, 2024 • 7