Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
proxectonos
's Collections
FA Models
MrBERT-nos-gl
Domain Specific Corpora
CorpusNÓS: A massive Galician corpus for training LLM
Text Datasets for Fine-tuning and Instruction tuning
Text Datasets for Evaluation
MT
Text Models
TTS Models
ASR Models
Instruction Pretrained Experiments
MT Models (former)
ASR Datasets
TTS Datasets
Domain Specific Corpora
updated
24 days ago
Collection of corpora prepared from specific domains mainly in Galician language.
Upvote
-
proxectonos/corpus_dominio_legal_administrativo
Preview
•
Updated
24 days ago
•
369
proxectonos/corpus_dominio_periodistico
Viewer
•
Updated
23 days ago
•
280k
•
148
proxectonos/corpus_dominio_cientifico
Preview
•
Updated
23 days ago
•
71
proxectonos/corpus_dominio_museistico_patrimonio
Viewer
•
Updated
21 days ago
•
14.5k
•
314
Upvote
-
Share collection
View history
Collection guide
Browse collections