Anime-Llasa-3B

Overview

Anime-Llasa-3B is a Text-to-Speech (TTS) model fine-tuned for Japanese. It is based on HKUSTAudio/Llasa-3B.
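
The following is a minimal loading sketch, not the author's documented usage. It assumes the checkpoint loads through the standard `transformers` causal-LM classes, as its base model HKUSTAudio/Llasa-3B does, and that `torch` and `transformers` are installed. The helper name `load_model` is hypothetical. Llasa-style models emit speech tokens, so turning the output into audio needs a separate codec step that is not shown here.

```python
# Repository name as it appears on this model page.
MODEL_ID = "NandemoGHS/Anime-Llasa-3B"


def load_model(model_id: str = MODEL_ID):
    """Load the tokenizer and language model (hypothetical helper).

    Assumption: the checkpoint is compatible with AutoModelForCausalLM.
    The first call downloads several gigabytes of weights.
    """
    # Imported lazily so this file can be read or imported without
    # the heavy dependencies being present.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype=torch.bfloat16,  # BF16, the tensor type listed for this checkpoint
    )
    model.eval()  # inference mode: disables dropout
    return tokenizer, model


# Usage (not executed here because it triggers a large download):
#   tokenizer, model = load_model()
```

Loading in BF16 keeps the in-memory weights at the precision they are stored in, rather than upcasting to 32-bit floats.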

Demo

You can try a demo on Hugging Face Spaces: Anime-Llasa-3B-Demo

What's New?

The primary improvement in this version is a significant increase in training data, from approximately 14,000 hours (3 epochs) to approximately 33,000 hours (1 epoch).

The larger dataset is intended to further improve the model's expressiveness and overall stability.
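
To make the comparison concrete, the short calculation below works through the quoted figures. It treats "hours × epochs" as a rough proxy for the total audio processed during training; that proxy is an assumption made here for illustration, not something the model card states.

```python
# Approximate figures quoted in the text above.
OLD_HOURS, OLD_EPOCHS = 14_000, 3
NEW_HOURS, NEW_EPOCHS = 33_000, 1


def audio_hours_seen(unique_hours: int, epochs: int) -> int:
    """Total hours of audio passed through training (unique hours x epochs)."""
    return unique_hours * epochs


old_seen = audio_hours_seen(OLD_HOURS, OLD_EPOCHS)
new_seen = audio_hours_seen(NEW_HOURS, NEW_EPOCHS)
unique_ratio = NEW_HOURS / OLD_HOURS  # growth in unique training audio

print(f"previous: {OLD_HOURS} unique hours, {old_seen} hours seen")
print(f"current:  {NEW_HOURS} unique hours, {new_seen} hours seen")
print(f"unique data grew by a factor of about {unique_ratio:.2f}")
```

Under this proxy, the increase lies in the amount of unique audio (roughly 2.4 times as much), while the total audio processed is somewhat lower because each sample is seen once instead of three times.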

Old Versions

Version 3 model

Version 1 model

License

This model is licensed under CC-BY-NC-4.0.

Weights format: Safetensors
Model size: 3B params
Tensor type: BF16
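
As a rough sanity check on these figures, the weight footprint can be estimated from the parameter count and the tensor type. The sketch below assumes exactly 3 billion parameters (the page only gives the rounded "3B") and counts weights alone, ignoring activations, the KV cache, and framework overhead.

```python
# Assumption: "3B params" taken as exactly 3 billion for the estimate.
N_PARAMS = 3_000_000_000
BYTES_PER_BF16 = 2  # bfloat16 stores each value in 16 bits

weight_bytes = N_PARAMS * BYTES_PER_BF16
weight_gb = weight_bytes / 1e9      # decimal gigabytes
weight_gib = weight_bytes / 2**30   # binary gibibytes

print(f"weights: about {weight_gb:.1f} GB ({weight_gib:.2f} GiB)")
```

Actual memory use during inference will be higher than this lower bound.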

Model tree for NandemoGHS/Anime-Llasa-3B

Base model: meta-llama/Llama-3.2-3B-Instruct
Fine-tuned from the base model: HKUSTAudio/Llasa-3B
Fine-tuned from HKUSTAudio/Llasa-3B: this model
