Update README.md
README.md
CHANGED
@@ -58,6 +58,7 @@ These results demonstrate consistent and effective acceleration across various t
 
 - **Training Framework:** This model was trained using **[SpecForge](https://github.com/sgl-project/SpecForge)**, an open-source framework for speculative decoding research.
 - **Training Data:** The model was trained on the **EagleChat** dataset. Available on [Hugging Face](https://huggingface.co/datasets/zhaode/EagleChat) and [ModelScope](https://modelscope.cn/datasets/zhaode/EagleChat).
+- **Training Duration:** The model was trained for 3 epochs on 8x MI308X GPUs, which took 56 hours and totaled 448 `MI308X GPU-hours`.
 
 </div>
 
@@ -105,5 +106,6 @@ EAGLE (Extrapolative A* Generative Language Engine) is an advanced speculative de
 
 - **Training Framework:** This model was trained with **[SpecForge](https://github.com/sgl-project/SpecForge)**, an open-source framework for speculative decoding research.
 - **Training Data:** The training data is the **EagleChat** dataset, available on [Hugging Face](https://huggingface.co/datasets/zhaode/EagleChat) or [ModelScope](https://modelscope.cn/datasets/zhaode/EagleChat).
+- **Training Duration:** Training ran for 3 epochs on 8x MI308X GPUs and took 56 hours, for a total of 448 `MI308X GPU-hours`.
 
 </div>
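The GPU-hour total in the added line is the device count multiplied by the run time. A minimal sketch of that arithmetic, assuming the 56 hours is wall-clock time for the full 3-epoch run and that all 8 GPUs were in use throughout; the function name `gpu_hours` is hypothetical and used only for illustration:

```python
def gpu_hours(num_gpus: int, wall_clock_hours: float) -> float:
    """Total device-hours, assuming every GPU is busy for the whole run."""
    return num_gpus * wall_clock_hours


# Figures taken from the added README line: 8x MI308X, 56 hours.
total = gpu_hours(num_gpus=8, wall_clock_hours=56)
print(total)  # 448
```

The result matches the 448 `MI308X GPU-hours` stated in the README.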