Upload folder using huggingface_hub
Browse files
README.md
CHANGED
|
@@ -6,7 +6,6 @@ library_name: transformers
|
|
| 6 |
|
| 7 |
**SDAR**(**S**ynergy of **D**iffusion and **A**uto**R**egression)-model is a new large language model that integrates autoregressive (AR) and discrete diffusion modeling strategies. It combines the efficient training paradigm of AR models with the highly parallel inference capability of diffusion models, while delivering performance fully on par with SOTA opensource AR models. At the same time, SDAR sets a new benchmark as the most powerful diffusion language model to date.
|
| 8 |
|
| 9 |
-
---
|
| 10 |
|
| 11 |
# Performance of SDAR-1.7B-Chat on various benchmarks
|
| 12 |
|
|
@@ -31,7 +30,6 @@ evaluation settings:
|
|
| 31 |
|
| 32 |
**Note**: The 4B, 8B, and 30B models are coming soon. Performance results for these models will be released in the near future.
|
| 33 |
|
| 34 |
-
|
| 35 |
## Inference
|
| 36 |
The inference code will come soon
|
| 37 |
|
|
|
|
| 6 |
|
| 7 |
**SDAR**(**S**ynergy of **D**iffusion and **A**uto**R**egression)-model is a new large language model that integrates autoregressive (AR) and discrete diffusion modeling strategies. It combines the efficient training paradigm of AR models with the highly parallel inference capability of diffusion models, while delivering performance fully on par with SOTA opensource AR models. At the same time, SDAR sets a new benchmark as the most powerful diffusion language model to date.
|
| 8 |
|
|
|
|
| 9 |
|
| 10 |
# Performance of SDAR-1.7B-Chat on various benchmarks
|
| 11 |
|
|
|
|
| 30 |
|
| 31 |
**Note**: The 4B, 8B, and 30B models are coming soon. Performance results for these models will be released in the near future.
|
| 32 |
|
|
|
|
| 33 |
## Inference
|
| 34 |
The inference code will come soon
|
| 35 |
|