SEA-LION-ModernBERT-Embedding-600M-OV
Last updated: 2026-05-20
- Model creator: AI Singapore
- Original model: aisingapore/SEA-LION-ModernBERT-Embedding-600M
SEA-LION (Southeast Asian Languages In One Network) is a collection of Large Language Models (LLMs) which have been pretrained and instruct-tuned for the Southeast Asia (SEA) region.
This repository contains the SEA-LION-ModernBERT-Embedding-600M model converted to the OpenVINO™ IR (Intermediate Representation) format with weights compressed to INT8 by NNCF.
- Developed by: AI Products Pillar, AI Singapore
- Funded by: Singapore NRF
- Shared by: AI Products Pillar, AI Singapore
- Model Type: Encoder (OpenVINO IR Format: .xml / .bin)
- Tokenizer: Custom Gemma 3 SentencePiece Tokenizer (262K Vocabulary size)
- Context Length: 8k tokens
- Languages Supported: Burmese, Chinese, English, Filipino, Indonesian, Khmer, Lao, Malay, Tamil, Thai, and Vietnamese
- License: MIT
- Quantized from model: SEA-LION-ModernBERT-Embedding-600M
Quantization Parameters
Weight compression was performed using optimum-cli export openvino with the following parameters:
- weight-format: int8
- ratio: 1.0
- group-size: 128
For more information on quantization, see the OpenVINO model optimization guide.
Compatibility
The provided OpenVINO™ IR model is compatible with:
- OpenVINO version 2025.4.0 and higher
- OpenVINO GenAI version 2025.4.0 and higher
- Neural Network Compression Framework (NNCF) 2.19.0 or higher
- Optimum Intel 1.26.1 and higher
Training Details
This OpenVINO model is derived from the original base model via weight-only quantization; no additional training was performed.
Bias, Risks, and Limitations
For full details on limitations, please refer to the original model card.
Acknowledgement
This work was made possible through a collaboration with Intel. We deeply appreciate the support and tools provided.
The SEA-LION project is supported by the National Research Foundation Singapore and the Infocomm Media Development Authority (IMDA), Singapore under its National Large Language Model Funding Initiative.
Team
Intel
Ramesh Perumal (Intel), Malcolm Chan Hao Xian (Intel)
AI Singapore
Ahmed Dabeer, Ahn Jeongmi, Antonyrex Sajeban, Chan Hok Teng Adwin, Cheng Zi Yi Nicholas, Choa Hsueh Mei Esther, Heng Jonathan, Huang Yuli, Jann Railey Estrada Montalan, Lee Chwan Ren, Leong Wai Yi, Leong Wei Qi, Liew Rachel, Limkonchotiwat Peerat, Muhammad Ridzuan Bin Mokhtar, Nagarajan Karthik, Ng Boon Cheong Raymond, Ngee Chia Tai, Ngui Jian Gang, Nguyen Thanh Ngan, Ong Tat-Wee David, Ong Zhi Hao, Pereira Mark, Poon Joseph, Rengarajan Hamsawardhini, Siow Wei Kang Bryan, Susanto Yosephine, Sutaveephamochanon Anocha, Tan Choon Meng, Tan Chor Phin Evelyn, Tan Siao Wei Jessica, Tan Yixian, Tee Jun Yun, Teng Kok Wai Walter, Teo Eng Sipp Leslie, Tjhi William, Wu Donghang, Yeo Yeow Tong, Yong Xianbin, Zhang Haoyang, Zhang Zhou
Contact
For more info, please contact us using this sealion@aisingapore.org
- Downloads last month
- 82
