SEA-LION-ModernBERT-Embedding-600M-OV

Last updated: 2026-05-20

Model creator: AI Singapore
Original model: aisingapore/SEA-LION-ModernBERT-Embedding-600M

SEA-LION (Southeast Asian Languages In One Network) is a collection of Large Language Models (LLMs) which have been pretrained and instruct-tuned for the Southeast Asia (SEA) region.

This repository contains the SEA-LION-ModernBERT-Embedding-600M model converted to the OpenVINO™ IR (Intermediate Representation) format with weights compressed to INT8 by NNCF.

Developed by: AI Products Pillar, AI Singapore
Funded by: Singapore NRF
Shared by: AI Products Pillar, AI Singapore
Model Type: Encoder (OpenVINO IR Format: .xml / .bin)
Tokenizer: Custom Gemma 3 SentencePiece Tokenizer (262K Vocabulary size)
Context Length: 8k tokens
Languages Supported: Burmese, Chinese, English, Filipino, Indonesian, Khmer, Lao, Malay, Tamil, Thai, and Vietnamese
License: MIT
Quantized from model: SEA-LION-ModernBERT-Embedding-600M

Quantization Parameters

Weight compression was performed using optimum-cli export openvino with the following parameters:

weight-format: int8
ratio: 1.0
group-size: 128

For more information on quantization, see the OpenVINO model optimization guide.

Compatibility

The provided OpenVINO™ IR model is compatible with:

OpenVINO version 2025.4.0 and higher
OpenVINO GenAI version 2025.4.0 and higher
Neural Network Compression Framework (NNCF) 2.19.0 or higher
Optimum Intel 1.26.1 and higher

Training Details

This OpenVINO model is derived from the original base model via weight-only quantization; no additional training was performed.

Bias, Risks, and Limitations

For full details on limitations, please refer to the original model card.

Acknowledgement

This work was made possible through a collaboration with Intel. We deeply appreciate the support and tools provided.

The SEA-LION project is supported by the National Research Foundation Singapore and the Infocomm Media Development Authority (IMDA), Singapore under its National Large Language Model Funding Initiative.

Team

Intel

Ramesh Perumal (Intel), Malcolm Chan Hao Xian (Intel)

AI Singapore

Ahmed Dabeer, Ahn Jeongmi, Antonyrex Sajeban, Chan Hok Teng Adwin, Cheng Zi Yi Nicholas, Choa Hsueh Mei Esther, Heng Jonathan, Huang Yuli, Jann Railey Estrada Montalan, Lee Chwan Ren, Leong Wai Yi, Leong Wei Qi, Liew Rachel, Limkonchotiwat Peerat, Muhammad Ridzuan Bin Mokhtar, Nagarajan Karthik, Ng Boon Cheong Raymond, Ngee Chia Tai, Ngui Jian Gang, Nguyen Thanh Ngan, Ong Tat-Wee David, Ong Zhi Hao, Pereira Mark, Poon Joseph, Rengarajan Hamsawardhini, Siow Wei Kang Bryan, Susanto Yosephine, Sutaveephamochanon Anocha, Tan Choon Meng, Tan Chor Phin Evelyn, Tan Siao Wei Jessica, Tan Yixian, Tee Jun Yun, Teng Kok Wai Walter, Teo Eng Sipp Leslie, Tjhi William, Wu Donghang, Yeo Yeow Tong, Yong Xianbin, Zhang Haoyang, Zhang Zhou