SEA-LION OpenVINO Banner

SEA-LION-ModernBERT-Embedding-600M-OV

Last updated: 2026-05-20


SEA-LION (Southeast Asian Languages In One Network) is a collection of Large Language Models (LLMs) which have been pretrained and instruct-tuned for the Southeast Asia (SEA) region.

This repository contains the SEA-LION-ModernBERT-Embedding-600M model converted to the OpenVINO™ IR (Intermediate Representation) format with weights compressed to INT8 by NNCF.

  • Developed by: AI Products Pillar, AI Singapore
  • Funded by: Singapore NRF
  • Shared by: AI Products Pillar, AI Singapore
  • Model Type: Encoder (OpenVINO IR Format: .xml / .bin)
  • Tokenizer: Custom Gemma 3 SentencePiece Tokenizer (262K Vocabulary size)
  • Context Length: 8k tokens
  • Languages Supported: Burmese, Chinese, English, Filipino, Indonesian, Khmer, Lao, Malay, Tamil, Thai, and Vietnamese
  • License: MIT
  • Quantized from model: SEA-LION-ModernBERT-Embedding-600M

Quantization Parameters

Weight compression was performed using optimum-cli export openvino with the following parameters:

  • weight-format: int8
  • ratio: 1.0
  • group-size: 128

For more information on quantization, see the OpenVINO model optimization guide.

Compatibility

The provided OpenVINO™ IR model is compatible with:

  • OpenVINO version 2025.4.0 and higher
  • OpenVINO GenAI version 2025.4.0 and higher
  • Neural Network Compression Framework (NNCF) 2.19.0 or higher
  • Optimum Intel 1.26.1 and higher

Training Details

This OpenVINO model is derived from the original base model via weight-only quantization; no additional training was performed.

Bias, Risks, and Limitations

For full details on limitations, please refer to the original model card.

Acknowledgement

This work was made possible through a collaboration with Intel. We deeply appreciate the support and tools provided.

The SEA-LION project is supported by the National Research Foundation Singapore and the Infocomm Media Development Authority (IMDA), Singapore under its National Large Language Model Funding Initiative.

Team

Intel

Ramesh Perumal (Intel), Malcolm Chan Hao Xian (Intel)

AI Singapore

Ahmed Dabeer, Ahn Jeongmi, Antonyrex Sajeban, Chan Hok Teng Adwin, Cheng Zi Yi Nicholas, Choa Hsueh Mei Esther, Heng Jonathan, Huang Yuli, Jann Railey Estrada Montalan, Lee Chwan Ren, Leong Wai Yi, Leong Wei Qi, Liew Rachel, Limkonchotiwat Peerat, Muhammad Ridzuan Bin Mokhtar, Nagarajan Karthik, Ng Boon Cheong Raymond, Ngee Chia Tai, Ngui Jian Gang, Nguyen Thanh Ngan, Ong Tat-Wee David, Ong Zhi Hao, Pereira Mark, Poon Joseph, Rengarajan Hamsawardhini, Siow Wei Kang Bryan, Susanto Yosephine, Sutaveephamochanon Anocha, Tan Choon Meng, Tan Chor Phin Evelyn, Tan Siao Wei Jessica, Tan Yixian, Tee Jun Yun, Teng Kok Wai Walter, Teo Eng Sipp Leslie, Tjhi William, Wu Donghang, Yeo Yeow Tong, Yong Xianbin, Zhang Haoyang, Zhang Zhou

Contact

For more info, please contact us using this sealion@aisingapore.org

Downloads last month
82
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for aisingapore/SEA-LION-ModernBERT-Embedding-600M-OV

Collections including aisingapore/SEA-LION-ModernBERT-Embedding-600M-OV