Welcome to Malaya’s documentation!#


Pypi version Python3 version MIT License Documentation total stats download stats / month discord

Malaya is a Natural-Language-Toolkit library for bahasa Malaysia, powered by PyTorch.


Proper documentation is available at https://malaya.readthedocs.io/

Installing from the PyPI#

$ pip install malaya

It will automatically install all dependencies except for PyTorch. So you can choose your own PyTorch CPU / GPU version.

Only Python >= 3.6.0, and PyTorch >= 1.10 are supported.

If you are a Windows user, make sure read https://malaya.readthedocs.io/en/latest/running-on-windows.html

Development Release#

Install from master branch,

$ pip install git+https://github.com/huseinzol05/malaya.git

We recommend to use virtualenv for development.

Documentation at https://malaya.readthedocs.io/en/latest/

Pretrained Models#

Malaya also released Malaysian pretrained models, simply check at https://huggingface.co/mesolitica


If you use our software for research, please cite:

@misc{Malaya, Natural-Language-Toolkit library for bahasa Malaysia, powered by PyTorch,
  author = {Husein, Zolkepli},
  title = {Malaya},
  year = {2018},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/mesolitica/malaya}}


Thanks to,

  1. KeyReply for private V100s cloud.

  1. Nvidia for Azure credit.

  1. Tensorflow Research Cloud for free TPUs access.



Thank you for contributing this library, really helps a lot. Feel free to contact me to suggest me anything or want to contribute other kind of forms, we accept everything, not just code!



Getting Started

Jawi Module