We want to make sure not just the code we open-sourced, but also goes to dataset, so everyone can validate.
You can check in Malay-Dataset for our open dataset.
- Please citate the repository if use these corpus.
- Please at least email us first before distributing these data. Remember all these hard workings we want to give it for free.
- What do you see just the data, but nobody can see how much we spent our cost to make it public.
- We want to make sure downloaders got the best bandwidth and top speed, we host everything on S3, please consider a donation to prevent top-speed shutdown or broken link!
- Husein really need money to stay survive, he is still a human. 7053174643, CIMB Click, Husein Zolkepli