Nikos Atlas feb0a01cdd adds synonyms - tests and configuration - snapshot testing 2 năm trước cách đây
..
LICENSE feb0a01cdd adds synonyms - tests and configuration - snapshot testing 2 năm trước cách đây
README feb0a01cdd adds synonyms - tests and configuration - snapshot testing 2 năm trước cách đây
citation.bib feb0a01cdd adds synonyms - tests and configuration - snapshot testing 2 năm trước cách đây
wn-data-ind.tab feb0a01cdd adds synonyms - tests and configuration - snapshot testing 2 năm trước cách đây
wn-data-zsm.tab feb0a01cdd adds synonyms - tests and configuration - snapshot testing 2 năm trước cách đây

README

This is the raw data for the Wordnet Bahasa, a wordnet for the Malay
languages (currently Malaysian and Indonesian).

For more details see the project page at:

The data is released under the MIT license.

File format:

synset\tlang\tgoodness\tlemma

synset is the offset-pos from Princeton wordnet 3.0
lang
B (Bahasa = msa);
I (Indonesian = ind);
M (Malay = zsm)
goodness is:
Y = hand checked and good
O = automatic high quality (good)
M = automatic medium quality (ok)
L = automatic, probably bad (low)
X = hand checked and bad

Normal release has only Y and O.


e.g.
00015388-n B X fauna
00015388-n M Y haiwan
00015388-n I Y hewan

Note: msa is the supertype of ind and zsm
========================================================================
Apostrophe should be (’) U+2019 as in: Côte d’Ivoire.
Technically glottal stop should be (ʼ) Letter apostrophe U+02BC.
We need to make the lookup more forgiving of this.

There are some abbreviations in use:
yg = yang
sso =

========================================================================
Def:

06822958-n DEF tanda koma di bawah konsonan c tanda bunyi 's'
06823760-n DEF dua titik di atas huruf vokal