12345678910111213141516171819202122232425262728293031323334353637383940414243444546474849505152535455565758596061626364656667686970717273747576777879808182 |
- This directory contains redistributable wordnets in further sub-directories.
- Structure is:
- proj/wn-data-lang.tab synset lemma pairs (see below)
- proj/LICENCE original license file (or equivalent)
- proj/README Any notes about the conversion
- proj/lang2tab.py python script to extract the data
- may rely on wordnet version mappings
- proj/wn-data-lang.tab.log any notes from the conversion
- proj/citation.bib the canonical citation reference(s)
- Note that a single directory may have wordnets for multiple languages
- wn-data is formatted as follows:
- # name<tab>lang<tab>url<tab>license
- offset-pos<tab>type<tab>lemma
- offset-pos<tab>type<tab>lemma
- ...
- name is the name of the project
- lang is the iso 3 letter code for the name
- url is the url of the project
- license is a short name for the license
- offset is the Princeton WordNet 3.0 offset 8 digit offset
- pos is one of [a,s,v,n,r]
- lemma is the lemma (word separator normalized to ' ')
- type is the language:relationship (e.g. eng:lemma)
- Example:
- # Thai tha http://th.asianwordnet.org/ wordnet
- 13567960-n tha:lemma กระบวนการทรานแอมมิแนชัน
- 00155298-n tha:lemma การปฏิเสธ
- 14369530-n tha:lemma ภาวะการหายใจเร็วของทารกแรกเกิด
- 10850469-n tha:lemma เบธัน
- 11268326-n tha:lemma เรินต์เกน
- This data is formatted by the Open Multilingual Wordnet Project
- to be used by NLTK.
- Please cite us if you find the aggregation useful (see citation.bib)
- and email us if you have any suggestions.
- Francis Bond ([email protected])
- https://omwn.org/
- 2021-12-05
- 31 languages covered (and we assume you have English):
- wn-data-als.tab
- wn-data-arb.tab
- wn-data-bul.tab
- wn-data-cmn.tab
- wn-data-dan.tab
- wn-data-ell.tab
- wn-data-fin.tab
- wn-data-fra.tab
- wn-data-heb.tab
- wn-data-hrv.tab
- wn-data-isl.tab
- wn-data-ita.tab
- wn-data-ita.tab
- wn-data-jpn.tab
- wn-data-cat.tab
- wn-data-eus.tab
- wn-data-glg.tab
- wn-data-spa.tab
- wn-data-ind.tab
- wn-data-zsm.tab
- wn-data-nld.tab
- wn-data-nno.tab
- wn-data-nob.tab
- wn-data-pol.tab
- wn-data-por.tab
- wn-data-ron.tab
- wn-data-lit.tab
- wn-data-slk.tab
- wn-data-slv.tab
- wn-data-swe.tab
- wn-data-tha.tab
|