The Thai WordNet was established in October, 2007. Based on the Princeton WordNet (PWN), we develop a method in generating the Thai WordNet by using an existing bi-lingual dictionary. We automatically align the PWN synsets to the bi-lingual dictionary via English equivalents and parts-of-speech (POS).
After the automatic alignment, manual translation and revision are carried out by experts and collaborative volunteers using a web-application tool, WNMS.
Release Notes
- The Thai WordNet is a free semantic dictionary. Basically, you may use, copy, modify and distribute the Thai WordNet for any purpose without any fee, so long as you keep it under the same license. See the full license for details.
- Please let me know if you use the Thai WordNet for any purpose or find any research publications where the Thai WordNet is applied.
Download
- 2011-01-26: Thai WordNet 1.0 as WordNet-LMF (xml)
- First release
- Statistics
- Number of words, synsets, and senses
POS Unique Strings Synsets Total Word-Sense Pairs Noun 66636 57048 74384 Verb 9256 9366 12616 Adjective 5393 4828 5737 Adverb 1965 2109 2509 Totals 83250 73351 95246
- Polysemy information
POS Monosemous
Words and SensesPolysemous
WordsPolysemous
SensesNoun 61494 5142 12890 Verb 7556 1700 5060 Adjective 5124 269 613 Adverb 1630 335 879 Totals 75804 7446 19442
POS Average Polysemy
Including Monosemous WordsAverage Polysemy
Excluding Monosemous WordsNoun 1.12 2.51 Verb 1.36 2.98 Adjective 1.06 2.28 Adverb 1.28 2.62
- Thai WordNet Construction
Sareewan Thoongsup, Thatsanee Charoenporn, Kergrit Robkop, Tan Sinthurahat, Chumpol Mokarat, Virach Sornlertlamvanich and Hitoshi Isahara., Proceedings of The 7th Workshop on Asian Language Resources (ALR7), Joint conference of the 47th Annual Meeting of the Association for Computational Linguistics (ACL) and the 4th International Joint Conference on Natural Language Processing (IJCNLP), Suntec, Singapore, August 6-7, 2009.
- Asian WordNet: Development and Service in Collaborative Approach
Virach Sornlertlamvanich., The 5th International Conference of the Global WordNet Association (GWC-2010), Mumbai, India , 31st Jan. - 4th Feb., 2010.
Other WordNets
Ontologies
- Kyoto Project - Knowledge Yielding Ontologies for Transition-based Organization
- SUMO - Suggested Upper Merged Ontology
Development Tools
- WNMS - WordNet Management System
- Language Grid
Lexical Markup Framework
- Prof. Dr. Hitoshi Isahara (isahara AT nict DOT go DOT jp)
- Dr. Virach Sornlertlamvanich (virach AT tcllab DOT org)
- Dr. Canasai Kruengkrai (canasai AT tcllab DOT org)