Japanese word sense disambiguation system based on deep feature extraction
-
摘要: 詞義消歧的特征來源于上下文.日文兼有中英文的語言特性,特征抽取更為復雜.針對日文特點,在詞義消歧邏輯模型基礎上,利用最大熵模型優良的信息融合性能,采用深層特征抽取方法,引入語義、句法類特征用于消解歧義.同時,為避免偏斜指派,采用BeamSearch算法進行詞義序列標注.實驗結果表明,與僅使用表層詞法類特征方法相比,本文構造的日文詞義消歧系統的消歧精度提高2%~3%,動詞消歧精度獲得5%的改善.Abstract: The features of word sense disambiguation (WSD) come from the context. Japanese has linguistic features of both Chinese and English at the same time, thus the feature extraction of Japanese is more complicated. Considering Japanese features, based on the proposed WSD logic model and applying the characteristics of information integration of the maximum entropy model, WSD was solved by the deep feature extraction method, introducing semantics and syntactics features. Meanwhile, for preventing the skewed assignment of lonely word sense, the word sense tagging of word sequences was completed with the BeamSearch algorithm. Experiment results show that compared with WSD methods which only focus on the surface lexical features, the disambiguation accuracy of the Japanese WSD system proposed in this paper increases 2% to 3%, and the WSD accuracy of verbs improves 5%.
-

計量
- 文章訪問數: 108
- HTML全文瀏覽量: 15
- PDF下載量: 5
- 被引次數: 0