An improved maximum relevance and minimum redundancy selective Bayesian classifier
-
摘要: 利用K均值聚類和增量學習算法擴大訓練樣本規模,提出一種改進的mRMR SBC.一方面,利用K均值聚類預測測試樣本的類標簽,將已標記的測試樣本添加到訓練集中,并在屬性選擇過程中引入一個調節因子以降低K均值聚類誤標記帶來的風險.另一方面,從測試樣本集中選擇有助于提高當前分類器精度的實例,把它加入到訓練集中,來增量地修正貝葉斯分類器的參數.實驗結果表明,與mRMR SBC相比,所提方法具有較好的分類效果,適于解決高維且含有較少類標簽的數據集分類問題.Abstract: A kind of improved mRMR SBC was proposed by using K-means clustering and incremental learning algorithms to enlarge the scale of training samples. On one hand, the testing samples are labeled using the K-means clustering algorithm and are added to the training set. A regulatory factor is introduced into the process of attribute selection to reduce the risk of mislabel resulting from K-means clustering. On the other hand, some samples that are most helpful for improving the current classification accuracy are selected from the testing set and are added to the training set. Based on the enlarged training set, parameters in the Bayesian classifier are adjusted incrementally. Experimental results show that compared with mRMR SBC, the proposed Bayesian classifier has better classification results and is applicable for solving the classification problem for the high-dimensional dataset with little labels.
-
Key words:
- classifiers /
- attribute selection /
- redundancy /
- K-means clustering /
- incremental learning
-

計量
- 文章訪問數: 210
- HTML全文瀏覽量: 84
- PDF下載量: 5
- 被引次數: 0