开发者

Baum-Welch algorithm for pos tagger

开发者 https://www.devze.com 2023-02-15 18:12 出处:网络
everyone. I\'m using the Baum-Welch algorithm to train a pos tagger,it is totally in the unsupervised way.

everyone. I'm using the Baum-Welch algorithm to train a pos tagger,it is totally in the unsupervised way. Here comes the problem: When i get the label result, I only get a sequence of numbers. I can't figur开发者_C百科e out which label stands for VV,NN,DT. How can I solve this problem?


In general, there's no way to do that. Baum-Welch will find classes of word uses that have similar distributions, but there's no particular reason to suppose that those classes will map in any straightforward way to categories posited by any specific linguistic theory. Therefore, unsupervised POS taggers are mainly useful for applications where you care about equivalence classes of words or phrases but not about the specific tags being assigned.

If you really need human-readable labels, though (e.g., during development, to evaluate whether the results you're getting are even remotely plausible), I'd hand-tag a few dozen sentences. Then you could apply your B-W-derived tagger to that labeled mini-corpus to induce a mapping between class numbers and POS labels.

0

精彩评论

暂无评论...
验证码 换一张
取 消