I'd like a complete list of parts of speech (e.g., adj., adv., and .v) for English words. All I need is just a TSV table with two columns, with the first column the word and the second column POS.

I know that wordnet should contain such information. But it contains more than what I needed. I am not sure which file I should use.


Does anybody know a convenience-to-use file that contains English words and their POS? Thanks.

1 Answers

alvas On

Wordnet doesn't give you "word" per say but lemmas.

from nltk.corpus import wordnet as wn

for ss in wn.all_synsets():
    for lemma in ss.lemma_names():
        print(ss.offset() + '\t' + ss.pos() +'\t' + lemma)