This dataset contains all the words extracted from the Swiss-Prot version 9 data (with the resulting frequency for each word). Other datasets for other database versions can be obtained by contacting Michael Bell
Full details in http://arxiv.org/abs/arXiv:1208.2175v1