TL;DR: A method of automatic document classification is optimized to give best agreement with manually assigned classification of a set of test documents to measure the indexing worth of additional keywords.
Abstract: A method of automatic document classification is optimized to give best agreement with manually assigned classification of a set of test documents. The effectiveness of the automatic classification depends on the correlation between the relevance and the mutual keyword content of pairs of documents. An expression is obtained to measure the indexing worth of additional keywords. The resolution of the automatic classification is dependent on the correlation between mutual keyword content and the uniqueness of the association of categories with documents.