Gather document stats while indexing. More...
#include <GatherDocumentList.h>
Public Member Functions | |
GatherDocumentList (void) | |
Initialise document list. | |
void | addDocument (char *name, int number) |
Add a new document to the queue. | |
void | addTotalTermCount (int number) |
Add the word count to the latest document stats. | |
void | addUniqueTermCount (int number) |
Add the term count to the latest document stats. | |
void | purgeDocuments (void) |
Write the document stats to a file. | |
void | salvageDocuments (void) |
Read the document stats from a file. | |
Document ** | generateDocumentList (void) |
Return a vector of documents. | |
void | buildDocumentList (DocumentList *list) |
Build a DocumentList from the gathered document statistics. |
Gather document stats while indexing.
This class is used to gather document statisitics on the fly while scanning a text document corpus. When finished, a DocumentList can be constructed.