PLSAFloatThesaurus Class Reference

Compute and store the PLSA thesaurus.

#include <PLSAFloatThesaurus.h>

Inheritance diagram for PLSAFloatThesaurus:
ElementFloatIndex< SortedSubIndex< WordSubIndex > > FloatIndex

Public Member Functions

 PLSAFloatThesaurus (int totalWords, BlockStats *stats, SortedSubIndex< WordSubIndex > *index, FileName *fileName, int eigenvalues, int iterations)
 Compute each of the PLSA thesaurus values.
bool calculateRow (int currentRealWord)
 Calculate the row and return TRUE if it should be stored.
int listRealPosition (int position)
 Return the real list position (based on the initial index).
int indexRealPosition (int position)
 Return the real index position (based on the initial index).
Quantiser * createQuantiser (float lower, float upper)
 Assign a quantiser to convert floats to ints.
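
The float-to-int conversion mentioned for createQuantiser can be illustrated with a uniform quantiser over the range [lower, upper]. This is only a minimal sketch: the `UniformQuantiser` type, its `levels` parameter, and the rounding and clamping behaviour are assumptions made for illustration and are not taken from the textIR Quantiser implementation.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>

// Hypothetical uniform quantiser; NOT the textIR Quantiser class.
struct UniformQuantiser {
    float lower;  // smallest representable value
    float upper;  // largest representable value
    int levels;   // number of integer codes (assumed, e.g. 256 for one byte)

    UniformQuantiser(float lo, float hi, int n) : lower(lo), upper(hi), levels(n) {
        assert(hi > lo && n >= 2);
    }

    // Map a float to an integer code in [0, levels - 1]; out-of-range input is clamped.
    int quantise(float value) const {
        float clamped = std::min(std::max(value, lower), upper);
        float scaled = (clamped - lower) / (upper - lower) * (levels - 1);
        return static_cast<int>(std::lround(scaled));
    }

    // Map an integer code back to its representative float value.
    float dequantise(int code) const {
        return lower + (upper - lower) * static_cast<float>(code) / (levels - 1);
    }
};
```

With 256 levels each value fits in one byte, and the round-trip error is at most half a quantisation step, which is the usual trade-off behind storing float values in compressed form.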

Detailed Description

Compute and store the PLSA thesaurus.

This class computes the Probabilistic Latent Semantic Analysis (PLSA) values from the document index and stores them in a compressed thesaurus. It builds on the FloatIndex base class.
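
For readers unfamiliar with PLSA, the following sketch shows the standard EM fitting procedure on a small document-by-word count matrix. It is a generic illustration, not the code used by this class: `PlsaModel`, `fitPlsa`, and the deterministic initialisation are hypothetical, and treating the constructor's `eigenvalues` and `iterations` arguments as the number of latent topics and the number of EM iterations is an assumption.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

using Matrix = std::vector<std::vector<double>>;

// Hypothetical container for the fitted model; not a textIR type.
struct PlsaModel {
    Matrix wordGivenTopic;  // P(w|z), size topics x words
    Matrix topicGivenDoc;   // P(z|d), size docs x topics
};

// Scale a row so that it sums to 1 (left unchanged if the sum is zero).
static void normalise(std::vector<double>& row) {
    double sum = 0.0;
    for (double v : row) sum += v;
    if (sum > 0.0) {
        for (double& v : row) v /= sum;
    }
}

// Fit PLSA by EM to a document-by-word count matrix.
PlsaModel fitPlsa(const Matrix& counts, int topics, int iterations) {
    assert(!counts.empty() && topics > 0 && iterations >= 0);
    const std::size_t docs = counts.size();
    const std::size_t words = counts[0].size();
    const std::size_t K = static_cast<std::size_t>(topics);

    PlsaModel m{Matrix(K, std::vector<double>(words)),
                Matrix(docs, std::vector<double>(K))};

    // Deterministic, non-uniform starting point so the topics can separate.
    for (std::size_t z = 0; z < K; ++z) {
        for (std::size_t w = 0; w < words; ++w)
            m.wordGivenTopic[z][w] = 1.0 + static_cast<double>((w + z) % K);
        normalise(m.wordGivenTopic[z]);
    }
    for (std::size_t d = 0; d < docs; ++d) {
        for (std::size_t z = 0; z < K; ++z)
            m.topicGivenDoc[d][z] = 1.0 + static_cast<double>((d + z) % K);
        normalise(m.topicGivenDoc[d]);
    }

    for (int it = 0; it < iterations; ++it) {
        Matrix newWord(K, std::vector<double>(words, 0.0));
        Matrix newDoc(docs, std::vector<double>(K, 0.0));
        for (std::size_t d = 0; d < docs; ++d) {
            for (std::size_t w = 0; w < words; ++w) {
                if (counts[d][w] == 0.0) continue;
                // E-step: P(z|d,w) is proportional to P(w|z) * P(z|d).
                double total = 0.0;
                for (std::size_t z = 0; z < K; ++z)
                    total += m.wordGivenTopic[z][w] * m.topicGivenDoc[d][z];
                if (total == 0.0) continue;
                for (std::size_t z = 0; z < K; ++z) {
                    double post = m.wordGivenTopic[z][w] * m.topicGivenDoc[d][z] / total;
                    // M-step accumulators: expected counts n(d,w) * P(z|d,w).
                    newWord[z][w] += counts[d][w] * post;
                    newDoc[d][z] += counts[d][w] * post;
                }
            }
        }
        // M-step: renormalise the expected counts into probabilities.
        for (auto& row : newWord) normalise(row);
        for (auto& row : newDoc) normalise(row);
        m.wordGivenTopic = newWord;
        m.topicGivenDoc = newDoc;
    }
    return m;
}
```

Each row of `wordGivenTopic` and `topicGivenDoc` is a probability distribution after fitting. How these probabilities are combined into the word-to-word thesaurus values stored by this class is not specified here.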

Author
lapark
Revision
1.3.2.2
Date
2009/11/02 14:35:36
Id
PLSAFloatThesaurus.h,v 1.3.2.2 2009/11/02 14:35:36 lapark Exp

The documentation for this class was generated from the following files:

Generated on Tue Nov 10 14:12:15 2009 for textIR by  doxygen 1.6.1