Classes | |
class | ContextReader |
Class for reading the left and right context of a token. More... | |
class | CorpusManager |
Class for managing corpus repositories. More... | |
class | IndexManager |
Class for indexing and removing documents from the index. More... | |
class | SearchEngine |
Class that provides functionality for querying a corpus. More... | |
class | StatisticsEngine |
Class that provides functionality to do statistical calculations on a corpus. More... | |
class | Token |
Virtual base class for all Token descendants. More... | |
class | EmptyToken |
A Token without a token string nor any occurrences. More... | |
class | CorpusTokenBase |
Base class for all CorpusToken classes. More... | |
class | CorpusToken |
A Token of only one word. More... | |
class | LightCorpusToken |
A corpus token that only carries corpus frequency information (opposed to CorpusToken). More... | |
class | MutableToken |
A Token whose OccurrenceMap can be modified. The main purpose of MutableToken is to be a base class for phrases, whose occurrence matrix needs to be modified as phrases are themselve not indexed. More... | |
class | Phrase |
A phrasal token consists of several words or tokens. More... | |
class | TokenContext |
Class that holds one specific context to a Token. More... | |
class | Input |
class | StringInput |
class | FileInput |
class | Tokenizer |
class | TextSaver |
Namespaces | |
namespace | Exceptions |
Typedefs | |
typedef uint32_t | DocID |
The ID of a document in the data base. | |
typedef uint32_t | TokenID |
The ID of Token in the data base. | |
typedef uint32_t | IdxPos |
The byte offset of a Token in a document. | |
typedef std::vector< IdxPos > | PositionList |
The positions of a Token in a document. | |
typedef std::map< DocID, PositionList > | OccurrenceMap |
Documents and positions for a Token. | |
typedef boost::shared_ptr< Token > | TokenPtr |
Pointer to a Token object. | |
typedef boost::shared_ptr< TokenContext > | TokenContextPtr |
Pointer to a TokenContext object. | |
typedef std::vector< TokenPtr > | TokenVector |
A vector of TokenPtr. | |
typedef std::tr1::unordered_map< std::string, std::pair< TokenID, PositionList > > | idx_map |
Variables | |
static const TokenID | InvalidTokenID = 0 |
static const DocID | InvalidDocID = 0 |
typedef uint32_t PhraseHunter::DocID |
typedef std::tr1::unordered_map<std::string, std::pair<TokenID, PositionList> > PhraseHunter::idx_map |
Definition at line 43 of file indexer.cpp.
typedef uint32_t PhraseHunter::IdxPos |
typedef std::map<DocID, PositionList> PhraseHunter::OccurrenceMap |
typedef std::vector<IdxPos> PhraseHunter::PositionList |
typedef boost::shared_ptr<TokenContext> PhraseHunter::TokenContextPtr |
typedef uint32_t PhraseHunter::TokenID |
typedef boost::shared_ptr<Token> PhraseHunter::TokenPtr |
typedef std::vector<TokenPtr> PhraseHunter::TokenVector |
const DocID PhraseHunter::InvalidDocID = 0 [static] |
Definition at line 68 of file ph_types.h.
const TokenID PhraseHunter::InvalidTokenID = 0 [static] |
Definition at line 67 of file ph_types.h.
Referenced by PhraseHunter::Token::id(), and PhraseHunter::CorpusToken::loadFromCorpus().