PhraseHunter Namespace Reference


Classes

class  ContextReader
 Class for reading the left and right context of a token.
class  CorpusManager
 Class for managing corpus repositories.
class  IndexManager
 Class for indexing and removing documents from the index.
class  SearchEngine
 Class that provides functionality for querying a corpus.
class  StatisticsEngine
 Class that provides functionality for statistical calculations on a corpus.
class  Token
 Virtual base class for all Token descendants.
class  EmptyToken
 A Token with neither a token string nor any occurrences.
class  CorpusTokenBase
 Base class for all CorpusToken classes.
class  CorpusToken
 A Token consisting of a single word.
class  LightCorpusToken
 A corpus token that carries only corpus frequency information (as opposed to CorpusToken).
class  MutableToken
 A Token whose OccurrenceMap can be modified. MutableToken mainly serves as a base class for phrases, whose occurrence matrix needs to be modified because phrases themselves are not indexed.
class  Phrase
 A phrasal token consisting of several words or tokens.
class  TokenContext
 Class that holds one specific context of a Token.
class  Input
class  StringInput
class  FileInput
class  Tokenizer
class  TextSaver

Namespaces

namespace  Exceptions

Typedefs

typedef uint32_t DocID
 The ID of a document in the database.
typedef uint32_t TokenID
 The ID of a Token in the database.
typedef uint32_t IdxPos
 The byte offset of a Token in a document.
typedef std::vector< IdxPos > PositionList
 The positions of a Token in a document.
typedef std::map< DocID, PositionList > OccurrenceMap
 Documents and positions for a Token.
typedef boost::shared_ptr< Token > TokenPtr
 Pointer to a Token object.
typedef boost::shared_ptr< TokenContext > TokenContextPtr
 Pointer to a TokenContext object.
typedef std::vector< TokenPtr > TokenVector
 A vector of TokenPtr.
typedef std::tr1::unordered_map< std::string, std::pair< TokenID, PositionList > > idx_map

Variables

static const TokenID InvalidTokenID = 0
static const DocID InvalidDocID = 0


Typedef Documentation

typedef uint32_t PhraseHunter::DocID

The ID of a document in the database.

Definition at line 45 of file ph_types.h.

typedef std::tr1::unordered_map<std::string, std::pair<TokenID, PositionList> > PhraseHunter::idx_map

Definition at line 43 of file indexer.cpp.
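
The source gives no description for idx_map, so the following is only a sketch of how a map of this shape might be filled: each word maps to a pair of its TokenID and the positions collected so far. The helper recordToken is hypothetical and not part of PhraseHunter, and std::unordered_map (C++11) is used here as a stand-in for std::tr1::unordered_map so that the fragment compiles on its own.

```cpp
#include <stdint.h>
#include <string>
#include <unordered_map>
#include <utility>
#include <vector>

// Local copies of the documented typedefs (normally from ph_types.h).
typedef uint32_t TokenID;
typedef uint32_t IdxPos;
typedef std::vector<IdxPos> PositionList;

// Assumption: std::unordered_map stands in for std::tr1::unordered_map.
typedef std::unordered_map<std::string, std::pair<TokenID, PositionList> > idx_map;

// Hypothetical helper: append a position for `word`. The entry is created
// with the supplied ID the first time the word is seen; on later calls the
// stored ID is kept and only the position is appended.
void recordToken(idx_map& index, const std::string& word, TokenID id, IdxPos pos)
{
    idx_map::iterator it = index.find(word);
    if (it == index.end()) {
        it = index.insert(
            std::make_pair(word, std::make_pair(id, PositionList()))).first;
    }
    it->second.second.push_back(pos);
}
```

How the real indexer assigns TokenIDs is not stated in this reference; in the sketch the caller simply passes one in.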

typedef uint32_t PhraseHunter::IdxPos

The byte offset of a Token in a document.

Definition at line 52 of file ph_types.h.

typedef std::map<DocID, PositionList> PhraseHunter::OccurrenceMap

Documents and positions for a Token.

Definition at line 57 of file ph_types.h.

typedef std::vector<IdxPos> PhraseHunter::PositionList

The positions of a Token in a document.

Definition at line 55 of file ph_types.h.
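
To illustrate how PositionList and OccurrenceMap nest, here is a minimal sketch that records occurrences per document and counts them. The typedefs are copied from this page; the helpers addOccurrence and totalOccurrences are hypothetical names introduced only for this example and are not part of the documented API.

```cpp
#include <cstddef>
#include <map>
#include <stdint.h>
#include <vector>

// Local copies of the documented typedefs (normally from ph_types.h).
typedef uint32_t DocID;
typedef uint32_t IdxPos;
typedef std::vector<IdxPos> PositionList;
typedef std::map<DocID, PositionList> OccurrenceMap;

// Hypothetical helper: record one occurrence at byte offset `pos` in `doc`.
// operator[] creates an empty PositionList for a document seen for the
// first time.
void addOccurrence(OccurrenceMap& occurrences, DocID doc, IdxPos pos)
{
    occurrences[doc].push_back(pos);
}

// Hypothetical helper: total number of occurrences across all documents.
std::size_t totalOccurrences(const OccurrenceMap& occurrences)
{
    std::size_t total = 0;
    for (OccurrenceMap::const_iterator it = occurrences.begin();
         it != occurrences.end(); ++it) {
        total += it->second.size();
    }
    return total;
}
```

Because OccurrenceMap is a std::map, iterating it visits documents in ascending DocID order, and its size() is the number of distinct documents in which the token occurs.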

typedef boost::shared_ptr<TokenContext> PhraseHunter::TokenContextPtr

Pointer to a TokenContext object.

Definition at line 62 of file ph_types.h.

typedef uint32_t PhraseHunter::TokenID

The ID of a Token in the database.

Definition at line 50 of file ph_types.h.

typedef boost::shared_ptr<Token> PhraseHunter::TokenPtr

Pointer to a Token object.

Definition at line 60 of file ph_types.h.

typedef std::vector<TokenPtr> PhraseHunter::TokenVector

A vector of TokenPtr.

Definition at line 65 of file ph_types.h.


Variable Documentation

const DocID PhraseHunter::InvalidDocID = 0 [static]

Definition at line 68 of file ph_types.h.

const TokenID PhraseHunter::InvalidTokenID = 0 [static]

Definition at line 67 of file ph_types.h.

Referenced by PhraseHunter::Token::id(), and PhraseHunter::CorpusToken::loadFromCorpus().
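
Both constants are 0, which suggests they act as sentinels for "no such token" and "no such document". A minimal sketch of checking against them follows; the helpers isValidToken and isValidDoc are hypothetical and not part of PhraseHunter, and the sentinel interpretation is an assumption drawn from the names.

```cpp
#include <stdint.h>

// Local copies of the documented declarations (normally from ph_types.h).
typedef uint32_t TokenID;
typedef uint32_t DocID;
static const TokenID InvalidTokenID = 0;
static const DocID InvalidDocID = 0;

// Hypothetical helpers. They need distinct names: TokenID and DocID are both
// typedefs of uint32_t, so the compiler cannot overload on them.
inline bool isValidToken(TokenID id) { return id != InvalidTokenID; }
inline bool isValidDoc(DocID id) { return id != InvalidDocID; }
```

Since the two ID types are the same underlying type, passing a DocID where a TokenID is expected compiles without a diagnostic, so such checks only guard against the sentinel value itself.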


Generated on Thu Dec 21 16:14:44 2006 for The Phrasehunter by  doxygen 1.5.1