org.eml.sir.util
Class WordSelector

java.lang.Object
  extended byorg.eml.sir.util.WordSelector

public class WordSelector
extends java.lang.Object

Tokenizes user's input and creates a TreeTagger instances to obtain POS

Author:
Kostadin Cholakov

Field Summary
static java.lang.Character CONT
           
static java.lang.String DELIM
           
static java.lang.Character NOUN
           
 
Constructor Summary
WordSelector(java.lang.String input)
          Creates an instance of WordSelector and tokenizes user's input
 
Method Summary
 java.lang.String getContentWords()
          Returns content words (nouns, adjectives, verbs and adverbs) which are used for extended boolean information retreival
 java.util.ArrayList getNounList()
          Returns an ArrayList containing nouns which is used for semantic information retreival
 java.lang.String getNouns()
          Returns nouns which are used for extended boolean information retreival
 
Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

NOUN

public static final java.lang.Character NOUN

CONT

public static final java.lang.Character CONT

DELIM

public static final java.lang.String DELIM
See Also:
Constant Field Values
Constructor Detail

WordSelector

public WordSelector(java.lang.String input)
Creates an instance of WordSelector and tokenizes user's input

Parameters:
input - user's input to be tokenized
Method Detail

getNounList

public java.util.ArrayList getNounList()
Returns an ArrayList containing nouns which is used for semantic information retreival

Returns:
the ArrayList with nouns

getNouns

public java.lang.String getNouns()
Returns nouns which are used for extended boolean information retreival

Returns:
the nouns to be used

getContentWords

public java.lang.String getContentWords()
Returns content words (nouns, adjectives, verbs and adverbs) which are used for extended boolean information retreival

Returns:
the content words to be used