ws.qe
Class QueryExpansion
java.lang.Object
ws.qe.QueryExpansion
- public class QueryExpansion
- extends java.lang.Object
This class contains one method to process the download and analysis(query
expansion).
The download process has two modules selected through "Option"-menu. For
"Mudule I" the given number of URLs as Google search results are to be saved,
the process of web or server connecting, website downloading and
text-processing(e.g. tags and stopwords-elimination, stemming, etc.) for each
URL must be successfully accomplished within 5 seconds, otherwise the out-run
time limit or other possible errors cause the process unsuccessful; for
"Module II" only one URL is to be saved firstly, then the whole process for
this URL lasts at most 5 seconds, if this process ends because of out-run
time limit or other errors, then the process for anothr URL is in turn, util
the given number of websites are saved and processed. For instance, the whole
process for 10 websites in "Module I" takes totally at most 50 seconds, in
"Module II" at least 50 seconds.
The query expansion is based on two algorithms "Local Association Clustering"
and "Local Metric Clustering". More to the algorithms, please see the class
MatrixVector
, or the chapter 5 of the book:
Baeza-Yates, Ricardo; Ribeiro-Neto, Berthier Eds. (1999) Modern
Information Retrieval . Essex: Addison Wesley Longman Limited.
The implementation for the query expansion is supported by three external
libraries Google Web
APIs Service ( javadoc ), Jakarta
Lucene ( javadoc ) and Snowball Stemer for Lucene ( download ).
- Author:
- Sinian Zhang
- See Also:
Google
,
MatrixVector
,
Tools
Method Summary |
static void |
QE(int module,
javax.swing.JDialog stopDialog,
java.lang.String query,
int numOfWebs,
java.lang.String dbListFile)
This method process the download and analysis as query expansion. |
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
QueryExpansion
public QueryExpansion()
QE
public static void QE(int module,
javax.swing.JDialog stopDialog,
java.lang.String query,
int numOfWebs,
java.lang.String dbListFile)
throws com.google.soap.search.GoogleSearchFault
- This method process the download and analysis as query expansion.
- Parameters:
module
- Download module.stopDialog
- The "Stop"-dialog.query
- The word delivered by "Word"-inputfield.numOfWebs
- The same number which is delivered by the "Web"-box.dbListFile
- The file, where all the stems in the database are listed.
- Throws:
com.google.soap.search.GoogleSearchFault
- See Also:
MatrixVector.getStemStemVector_AC(Hashtable, String)
,
MatrixVector.getTopStemStemVector_MC(String, ArrayList, ArrayList,
int[])
,
Tools.queryExpansionResult(String, ArrayList, ArrayList, float[],
String)
,
Tools.updateDBListFile(String, String)