ws.qe
Class QueryExpansion

java.lang.Object
  extended byws.qe.QueryExpansion

public class QueryExpansion
extends java.lang.Object

This class contains one method to process the download and analysis(query expansion).

The download process has two modules selected through "Option"-menu. For "Mudule I" the given number of URLs as Google search results are to be saved, the process of web or server connecting, website downloading and text-processing(e.g. tags and stopwords-elimination, stemming, etc.) for each URL must be successfully accomplished within 5 seconds, otherwise the out-run time limit or other possible errors cause the process unsuccessful; for "Module II" only one URL is to be saved firstly, then the whole process for this URL lasts at most 5 seconds, if this process ends because of out-run time limit or other errors, then the process for anothr URL is in turn, util the given number of websites are saved and processed. For instance, the whole process for 10 websites in "Module I" takes totally at most 50 seconds, in "Module II" at least 50 seconds.

The query expansion is based on two algorithms "Local Association Clustering" and "Local Metric Clustering". More to the algorithms, please see the class MatrixVector, or the chapter 5 of the book: Baeza-Yates, Ricardo; Ribeiro-Neto, Berthier Eds. (1999) Modern Information Retrieval . Essex: Addison Wesley Longman Limited.

The implementation for the query expansion is supported by three external libraries Google Web APIs Service ( javadoc ), Jakarta Lucene ( javadoc ) and Snowball Stemer for Lucene ( download ).

Author:
Sinian Zhang
See Also:
Google, MatrixVector, Tools

Constructor Summary
QueryExpansion()
           
 
Method Summary
static void QE(int module, javax.swing.JDialog stopDialog, java.lang.String query, int numOfWebs, java.lang.String dbListFile)
          This method process the download and analysis as query expansion.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

QueryExpansion

public QueryExpansion()
Method Detail

QE

public static void QE(int module,
                      javax.swing.JDialog stopDialog,
                      java.lang.String query,
                      int numOfWebs,
                      java.lang.String dbListFile)
               throws com.google.soap.search.GoogleSearchFault
This method process the download and analysis as query expansion.

Parameters:
module - Download module.
stopDialog - The "Stop"-dialog.
query - The word delivered by "Word"-inputfield.
numOfWebs - The same number which is delivered by the "Web"-box.
dbListFile - The file, where all the stems in the database are listed.
Throws:
com.google.soap.search.GoogleSearchFault
See Also:
MatrixVector.getStemStemVector_AC(Hashtable, String), MatrixVector.getTopStemStemVector_MC(String, ArrayList, ArrayList, int[]), Tools.queryExpansionResult(String, ArrayList, ArrayList, float[], String), Tools.updateDBListFile(String, String)