org.apache.lucene.analysis
Class WordlistLoader

java.lang.Object
  extended byorg.apache.lucene.analysis.WordlistLoader

public class WordlistLoader
extends Object

Loader for text files that represent a list of stopwords.

Version:
$Id: WordlistLoader.java 192989 2005-06-22 19:59:03Z dnaber $
Author:
Gerhard Schwarz

Constructor Summary
WordlistLoader()
           
 
Method Summary
static HashSet getWordSet(File wordfile)
          Loads a text file and adds every line as an entry to a HashSet (omitting leading and trailing whitespace).
static HashSet getWordSet(Reader reader)
          Reads lines from a Reader and adds every line as an entry to a HashSet (omitting leading and trailing whitespace).
static Hashtable getWordtable(File wordfile)
          Deprecated. Use getWordSet(File) instead
static Hashtable getWordtable(String wordfile)
          Deprecated. Use getWordSet(File) instead
static Hashtable getWordtable(String path, String wordfile)
          Deprecated. Use getWordSet(File) instead
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

WordlistLoader

public WordlistLoader()
Method Detail

getWordSet

public static HashSet getWordSet(File wordfile)
                          throws IOException
Loads a text file and adds every line as an entry to a HashSet (omitting leading and trailing whitespace). Every line of the file should contain only one word. The words need to be in lowercase if you make use of an Analyzer which uses LowerCaseFilter (like StandardAnalyzer).

Parameters:
wordfile - File containing the wordlist
Returns:
A HashSet with the file's words
Throws:
IOException

getWordSet

public static HashSet getWordSet(Reader reader)
                          throws IOException
Reads lines from a Reader and adds every line as an entry to a HashSet (omitting leading and trailing whitespace). Every line of the Reader should contain only one word. The words need to be in lowercase if you make use of an Analyzer which uses LowerCaseFilter (like StandardAnalyzer).

Parameters:
reader - Reader containing the wordlist
Returns:
A HashSet with the reader's words
Throws:
IOException

getWordtable

public static Hashtable getWordtable(String path,
                                     String wordfile)
                              throws IOException
Deprecated. Use getWordSet(File) instead

Parameters:
path - Path to the wordlist
wordfile - Name of the wordlist
Throws:
IOException

getWordtable

public static Hashtable getWordtable(String wordfile)
                              throws IOException
Deprecated. Use getWordSet(File) instead

Parameters:
wordfile - Complete path to the wordlist
Throws:
IOException

getWordtable

public static Hashtable getWordtable(File wordfile)
                              throws IOException
Deprecated. Use getWordSet(File) instead

Parameters:
wordfile - File object that points to the wordlist
Throws:
IOException


Copyright © 2000-2006 Apache Software Foundation. All Rights Reserved.