Package weka.core.stemmers
Class SnowballStemmer
java.lang.Object
weka.core.stemmers.SnowballStemmer
- All Implemented Interfaces:
Serializable
,OptionHandler
,RevisionHandler
,Stemmer
A wrapper class for the Snowball stemmers. Only
available if the Snowball classes are in the classpath.
If the class discovery is not dynamic, i.e., the property 'UseDynamic' in the props file 'weka/gui/GenericPropertiesCreator.props' is 'false', then the property 'org.tartarus.snowball.SnowballProgram' in the 'weka/gui/GenericObjectEditor.props' file has to be uncommented as well. If necessary you have to discover and fill in the snowball stemmers manually. You can use the 'weka.core.ClassDiscovery' for this:
java weka.core.ClassDiscovery org.tartarus.snowball.SnowballProgram org.tartarus.snowball.ext
Valid options are:
If the class discovery is not dynamic, i.e., the property 'UseDynamic' in the props file 'weka/gui/GenericPropertiesCreator.props' is 'false', then the property 'org.tartarus.snowball.SnowballProgram' in the 'weka/gui/GenericObjectEditor.props' file has to be uncommented as well. If necessary you have to discover and fill in the snowball stemmers manually. You can use the 'weka.core.ClassDiscovery' for this:
java weka.core.ClassDiscovery org.tartarus.snowball.SnowballProgram org.tartarus.snowball.ext
Valid options are:
-S <name> The name of the snowball stemmer (default 'porter'). available stemmers: danish, dutch, english, finnish, french, german, italian, norwegian, porter, portuguese, russian, spanish, swedish
- Version:
- $Revision: 15257 $
- Author:
- FracPete (fracpete at waikato dot ac dot nz)
- See Also:
-
Field Summary
-
Constructor Summary
ConstructorDescriptioninitializes the stemmer ("porter").SnowballStemmer
(String name) initializes the stemmer with the given stemmer. -
Method Summary
Modifier and TypeMethodDescriptionString[]
Gets the current settings of the classifier.Returns the revision string.returns the name of the current stemmer, null if none is set.Returns a string describing the stemmer.static boolean
returns whether Snowball is present or not, i.e.Returns an enumeration describing the available options.static Enumeration<String>
returns an enumeration over all currently stored stemmer names.static void
Runs the stemmer with the given options.void
setOptions
(String[] options) Parses the options.void
setStemmer
(String name) sets the stemmer with the given name, e.g., "porter".Returns the word in its stemmed form.Returns the tip text for this property.toString()
returns a string representation of the stemmer.
-
Field Details
-
PACKAGE
the package name for snowball.- See Also:
-
PACKAGE_EXT
the package name where the stemmers are located.- See Also:
-
-
Constructor Details
-
SnowballStemmer
public SnowballStemmer()initializes the stemmer ("porter"). -
SnowballStemmer
initializes the stemmer with the given stemmer.- Parameters:
name
- the name of the stemmer
-
-
Method Details
-
globalInfo
Returns a string describing the stemmer.- Returns:
- a description suitable for displaying in the explorer/experimenter gui
-
listOptions
Returns an enumeration describing the available options.- Specified by:
listOptions
in interfaceOptionHandler
- Returns:
- an enumeration of all the available options.
-
setOptions
Parses the options. Valid options are:-S <name> The name of the snowball stemmer (default 'porter'). available stemmers: danish, dutch, english, finnish, french, german, italian, norwegian, porter, portuguese, russian, spanish, swedish
- Specified by:
setOptions
in interfaceOptionHandler
- Parameters:
options
- the options to parse- Throws:
Exception
- if parsing fails
-
getOptions
Gets the current settings of the classifier.- Specified by:
getOptions
in interfaceOptionHandler
- Returns:
- an array of strings suitable for passing to setOptions
-
isPresent
public static boolean isPresent()returns whether Snowball is present or not, i.e. whether the classes are in the classpath or not- Returns:
- whether Snowball is available
-
listStemmers
returns an enumeration over all currently stored stemmer names.- Returns:
- all available stemmers
-
getStemmer
returns the name of the current stemmer, null if none is set.- Returns:
- the name of the stemmer
-
setStemmer
sets the stemmer with the given name, e.g., "porter".- Parameters:
name
- the name of the stemmer, e.g., "porter"
-
stemmerTipText
Returns the tip text for this property.- Returns:
- tip text for this property suitable for displaying in the explorer/experimenter gui
-
stem
Returns the word in its stemmed form. -
toString
returns a string representation of the stemmer. -
getRevision
Returns the revision string.- Specified by:
getRevision
in interfaceRevisionHandler
- Returns:
- the revision
-
main
Runs the stemmer with the given options.- Parameters:
args
- the options
-