Package weka.filters.supervised.instance
Class StratifiedRemoveFolds
java.lang.Object
weka.filters.Filter
weka.filters.supervised.instance.StratifiedRemoveFolds
- All Implemented Interfaces:
Serializable,CapabilitiesHandler,CapabilitiesIgnorer,CommandlineRunnable,OptionHandler,RevisionHandler,WeightedAttributesHandler,SupervisedFilter
public class StratifiedRemoveFolds
extends Filter
implements SupervisedFilter, OptionHandler, WeightedAttributesHandler
This filter takes a dataset and outputs a specified
fold for cross validation. If you do not want the folds to be stratified use
the unsupervised version.
Valid options are:
-V Specifies if inverse of selection is to be output.
-N <number of folds> Specifies number of folds dataset is split into. (default 10)
-F <fold> Specifies which fold is selected. (default 1)
-S <seed> Specifies random number seed. (default 0, no randomizing)
- Version:
- $Revision: 14508 $
- Author:
- Eibe Frank (eibe@cs.waikato.ac.nz)
- See Also:
-
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionbooleanSignify that this batch of input to the filter is finished.Returns the tip text for this propertyReturns the Capabilities of this filter.intgetFold()Gets the fold which is selected.booleanGets if selection is to be inverted.intGets the number of folds in which dataset is to be split into.String[]Gets the current settings of the filter.Returns the revision string.longgetSeed()Gets the random number seed used for shuffling the dataset.Returns a string describing this filterbooleanInput an instance for filtering.Returns the tip text for this propertyGets an enumeration describing the available options..static voidMain method for testing this class.Returns the tip text for this propertyReturns the tip text for this propertyvoidsetFold(int fold) Selects a fold.booleansetInputFormat(Instances instanceInfo) Sets the format of the input instances.voidsetInvertSelection(boolean inverse) Sets if selection is to be inverted.voidsetNumFolds(int numFolds) Sets the number of folds the dataset is split into.voidsetOptions(String[] options) Parses a given list of options.voidsetSeed(long seed) Sets the random number seed for shuffling the dataset.Methods inherited from class weka.filters.Filter
batchFilterFile, debugTipText, doNotCheckCapabilitiesTipText, filterFile, getCapabilities, getCopyOfInputFormat, getDebug, getDoNotCheckCapabilities, getOutputFormat, isFirstBatchDone, isNewBatch, isOutputFormatDefined, makeCopies, makeCopy, mayRemoveInstanceAfterFirstBatchDone, numPendingOutput, output, outputPeek, postExecution, preExecution, run, runFilter, setDebug, setDoNotCheckCapabilities, toString, useFilter, wekaStaticWrapper
-
Constructor Details
-
StratifiedRemoveFolds
public StratifiedRemoveFolds()
-
-
Method Details
-
listOptions
Gets an enumeration describing the available options..- Specified by:
listOptionsin interfaceOptionHandler- Overrides:
listOptionsin classFilter- Returns:
- an enumeration of all the available options.
-
setOptions
Parses a given list of options. Valid options are:-V Specifies if inverse of selection is to be output.
-N <number of folds> Specifies number of folds dataset is split into. (default 10)
-F <fold> Specifies which fold is selected. (default 1)
-S <seed> Specifies random number seed. (default 0, no randomizing)
- Specified by:
setOptionsin interfaceOptionHandler- Overrides:
setOptionsin classFilter- Parameters:
options- the list of options as an array of strings- Throws:
Exception- if an option is not supported
-
getOptions
Gets the current settings of the filter.- Specified by:
getOptionsin interfaceOptionHandler- Overrides:
getOptionsin classFilter- Returns:
- an array of strings suitable for passing to setOptions
-
globalInfo
Returns a string describing this filter- Returns:
- a description of the filter suitable for displaying in the explorer/experimenter gui
-
invertSelectionTipText
Returns the tip text for this property- Returns:
- tip text for this property suitable for displaying in the explorer/experimenter gui
-
getInvertSelection
public boolean getInvertSelection()Gets if selection is to be inverted.- Returns:
- true if the selection is to be inverted
-
setInvertSelection
public void setInvertSelection(boolean inverse) Sets if selection is to be inverted.- Parameters:
inverse- true if inversion is to be performed
-
numFoldsTipText
Returns the tip text for this property- Returns:
- tip text for this property suitable for displaying in the explorer/experimenter gui
-
getNumFolds
public int getNumFolds()Gets the number of folds in which dataset is to be split into.- Returns:
- the number of folds the dataset is to be split into.
-
setNumFolds
public void setNumFolds(int numFolds) Sets the number of folds the dataset is split into. If the number of folds is zero, it won't split it into folds.- Parameters:
numFolds- number of folds dataset is to be split into- Throws:
IllegalArgumentException- if number of folds is negative
-
foldTipText
Returns the tip text for this property- Returns:
- tip text for this property suitable for displaying in the explorer/experimenter gui
-
getFold
public int getFold()Gets the fold which is selected.- Returns:
- the fold which is selected
-
setFold
public void setFold(int fold) Selects a fold.- Parameters:
fold- the fold to be selected.- Throws:
IllegalArgumentException- if fold's index is smaller than 1
-
seedTipText
Returns the tip text for this property- Returns:
- tip text for this property suitable for displaying in the explorer/experimenter gui
-
getSeed
public long getSeed()Gets the random number seed used for shuffling the dataset.- Returns:
- the random number seed
-
setSeed
public void setSeed(long seed) Sets the random number seed for shuffling the dataset. If seed is negative, shuffling won't be performed.- Parameters:
seed- the random number seed
-
getCapabilities
Returns the Capabilities of this filter.- Specified by:
getCapabilitiesin interfaceCapabilitiesHandler- Overrides:
getCapabilitiesin classFilter- Returns:
- the capabilities of this object
- See Also:
-
setInputFormat
Sets the format of the input instances.- Overrides:
setInputFormatin classFilter- Parameters:
instanceInfo- an Instances object containing the input instance structure (any instances contained in the object are ignored - only the structure is required).- Returns:
- true because outputFormat can be collected immediately
- Throws:
Exception- if the input format can't be set successfully
-
input
Input an instance for filtering. Filter requires all training instances be read before producing output.- Overrides:
inputin classFilter- Parameters:
instance- the input instance- Returns:
- true if the filtered instance may now be collected with output().
- Throws:
IllegalStateException- if no input structure has been defined
-
batchFinished
public boolean batchFinished()Signify that this batch of input to the filter is finished. Output() may now be called to retrieve the filtered instances.- Overrides:
batchFinishedin classFilter- Returns:
- true if there are instances pending output
- Throws:
IllegalStateException- if no input structure has been defined
-
getRevision
Returns the revision string.- Specified by:
getRevisionin interfaceRevisionHandler- Overrides:
getRevisionin classFilter- Returns:
- the revision
-
main
Main method for testing this class.- Parameters:
argv- should contain arguments to the filter: use -h for help
-