Class ReplaceWithMissingValue
java.lang.Object
weka.filters.Filter
weka.filters.SimpleFilter
weka.filters.SimpleBatchFilter
weka.filters.unsupervised.attribute.ReplaceWithMissingValue
- All Implemented Interfaces:
Serializable
,CapabilitiesHandler
,CapabilitiesIgnorer
,CommandlineRunnable
,OptionHandler
,Randomizable
,RevisionHandler
,WeightedAttributesHandler
,WeightedInstancesHandler
,UnsupervisedFilter
public class ReplaceWithMissingValue
extends SimpleBatchFilter
implements UnsupervisedFilter, Randomizable, WeightedAttributesHandler, WeightedInstancesHandler
A filter that can be used to introduce missing values in a dataset.
The specified probability is used to flip a biased coin to decide whether to replace a particular
attribute value in an instance with a missing value (i.e., a probability of 0.9 means 90% of values
will be replaced with missing values). This filter only modifies the first batch of data that is processed.
The class attribute is skipped by default.
Valid options are:
Valid options are:
-R <col1,col2-col4,...> Specifies list of columns to modify. First and last are valid indexes. (default: first-last)
-V Invert matching sense of column indexes.
-S <num> Specify the random number seed (default 1)
-P <double> Specify the probability (default 0.1)
-unset-class-temporarily Unsets the class index temporarily before the filter is applied to the data. (default: no)
- Version:
- $Revision: 10215 $
- Author:
- Eibe Frank (eibe@cs.waikato.ac.nz)
- See Also:
-
Constructor Summary
-
Method Summary
Modifier and TypeMethodDescriptionReturns the tip text for this propertyGets the current range selectionReturns the Capabilities of this filter.boolean
Gets the IgnoreClass value.boolean
Gets whether the supplied columns are to be worked on or the others.String[]
Gets the current settings of the filter.double
Get the probability.Returns the revision string.int
getSeed()
Get the random number generator seed value.Returns a string describing this filter.Returns the tip text for this propertyReturns the tip text for this propertyGets an enumeration describing the available options.static void
Main method for testing this class.Returns the tip text for this propertyReturns the tip text for this propertyvoid
setAttributeIndices
(String value) Sets which attributes are to be "nominalized" (only numeric attributes among the selection will be transformed).void
setAttributeIndicesArray
(int[] value) Sets which attributes are to be transoformed to nominal.void
setIgnoreClass
(boolean newIgnoreClass) Set the IgnoreClass value.void
setInvertSelection
(boolean value) Sets whether selected columns should be worked on or all the others apart from these.void
setOptions
(String[] options) Parses a given list of options.void
setProbability
(double newProbability) Set the probability to use.void
setSeed
(int newSeed) Set the random number generator seed value.Methods inherited from class weka.filters.SimpleBatchFilter
allowAccessToFullInputFormat, batchFinished, input, input
Methods inherited from class weka.filters.SimpleFilter
setInputFormat
Methods inherited from class weka.filters.Filter
batchFilterFile, debugTipText, doNotCheckCapabilitiesTipText, filterFile, getCapabilities, getCopyOfInputFormat, getDebug, getDoNotCheckCapabilities, getOutputFormat, isFirstBatchDone, isNewBatch, isOutputFormatDefined, makeCopies, makeCopy, mayRemoveInstanceAfterFirstBatchDone, numPendingOutput, output, outputPeek, postExecution, preExecution, run, runFilter, setDebug, setDoNotCheckCapabilities, toString, useFilter, wekaStaticWrapper
-
Constructor Details
-
ReplaceWithMissingValue
public ReplaceWithMissingValue()
-
-
Method Details
-
listOptions
Gets an enumeration describing the available options.- Specified by:
listOptions
in interfaceOptionHandler
- Overrides:
listOptions
in classFilter
- Returns:
- an enumeration of all the available options.
-
setOptions
Parses a given list of options. Valid options are:-R <col1,col2-col4,...> Specifies list of columns to modify. First and last are valid indexes. (default: first-last)
-V Invert matching sense of column indexes.
-S <num> Specify the random number seed (default 1)
-P <double> Specify the probability (default 0.1)
-unset-class-temporarily Unsets the class index temporarily before the filter is applied to the data. (default: no)
- Specified by:
setOptions
in interfaceOptionHandler
- Overrides:
setOptions
in classFilter
- Parameters:
options
- the list of options as an array of strings- Throws:
Exception
- if an option is not supported
-
getOptions
Gets the current settings of the filter.- Specified by:
getOptions
in interfaceOptionHandler
- Overrides:
getOptions
in classFilter
- Returns:
- an array of strings suitable for passing to setOptions
-
ignoreClassTipText
Returns the tip text for this property- Returns:
- tip text for this property suitable for displaying in the explorer/experimenter gui
-
setIgnoreClass
public void setIgnoreClass(boolean newIgnoreClass) Set the IgnoreClass value. Set this to true if the class index is to be unset before the filter is applied.- Parameters:
newIgnoreClass
- The new IgnoreClass value.
-
getIgnoreClass
public boolean getIgnoreClass()Gets the IgnoreClass value. If this to true then the class index is to unset before the filter is applied.- Returns:
- the current IgnoreClass value.
-
probabilityTipText
Returns the tip text for this property- Returns:
- tip text for this property suitable for displaying in the explorer/experimenter gui
-
getProbability
public double getProbability()Get the probability.- Returns:
- the probability.
-
setProbability
public void setProbability(double newProbability) Set the probability to use.- Parameters:
newProbability
- the probability to use.
-
seedTipText
Returns the tip text for this property- Returns:
- tip text for this property suitable for displaying in the explorer/experimenter gui
-
getSeed
public int getSeed()Get the random number generator seed value.- Specified by:
getSeed
in interfaceRandomizable
- Returns:
- random number generator seed value.
-
setSeed
public void setSeed(int newSeed) Set the random number generator seed value.- Specified by:
setSeed
in interfaceRandomizable
- Parameters:
newSeed
- value to use as the random number generator seed.
-
invertSelectionTipText
Returns the tip text for this property- Returns:
- tip text for this property suitable for displaying in the explorer/experimenter gui
-
getInvertSelection
public boolean getInvertSelection()Gets whether the supplied columns are to be worked on or the others.- Returns:
- true if the supplied columns will be worked on
-
setInvertSelection
public void setInvertSelection(boolean value) Sets whether selected columns should be worked on or all the others apart from these. If true all the other columns are considered for "nominalization".- Parameters:
value
- the new invert setting
-
attributeIndicesTipText
Returns the tip text for this property- Returns:
- tip text for this property suitable for displaying in the explorer/experimenter gui
-
getAttributeIndices
Gets the current range selection- Returns:
- a string containing a comma separated list of ranges
-
setAttributeIndices
Sets which attributes are to be "nominalized" (only numeric attributes among the selection will be transformed).- Parameters:
value
- a string representing the list of attributes. Since the string will typically come from a user, attributes are indexed from 1.
eg: first-3,5,6-last- Throws:
IllegalArgumentException
- if an invalid range list is supplied
-
setAttributeIndicesArray
public void setAttributeIndicesArray(int[] value) Sets which attributes are to be transoformed to nominal. (only numeric attributes among the selection will be transformed).- Parameters:
value
- an array containing indexes of attributes to nominalize. Since the array will typically come from a program, attributes are indexed from 0.- Throws:
IllegalArgumentException
- if an invalid set of ranges is supplied
-
getCapabilities
Returns the Capabilities of this filter.- Specified by:
getCapabilities
in interfaceCapabilitiesHandler
- Overrides:
getCapabilities
in classFilter
- Returns:
- the capabilities of this object
- See Also:
-
globalInfo
Returns a string describing this filter.- Specified by:
globalInfo
in classSimpleFilter
- Returns:
- a description of the filter suitable for displaying in the explorer/experimenter gui
-
getRevision
Returns the revision string.- Specified by:
getRevision
in interfaceRevisionHandler
- Overrides:
getRevision
in classFilter
- Returns:
- the revision
-
main
Main method for testing this class.- Parameters:
argv
- should contain arguments to the filter: use -h for help
-