Package weka.classifiers.trees.j48
Class C45Split
java.lang.Object
weka.classifiers.trees.j48.ClassifierSplitModel
weka.classifiers.trees.j48.C45Split
- All Implemented Interfaces:
Serializable
,Cloneable
,RevisionHandler
Class implementing a C4.5-type split on an attribute.
- Version:
- $Revision: 14911 $
- Author:
- Eibe Frank (eibe@cs.waikato.ac.nz)
- See Also:
-
Constructor Summary
ConstructorDescriptionC45Split
(int attIndex, int minNoObj, double sumOfWeights, boolean useMDLcorrection) Initializes the split model. -
Method Summary
Modifier and TypeMethodDescriptionfinal int
attIndex()
Returns index of attribute for which split was generated.void
buildClassifier
(Instances trainInstances) Creates a C4.5-type split on the given data.final double
Gets class probability for instance.final double
Returns coding cost for split (used in rule learner).final double
Returns (C4.5-type) gain ratio for the generated split.Returns the revision string.final double
infoGain()
Returns (C4.5-type) information gain for the generated split.final String
Prints left side of condition..final double[][]
minsAndMaxs
(Instances data, double[][] minsAndMaxs, int index) Returns the minsAndMaxs of the index.th subset.void
resetDistribution
(Instances data) Sets distribution associated with model.final String
Prints the condition satisfied by instances in a subset.final void
setSplitPoint
(Instances allInstances) Sets split point to greatest value in given data smaller or equal to old split point.final String
sourceExpression
(int index, Instances data) Returns a string containing java source code equivalent to the test made at this node.double
Returns the split point (numeric attribute only).final double[]
Returns weights if instance is assigned to more than one subset.final int
whichSubset
(Instance instance) Returns index of subset instance is assigned to.Methods inherited from class weka.classifiers.trees.j48.ClassifierSplitModel
checkModel, classifyInstance, classProbLaplace, clone, distribution, dumpLabel, dumpModel, numSubsets, setDistribution, sourceClass, split
-
Constructor Details
-
C45Split
public C45Split(int attIndex, int minNoObj, double sumOfWeights, boolean useMDLcorrection) Initializes the split model.
-
-
Method Details
-
buildClassifier
Creates a C4.5-type split on the given data. Assumes that none of the class values is missing.- Specified by:
buildClassifier
in classClassifierSplitModel
- Throws:
Exception
- if something goes wrong
-
attIndex
public final int attIndex()Returns index of attribute for which split was generated. -
splitPoint
public double splitPoint()Returns the split point (numeric attribute only).- Returns:
- the split point used for a test on a numeric attribute
-
classProb
Gets class probability for instance.- Overrides:
classProb
in classClassifierSplitModel
- Throws:
Exception
- if something goes wrong
-
codingCost
public final double codingCost()Returns coding cost for split (used in rule learner).- Overrides:
codingCost
in classClassifierSplitModel
-
gainRatio
public final double gainRatio()Returns (C4.5-type) gain ratio for the generated split. -
infoGain
public final double infoGain()Returns (C4.5-type) information gain for the generated split. -
leftSide
Prints left side of condition..- Specified by:
leftSide
in classClassifierSplitModel
- Parameters:
data
- training set.
-
rightSide
Prints the condition satisfied by instances in a subset.- Specified by:
rightSide
in classClassifierSplitModel
- Parameters:
index
- of subsetdata
- training set.
-
sourceExpression
Returns a string containing java source code equivalent to the test made at this node. The instance being tested is called "i".- Specified by:
sourceExpression
in classClassifierSplitModel
- Parameters:
index
- index of the nominal value testeddata
- the data containing instance structure info- Returns:
- a value of type 'String'
-
setSplitPoint
Sets split point to greatest value in given data smaller or equal to old split point. (C4.5 does this for some strange reason). -
minsAndMaxs
Returns the minsAndMaxs of the index.th subset. -
resetDistribution
Sets distribution associated with model.- Overrides:
resetDistribution
in classClassifierSplitModel
- Throws:
Exception
-
weights
Returns weights if instance is assigned to more than one subset. Returns null if instance is only assigned to one subset.- Specified by:
weights
in classClassifierSplitModel
-
whichSubset
Returns index of subset instance is assigned to. Returns -1 if instance is assigned to more than one subset.- Specified by:
whichSubset
in classClassifierSplitModel
- Throws:
Exception
- if something goes wrong
-
getRevision
Returns the revision string.- Returns:
- the revision
-