Package weka.classifiers.trees.lmt
Class LMTNode
java.lang.Object
weka.classifiers.AbstractClassifier
weka.classifiers.trees.lmt.LogisticBase
weka.classifiers.trees.lmt.LMTNode
- All Implemented Interfaces:
Serializable
,Cloneable
,Classifier
,BatchPredictor
,CapabilitiesHandler
,CapabilitiesIgnorer
,CommandlineRunnable
,OptionHandler
,RevisionHandler
,WeightedInstancesHandler
Class for logistic model tree structure.
- Version:
- $Revision: 11566 $
- Author:
- Niels Landwehr, Marc Sumner
- See Also:
-
Field Summary
Modifier and TypeFieldDescriptiondouble
Alpha-value (for pruning) at the nodedouble
Weighted number of training examples currently misclassified by the logistic model at the nodedouble
Weighted number of training examples currently misclassified by the subtree rooted at the nodeFields inherited from class weka.classifiers.AbstractClassifier
BATCH_SIZE_DEFAULT, NUM_DECIMAL_PLACES_DEFAULT
-
Constructor Summary
ConstructorDescriptionLMTNode
(ModelSelection modelSelection, int numBoostingIterations, boolean fastRegression, boolean errorOnProbabilities, int minNumInstances, double weightTrimBeta, boolean useAIC, NominalToBinary ntb, int numDecimalPlaces) Constructor for logistic model tree node. -
Method Summary
Modifier and TypeMethodDescriptionint
assignIDs
(int lastID) Assigns unique IDs to all nodes in the treeint
assignLeafModelNumbers
(int leafCounter) Assigns numbers to the logistic regression models at the leaves of the treevoid
buildClassifier
(Instances data) Method for building a logistic model tree (only called for the root node).void
buildTree
(Instances data, SimpleLinearRegression[][] higherRegressions, double totalInstanceWeight, double higherNumParameters, Instances numericDataHeader) Method for building the tree structure.void
Updates the alpha field for all nodes.double[]
distributionForInstance
(Instance instance) Returns the class probabilities for an instance given by the logistic model tree.Returns a string describing the number of LogitBoost iterations performed at this node, the total number of LogitBoost iterations performed (including iterations at higher levels in the tree), and the number of training examples at this node.getNodes()
Return a list of all inner nodes in the treevoid
Fills a list with all inner nodes in the treeint
Method to count the number of inner nodes in the treeint
Returns the number of leaves in the tree.Returns the revision string.graph()
Returns graph describing the tree.boolean
Returns true if the logistic regression model at this node has changed compared to the one at the parent node.double[]
modelDistributionForInstance
(Instance instance) Returns the class probabilities for an instance according to the logistic model at the node.Returns a string describing the logistic regression function at the node.int
Returns the number of leaves (normal count).int
numNodes()
Returns the number of nodes.void
prune
(double alpha) Prunes a logistic model tree using the CART pruning scheme, given a cost-complexity parameter alpha.int
Method for performing one fold in the cross-validation of the cost-complexity parameter.toString()
Returns a description of the logistic model tree (tree structure and logistic models)void
Updates the numIncorrectTree field for all nodes.Methods inherited from class weka.classifiers.trees.lmt.LogisticBase
cleanup, getMaxIterations, getNumRegressions, getUseAIC, getUsedAttributes, getWeightTrimBeta, percentAttributesUsed, setHeuristicStop, setMaxIterations, setUseAIC, setWeightTrimBeta
Methods inherited from class weka.classifiers.AbstractClassifier
batchSizeTipText, classifyInstance, debugTipText, distributionsForInstances, doNotCheckCapabilitiesTipText, forName, getBatchSize, getCapabilities, getDebug, getDoNotCheckCapabilities, getNumDecimalPlaces, getOptions, implementsMoreEfficientBatchPrediction, listOptions, makeCopies, makeCopy, numDecimalPlacesTipText, postExecution, preExecution, run, runClassifier, setBatchSize, setDebug, setDoNotCheckCapabilities, setNumDecimalPlaces, setOptions
-
Field Details
-
m_alpha
public double m_alphaAlpha-value (for pruning) at the node -
m_numIncorrectModel
public double m_numIncorrectModelWeighted number of training examples currently misclassified by the logistic model at the node -
m_numIncorrectTree
public double m_numIncorrectTreeWeighted number of training examples currently misclassified by the subtree rooted at the node
-
-
Constructor Details
-
LMTNode
public LMTNode(ModelSelection modelSelection, int numBoostingIterations, boolean fastRegression, boolean errorOnProbabilities, int minNumInstances, double weightTrimBeta, boolean useAIC, NominalToBinary ntb, int numDecimalPlaces) Constructor for logistic model tree node.- Parameters:
modelSelection
- selection method for local splitting modelnumBoostingIterations
- sets the numBoostingIterations parameterfastRegression
- sets the fastRegression parametererrorOnProbabilities
- Use error on probabilities for stopping criterion of LogitBoost?minNumInstances
- minimum number of instances at which a node is considered for splitting
-
-
Method Details
-
buildClassifier
Method for building a logistic model tree (only called for the root node). Grows an initial logistic model tree and prunes it back using the CART pruning scheme.- Specified by:
buildClassifier
in interfaceClassifier
- Overrides:
buildClassifier
in classLogisticBase
- Parameters:
data
- the data to train with- Throws:
Exception
- if something goes wrong
-
buildTree
public void buildTree(Instances data, SimpleLinearRegression[][] higherRegressions, double totalInstanceWeight, double higherNumParameters, Instances numericDataHeader) throws Exception Method for building the tree structure. Builds a logistic model, splits the node and recursively builds tree for child nodes.- Parameters:
data
- the training data passed on to this nodehigherRegressions
- An array of regression functions produced by LogitBoost at higher levels in the tree. They represent a logistic regression model that is refined locally at this node.totalInstanceWeight
- the total number of training exampleshigherNumParameters
- effective number of parameters in the logistic regression model built in parent nodes- Throws:
Exception
- if something goes wrong
-
prune
Prunes a logistic model tree using the CART pruning scheme, given a cost-complexity parameter alpha.- Parameters:
alpha
- the cost-complexity measure- Throws:
Exception
- if something goes wrong
-
prune
Method for performing one fold in the cross-validation of the cost-complexity parameter. Generates a sequence of alpha-values with error estimates for the corresponding (partially pruned) trees, given the test set of that fold.- Parameters:
alphas
- array to hold the generated alpha-valueserrors
- array to hold the corresponding error estimatestest
- test set of that fold (to obtain error estimates)- Throws:
Exception
- if something goes wrong
-
getNumInnerNodes
public int getNumInnerNodes()Method to count the number of inner nodes in the tree- Returns:
- the number of inner nodes
-
getNumLeaves
public int getNumLeaves()Returns the number of leaves in the tree. Leaves are only counted if their logistic model has changed compared to the one of the parent node.- Returns:
- the number of leaves
-
treeErrors
public void treeErrors()Updates the numIncorrectTree field for all nodes. This is needed for calculating the alpha-values. -
calculateAlphas
Updates the alpha field for all nodes.- Throws:
Exception
-
getNodes
Return a list of all inner nodes in the tree- Returns:
- the list of nodes
-
getNodes
Fills a list with all inner nodes in the tree- Parameters:
nodeList
- the list to be filled
-
hasModels
public boolean hasModels()Returns true if the logistic regression model at this node has changed compared to the one at the parent node.- Returns:
- whether it has changed
-
modelDistributionForInstance
Returns the class probabilities for an instance according to the logistic model at the node.- Parameters:
instance
- the instance- Returns:
- the array of probabilities
- Throws:
Exception
-
distributionForInstance
Returns the class probabilities for an instance given by the logistic model tree.- Specified by:
distributionForInstance
in interfaceClassifier
- Overrides:
distributionForInstance
in classLogisticBase
- Parameters:
instance
- the instance- Returns:
- the array of probabilities
- Throws:
Exception
- if distribution can't be computed successfully
-
numLeaves
public int numLeaves()Returns the number of leaves (normal count).- Returns:
- the number of leaves
-
numNodes
public int numNodes()Returns the number of nodes.- Returns:
- the number of nodes
-
toString
Returns a description of the logistic model tree (tree structure and logistic models)- Overrides:
toString
in classLogisticBase
- Returns:
- describing string
-
getModelParameters
Returns a string describing the number of LogitBoost iterations performed at this node, the total number of LogitBoost iterations performed (including iterations at higher levels in the tree), and the number of training examples at this node.- Returns:
- the describing string
-
assignIDs
public int assignIDs(int lastID) Assigns unique IDs to all nodes in the tree -
assignLeafModelNumbers
public int assignLeafModelNumbers(int leafCounter) Assigns numbers to the logistic regression models at the leaves of the tree -
modelsToString
Returns a string describing the logistic regression function at the node. -
graph
Returns graph describing the tree.- Throws:
Exception
- if something goes wrong
-
getRevision
Returns the revision string.- Specified by:
getRevision
in interfaceRevisionHandler
- Overrides:
getRevision
in classLogisticBase
- Returns:
- the revision
-