How to build a classification tree with only binary splits in each feature variable (preferably in R)?

Question

How to build a classification tree with only binary splits in each feature variable (preferably in R)?

2.4k views Asked by John Jiang At 27 December 2013 at 21:07

I have been using rpart to train a supervised decision tree model, with binary responses. The problem with the results is that some features get split multiple times in a non-monotonic way. For instance, feature A might be split into three intervals, [0,0.4],[0.4,0.6],[0.6,1], corresponding to the following responses respectively, -1,1,-1. I would prefer that each feature gets split once and in a binary way. Is there a way to do that in R?

An illustrating example:

Suppose I am interested in predicting college dropout rate from SAT score. Then the tree or rpart package in R might give me the following model:

1. SAT > 1100: no dropout
2. SAT <= 1100:
  3. SAT > 900: dropout
  4. SAT <= 900: no dropout

While this might be the best binary tree model given the training data. I want to inject my domain knowledge that the relation between SAT score and dropout probability should be monotone, and enforce that there is a single SAT threshold for determining the dropout probability.

So my question is if there is a way to enforce monotonicity in the sense above in R.

Original Q&A

There are 1 answers

**David Arenburg** · Answer 1 · 2014-03-23T18:07:01+00:00

David Arenburg On 23 March 2014 at 18:07

You can also try the party package, you can enforce single split there

library(party)
library(survival)
plot(ctree(status  ~ time1,  rats2), type = "simple")

enter image description here

plot(ctree(status  ~ time1,  rats2, controls = ctree_control(stump = T)), type = "simple")

enter image description here

TechQA.

How to build a classification tree with only binary splits in each feature variable (preferably in R)?

There are 1 answers

Related Questions in R

Related Questions in MACHINE-LEARNING

Related Questions in DECISION-TREE

Related Questions in CART-ANALYSIS

Popular Questions

Popular Tags

Trending Questions