We characterize the best possible trade-off achievable when optimizing the construction of a decision tree with respect to both the worst and the expected cost. It is known that a decision tree achieving the minimum possible worst case cost can behave very poorly in expectation (even exponentially worse than the optimal), and the vice versa is also true. Led by applications where deciding for the right optimization criterion might not be easy, recently, several authors have focussed on the bicriteria optimization of decision trees. An unanswered fundamental question is about the best possible tradeoff achievable. Here we are able to sharply define the limits for such a task. More precisely, we show that for every ρ > 0 there is a decision tree D with worst testing cost at most (1+ρ)OPTW +1 and expected testing cost at most [Formula presented] , where OPTW and OPTE denote the minimum worst testing cost and the minimum expected testing cost of a decision tree for the given instance. We also show that this is the best possible trade-off in the sense that there are infinitely many instances for which we cannot obtain a decision tree with both worst testing cost smaller than (1+ρ)OPTW and expected testing cost smaller than [Formula presented]
Trading off Worst and Expected Cost in Decision Tree Problems
Cicalese, Ferdinando
2015-01-01
Abstract
We characterize the best possible trade-off achievable when optimizing the construction of a decision tree with respect to both the worst and the expected cost. It is known that a decision tree achieving the minimum possible worst case cost can behave very poorly in expectation (even exponentially worse than the optimal), and the vice versa is also true. Led by applications where deciding for the right optimization criterion might not be easy, recently, several authors have focussed on the bicriteria optimization of decision trees. An unanswered fundamental question is about the best possible tradeoff achievable. Here we are able to sharply define the limits for such a task. More precisely, we show that for every ρ > 0 there is a decision tree D with worst testing cost at most (1+ρ)OPTW +1 and expected testing cost at most [Formula presented] , where OPTW and OPTE denote the minimum worst testing cost and the minimum expected testing cost of a decision tree for the given instance. We also show that this is the best possible trade-off in the sense that there are infinitely many instances for which we cannot obtain a decision tree with both worst testing cost smaller than (1+ρ)OPTW and expected testing cost smaller than [Formula presented]I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.