We show empirically that the PTP test has very little discriminatory power, with highly significant PTP test probabilities often being associated with parsimony data that produce trees with low confidence (as measured by bootstrapping) and resolution. Because of this, we argue that the PTP test is useful only in the following, very limited way: if a data set fails the PTP test, it should not be used in a phylogenetic analysis. More conservative methods of measuring confidence such as the bootstrap or decay index are preferable.