Descriptor matching has recently been used in optical flow estimation to handle large motion displacements. Although this works well for large displacements, motion jitter often arises because temporal consistency is not enforced. Enforcing temporal consistency, however, can compromise the ability to handle arbitrary motion shifts. In this paper, we propose a new approach that resolves this conflict. At its core is a hierarchical fusion of descriptor matching and optical flow to determine the true local motion displacements. Applications to homography estimation and 3D head pose estimation verify that the approach adapts well to both large and small motion displacements.