Abstract. Flood forecasting and simulation in semiarid regions are always poor, and a single criterion assessment provides limited information for decision making. Here, we propose a multicriterion assessment framework combining the absolute relative error, the flow partitioning and the confidence interval estimated by the Hydrologic Uncertainty Processor (HUP) to assess the most striking feature of an event-based flood–the peak flow. The physically based model MIKE SHE and three conceptual models (two models with a single runoff generation mechanism, the Xi’anjiang model (XAJ) and the Shanbei model (SBM), and one model with the mixed runoff generation mechanism, the vertically mixed runoff model (VMM)) are compared in terms of flood modeling performance in four semiarid catchments (Qiushui River, Qingjian River, Tuwei River and Kuye River) in the middle Yellow River. Our results show that VMM has a better flood estimation performance than the other models, and under the multicriterion assessment framework, the average acceptance of flood events accounts for 58 %, but when absolute relative error 20 % is used as the performance criterion, its figure is only 41 % in four semiarid catchments.