Trace fear conditioning is an important research paradigm to model aversive learning in biological or clinical scenarios, where predictors (conditioned stimuli, CS) and aversive outcomes (unconditioned stimuli, US) are separated in time. The optimal measurement of human trace fear conditioning, and in particular of memory recall after consolidation, is currently unclear. We conducted two identical experiments with a 15-s trace interval and a recall test 1 week after acquisition, while recording several psychophysiological observables. We explored learning and memory measures in the first experiment and confirmed the most sensitive measures in the second experiment. Retrodictive validity was used as a metric to estimate measurement error. We found that in the recall test without reinforcement, only fear-potentiated startle but not skin conductance, pupil size, heart period, or respiration amplitude, differentiated CS+ and CS-. During acquisition without startle probes, only skin conductance responses and pupil size responses but none of the other measures differentiated CS+ and CS-. We establish the optimal way of quantifying these conditioned responses. As a side finding, there was no evidence for extinction of fear-potentiated startle over 30 trials without reinforcement. These results may be useful to inform future substantive research using human trace fear conditioning protocols.