AbstractCan human listeners use strictly implicit temporal contingencies in auditory input to form temporal predictions, and if so, how are these predictions represented endogenously? To assess this question, we implicitly manipulated foreperiods in an auditory pitch discrimination task. Unbeknownst to participants, the pitch of the standard tone could either be deterministically predictive of the onset of the target tone, or convey no predictive information. Both conditions were presented interleaved in one stream, and separated by variable inter-stimulus intervals such that there was no dominant stimulus rhythm throughout. Even though participants were unaware of the implicit temporal contingencies, pitch discrimination sensitivity (i.e. the slope of the psychometric function) increased when the onset of the target tone was predictable in time (N = 49). Concurrently recorded EEG data (N = 24) revealed that standard tones which initiated temporal predictions evoked a more negative N1 component than non-predictive standards, and were followed by an increase in delta power during the foreperiod. Furthermore, the phase angle of delta oscillations (1–3Hz) evoked by the standard tone predicted pitch discrimination sensitivity at the target tone (1.75 s later on average), which suggests that temporal predictions can be initiated by an optimized delta phase reset. In sum, we show that auditory perception benefits from implicit temporal contingencies, and provide evidence for a role of slow neural oscillations in the endogenous representation of temporal predictions, in absence of exogenously driven entrainment to rhythmic input.Significance StatementTemporal contingencies are ubiquitous in sensory environments, especially in the auditory domain, and have been shown to facilitate perception and action. Yet, how these contingencies in exogenous inputs are transformed into an endogenous representation of temporal predictions is not known. Here, we implicitly induced temporal predictability in the absence of a rhythmic input structure, that is without exogenously driven entrainment of neural oscillations. Our results show that even implicit and non-rhythmic temporal predictions are extracted and used by human observers, underlining the role of timing in sensory processing. Furthermore, our EEG results point towards an instrumental role of delta oscillations in initiating temporal predictions by an optimized phase reset in response to a temporally predictive cue.