The sentence-initial position in Chinese* is generally associated with givenness and definiteness (Li and Thompson 1976, 1981 and subsequent literature). However, observations have been raised against this association, e.g., with informationally new sentence-initial referents (Paul, 2015) or even indefinite ones (Bisang, 2016; B. Lu, Zhang, & Bisang, 2015). Furthermore, literature has emerged that offers contradictory findings, including Fan (1985) and subsequent studies on so-called ‘indefinite subject sentences’ (无定主语句). However, little is known of the statistical relevance of the phenomenon of sentence-initial indefinites (SIIs), as well as its interaction with features connected to linear order (e.g., animacy). Crucially, corpus-based studies on the topic remain the minority and are usually conducted on relatively small, genre-specific corpora. This paper adopts corpus methodologies and tools to investigate the phenomenon of SIIs, with a particular focus on determining (i) the statistical relevance of SIIs of the type of ‘一CLF N’ in big-size corpora, and (ii) whether there is an interaction with the semantic feature of animacy. To this end, it proposes the results of quantitative and qualitative analyses conducted on three major big-size, generalised corpora, namely the PKU CCL corpus (Peking University, 470 million characters), the BCC corpus (Beijing Language and Culture University, 15 billion characters), and the ZHTenTen (Stanford Tagger) corpus mounted at Sketch Engine (13,5 billion characters). The analysis highlights that SIIs are not only possible, but also statistically relevant. Furthermore, it shows that the semantic trait of animacy plays an important role, as animate indefinites are significantly more likely to occur sentence-initially. Finally, it singles out and discusses a new pattern featuring a proper noun introduced by ‘一CLF’.