Enhancing Automatic Acquisition of Thematic Structure in a Large-Scale Lexicon for Mandarin Chinese

Author(s):  
Mari Broman Olsen ◽  
Bonnie J. Dorr ◽  
Scott C. Thomas
2014 ◽  
Vol 687-691 ◽  
pp. 1683-1686
Author(s):  
Shuang Wang

This thesis proposes several methods for automatically acquiring bilingual corpora from different websites, such as the "iciba" website, CNKI, and the Patent network. It introduces the methods and procedures for acquiring a variety of corpora. We propose different methods suited to the characteristics of each site and achieve fast, accurate automatic acquisition of a large-scale bilingual corpus. For the "iciba" website, the main method is the Nutch crawler, which performs relatively well, with accurate retrieval and good correlation. In addition, we abandon the idea of obtaining a bilingual corpus from the entire Internet and instead take a new approach: accessing the basic information of scholarly theses in CNKI to obtain a large-scale, high-quality English-Chinese bilingual corpus. In the end, we obtain a gigabyte-scale aligned bilingual corpus, which manual evaluation shows to be very accurate. This corpus lays the groundwork for further research on cross-language information retrieval.


2014 ◽  
Vol 42 (3) ◽  
pp. 505-537 ◽  
Author(s):  
MEILING HAO ◽  
YOUYI LIU ◽  
HUA SHU ◽  
AILING XING ◽  
YING JIANG ◽  
...  

ABSTRACT In this paper we report a large-scale developmental study of early productive vocabulary acquisition by 928 Chinese-speaking children aged between 1;0 and 2;6, using the Early Vocabulary Inventory for Mandarin Chinese (Hao, Shu, Xing & Li, 2008). The results show that: (i) social words, especially words for people, are the predominant type of words in Chinese-speaking children's earliest productive vocabulary; (ii) overall, Chinese-speaking children's vocabulary contains greater proportions of nouns than other word categories, especially at the earliest vocabulary stage; and (iii) verbs tend to appear earlier for Chinese-speaking children as compared with English-speaking children at the same levels of vocabulary development. In addition, our study has identified the underlying variables that influence the age of acquisition of words, specifically, the interplay between the conceptual (imageability) and linguistic properties (word frequency, word length, and grammatical category) that jointly shape the development of Mandarin-speaking children's early vocabulary.


1991 ◽  
Vol 13 (3-4) ◽  
pp. 399-419 ◽  
Author(s):  
Claire Hsun-huei Chang

Author(s):  
Anna Morbiato

This article re-examines the issue of grammatical relations in Mandarin Chinese in light of the results of recent large-scale typological research on grammatical relations (henceforth GRs) worldwide. Specifically, it discusses three syntactic operations and constructions that are cross-linguistically relevant to the definition of grammatical relations, namely relativisation, reflexivisation, and quantifier float. The study adopts a strictly language-internal typological approach and avails itself of natural linguistic data or sentences sanity-checked by native speakers. The aim of this paper is twofold: first, it explores the hypothesis that, in line with various other languages, GRs in Mandarin Chinese are construction-specific. Second, it proposes an alternative approach capable of explaining the conflicting evidence often pointed out in the literature on GRs and subjecthood in Mandarin Chinese.

