Vers une structuration de dictionnaires servant à la Tao

1999 ◽  
Vol 22 (1-2) ◽  
pp. 155-172
Author(s):  
Dieter Seelbach

We have investigated the contrastive French-German aspects of about 50 simple nominal and compound adjectival predicates, and designed their entries for an electronic dictionary for computer-assisted translation. Their co-occurrence with support verbs expressing different aspectual semantic functions is coded; in the case of the nouns, their co-occurrence with determiners is also coded, as is the syntactic and semantic specification of their arguments in terms of semantic classes called “object classes”. As an illustration of the envisaged translation methodology based on lexicon-grammar, the syntactic tagging of different formal types of simple and compound predicates, and the semantic tagging of their arguments, are demonstrated with reference to a German newspaper text and its French translation. intex will be used not only for identifying and translating compound words or multi-word expressions, but also — as an additional, but necessary tool — for finding object classes together with their appropriate predicates and support verbs, because object classes are clearly syntactically defined.

2016 ◽  
Vol 105 (1) ◽  
pp. 63-76
Author(s):  
Theresa Guinard

Abstract Morphological analysis (finding the component morphemes of a word and tagging morphemes with part-of-speech information) is a useful preprocessing step in many natural language processing applications, especially for synthetic languages. Compound words from the constructed language Esperanto are formed by straightforward agglutination, but for many words, there is more than one possible sequence of component morphemes. However, one segmentation is usually more semantically probable than the others. This paper presents a modified n-gram Markov model that finds the most probable segmentation of any Esperanto word, where the model’s states represent morpheme part-of-speech and semantic classes. The overall segmentation accuracy was over 98% for a set of presegmented dictionary words.


1999 ◽  
Vol 22 (1-2) ◽  
pp. 213-232 ◽  
Author(s):  
Xavier Blanco ◽  
Dolors Català

Within the framework of studies using the ladl system of electronic dictionaries, the group Applied Linguistics in Romance Languages of the uab has undertaken the construction of an electronic dictionary for frozen compound adverbs. This dictionary completes the delacs (Dictionary for Compound Words of Spanish). This paper briefly presents the characteristics of each type of frozen compound adverb and also the choices that underlie the development of the tables in which they are recorded. Details are also given about the state of electronic dictionary of frozen compound adverbs currently available on intex. Since this is a preliminary study, the final section is devoted to identifying possible future developments, rather than drawing conclusions.


2020 ◽  
Vol 6 (4) ◽  
pp. 75-81
Author(s):  
I. S. Kashenkova

The article highlights some features of German newspaper texts and related difficulties in translating them into Russian. The topicality of this study is explained by various linguistic and extralinguistic factors. The newspaper text should provide information in an understandable and accessible language form. However, in an effort to produce a greater effect on the reader, as well as to accommodate the maximum amount of information in a compressed form, journalists resort to special techniques to influence the reader, which, in particular, include the use of composites. The main purpose of the article is to remove some of the difficulties of translating composites in the German newspaper text, caused by implicit evaluation or background information (Trümmerfrau, Teflonkanzlerin, N-Wort), which requires additional “decryption” of the overall semantic meaning of the composite. To this end, the article analyzes the specifics of the German newspaper text, establishes the most active types of composites in German newspaper texts. The increased number of complex nouns in German newspaper texts is specified on the principle of linguistic economy, which is intrinsic to German communicative style, and German word formation. Options for translating rather difficult composites into Russian are offered. Attention is drawn to the transfer of the cultural specifics of the original when using lexico-grammatical transformations to preserve the semantic load of the original and its adequate translation. It is concluded that it is necessary to take into account the cultural specifics of composites in the German newspaper text when translating them into Russian.


2002 ◽  
Vol 25 (1) ◽  
pp. 7-28 ◽  
Author(s):  
Tita Kyriacopoulou ◽  
Safia Mrabti ◽  
Anastasia Yannacopoulou

Summary Automatic natural language processing systems require dictionaries of compound words in order to recognize these words when analyzing text automatically. This article presents the electronic dictionary of compound words in Modern Greek. This morphological dictionary contains around 28þ000 entries associated with a flexional code and filters. The flexional codes are used to describe flexional vectors, the presence of variants or accent displacements. The filters are used to describe the inflection of the words (simple or compound), meaning the cases or genders allowed in order to obtain every possible form of the word. This study is part of the work carried out jointly by the Aristotle University of Thessaloniki and Marne-la-Vallée University with a view to developing a complete and formalized description of the Greek language.


2008 ◽  
Vol 1 ◽  
Author(s):  
Aljoscha Burchardt ◽  
Sebastian Padó ◽  
Dennis Spohr ◽  
Anette Frank ◽  
Ulrich Heid

We present a general approach to formally modelling corpora with multi-layered annotation in a typed logical representation language, OWL DL. By defining abstractions over the corpus data, we can generalise from a large set of individual corpus annotations, thereby inducing a lexicon model. The resulting combined corpus and lexicon model can be interpreted as a graph structure that offers flexible querying functionality beyond current XML-based query languages. Its powerful methods for characterising and checking consistency can be used for incremental model refinement. In addition, the formalisation in a graph-based structure offers the means of defining flexible lexicon views over the corpus data. These views can be tailored for linguistic inspection or to define clean interfaces with other linguistic resources. We illustrate our approach by applying it to the syntactically and semantically annotated SALSA/TIGER corpus, a collection of German newspaper text.


Author(s):  
E. T. O'Toole ◽  
R. R. Hantgan ◽  
J. C. Lewis

Thrombocytes (TC), the avian equivalent of blood platelets, support hemostasis by aggregating at sites of injury. Studies in our lab suggested that fibrinogen (fib) is a requisite cofactor for TC aggregation but operates by an undefined mechanism. To study the interaction of fib with TC and to identify fib receptors on cells, fib was purified from pigeon plasma, conjugated to colloidal gold and used both to facilitate aggregation and as a receptor probe. Described is the application of computer assisted reconstruction and stereo whole mount microscopy to visualize the 3-D organization of fib receptors at sites of cell contact in TC aggregates and on adherent cells.Pigeon TC were obtained from citrated whole blood by differential centrifugation, washed with Ca++ free Hank's balanced salts containing 0.3% EDTA (pH 6.5) and resuspended in Ca++ free Hank's. Pigeon fib was isolated by precipitation with PEG-1000 and the purity assessed by SDS-PAGE. Fib was conjugated to 25nm colloidal gold by vortexing and the conjugates used as the ligand to identify fib receptors.


Author(s):  
A.M. Jones ◽  
A. Max Fiskin

If the tilt of a specimen can be varied either by the strategy of observing identical particles orientated randomly or by use of a eucentric goniometer stage, three dimensional reconstruction procedures are available (l). If the specimens, such as small protein aggregates, lack periodicity, direct space methods compete favorably in ease of implementation with reconstruction by the Fourier (transform) space approach (2). Regardless of method, reconstruction is possible because useful specimen thicknesses are always much less than the depth of field in an electron microscope. Thus electron images record the amount of stain in columns of the object normal to the recording plates. For single particles, practical considerations dictate that the specimen be tilted precisely about a single axis. In so doing a reconstructed image is achieved serially from two-dimensional sections which in turn are generated by a series of back-to-front lines of projection data.


Sign in / Sign up

Export Citation Format

Share Document