MODEL OF STANDARD FOR INFORMATION RETRIEVAL IN THE COLLECTION OF REGULATORY FRAMEWORK DOCUMENTS
The article proposes a model of the standards’ texts for information retrieval in the collection of documents the regulatory framework. It is proved that the standard means of information retrieval in the collection texts of standards are ineffective due to the compositional features of the texts and the wide use of generalized and abstract vocabulary. Distinctive stylistic features of standards’ texts in normative base are shown in compositional structure, logic of material representation, compactness. It is noted that the standards’ texts have the same structure of material presentation for all texts of this class, and also contain a limited set of structural elements. The description of structural elements of standards is given. It is proved that the compositional structure of the standard’s text has a significant impact on the results of information retrieval in the collection of documents the regulatory framework. The compositional structure of the standard in the Backus-Naur notations is presented. It is developed the model of the standards’ text in the form of a graph, the vertices and edges of which are full-fledged structural elements of the standard, significant both for the content of the text as a whole, and in terms of information retrieval. It is proved that the presentation of the standard’s text in the form of a graph makes it possible in the process of computer analysis the standard’s text to determine the type of structural element, the degree of nesting, by submitting the standard in the form of a finite set of its components.