scholarly journals Network attack characteristics of automatic data extraction technology

Author(s):  
Tianyin Pu ◽  
Zhengchan Rao ◽  
Zheng QIN
2013 ◽  
Vol 765-767 ◽  
pp. 1245-1248
Author(s):  
Pu Tian Yin ◽  
Rao Zheng Chan ◽  
Qin Zheng

attack automatic feature extraction technology is an important research of network security technology. From the network present situation research proceed with, to attack the automatic feature extraction technology for definition and classifycation, and for each category of technology are introduced in detail, and presents several attacks of automatic feature extraction technology, finally to present these technical deficiencies and the possible development trend are discussed.


2010 ◽  
Vol 20-23 ◽  
pp. 178-183
Author(s):  
Jun Hua Gu ◽  
Jie Song ◽  
Na Zhang ◽  
Yan Liu Liu

With the increasingly high-speed of the internet as well as the increase in the amount of data it contains, users are finding it more and more difficult to gain useful information from the web. How to extract accurate information from the Web efficiently has become an urgent problem. Web information extraction technology has emerged to solve this kind of problem. The method of Web information auto-extraction based on XML is designed through standardizing the HTML document using data translation algorism, forming an extracting rule base by learning the XPath expression of samples, and using extraction rule base to realize auto-extraction of pages of same kind. The results show that this approach should lead to a higher recall ratio and precision ratio, and the result should have a self-description, making it convenient for founding data extraction system of each domain.


2014 ◽  
Vol 12 (1) ◽  
pp. 15-20 ◽  
Author(s):  
N. Borisova

Abstract An approach for Ontology based Information Extraction (OBIE) from unstructured text in the Bulgarian language is presented in this paper. The presented method and algorithm provide a solution for automatic data extraction from text documents exploiting ontologies. To this end, in addition to the standard tools for processing language resources in an open source free software, a dictionary-based lemmatizer for Bulgarian has been developed and integrated. It is distributed as free software, publicly available to download and use under the GPL v3 license. Due to the specifics of inflection in Bulgarian the developed tools for lemmatization will contribute to improving the results of the POS tagger. This approach will offer opportunities for developing a dynamically created gazetteer that is, in combination with a few other generic GATE resources, capable of producing ontologybased annotations over the given content with regards to the given ontology. This algorithm can also be used in the processes of content creation and management of information and knowledge.


Sign in / Sign up

Export Citation Format

Share Document