document organization
Recently Published Documents


Total documents: 33 (five years: 3)
H-index: 7 (five years: 0)

2021 ◽  
Vol 20 (1) ◽  
pp. 168
Author(s):  
Paulo Daniel Marcos dos Santos ◽  
Daiana da Conceição Alves de Magalhães ◽  
Nelma Camêlo de Araujo

A thesaurus is a structured listing scheme that serves as an instrument of document organization: its terms are semantically related within a specific subject, and those relationships are established hierarchically through descriptors that standardize the vocabulary and make the subject more specific. This article builds an experimental thesaurus on the theme “Crustaceans used in Alagoas cuisine”. While preparing the article, the authors found that the literature on seafood of the Alagoas coast is not arranged in an organized way, which made the topic a suitable basis for the experimental thesaurus. Following the instructions of the Thesaurus system, it was possible to analyze, retrieve, and index the information, making this theme available as a standardized documentary record that can support other researchers interested either in crustaceans or in thesaurus structuring.
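The hierarchical descriptor relationships described in this abstract can be illustrated with a minimal sketch. It is not the authors' thesaurus: the `Thesaurus` class, its method names, and the sample terms ("Shrimp", "Crab", "Mangrove crab") are hypothetical and chosen only to show how broader and narrower terms support indexing and retrieval.

```python
class Thesaurus:
    """Toy thesaurus storing broader/narrower descriptor relations (hypothetical design)."""

    def __init__(self):
        self.broader = {}   # descriptor -> its broader descriptor
        self.narrower = {}  # descriptor -> list of narrower descriptors

    def add(self, term, broader=None):
        # Register the descriptor and, if given, link it under its broader term.
        self.narrower.setdefault(term, [])
        if broader is not None:
            self.broader[term] = broader
            self.narrower.setdefault(broader, []).append(term)

    def path_to_root(self, term):
        # Walk upward through broader terms until the top descriptor is reached.
        path = [term]
        while path[-1] in self.broader:
            path.append(self.broader[path[-1]])
        return path

    def expand(self, term):
        # Return the term plus all narrower descendants, e.g. to widen a search.
        result = [term]
        for child in self.narrower.get(term, []):
            result.extend(self.expand(child))
        return result


# Hypothetical sample entries, not taken from the article.
thesaurus = Thesaurus()
thesaurus.add("Crustaceans")
thesaurus.add("Shrimp", broader="Crustaceans")
thesaurus.add("Crab", broader="Crustaceans")
thesaurus.add("Mangrove crab", broader="Crab")

print(thesaurus.path_to_root("Mangrove crab"))
print(thesaurus.expand("Crustaceans"))
```

Searching on a broad descriptor and expanding it to its narrower terms is one common way a thesaurus improves recall during retrieval; a fuller model would also record related and non-preferred terms.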


Document organization is necessary for making better use of documents. Organizing online documents is difficult because each document should be assigned to its appropriate group as soon as it appears on the web. Classification is one of the best ways to organize documents, and naive Bayes categorization plays a vital role in it. Naive Bayes is one of the simplest probabilistic Bayesian classifiers; it assumes that the effect of an attribute value on a given category is independent of the values of the other attributes. Document classification is an essential organizational task and is necessary for the efficient management of textual information systems. Classification methods may be unsupervised, supervised, or semi-supervised. This paper reviews various document organization approaches that use naive Bayesian classification, along with other related existing document organization methods.
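The independence assumption mentioned in this abstract can be made concrete with a small sketch of a multinomial naive Bayes text classifier. It is an illustration under stated assumptions, not the method of the reviewed paper: the function names, the whitespace tokenization, the add-one (Laplace) smoothing, and the four training documents are all hypothetical.

```python
import math
from collections import Counter, defaultdict


def train_naive_bayes(labeled_docs):
    """Count documents per category and word occurrences per category."""
    doc_counts = Counter()
    word_counts = defaultdict(Counter)
    vocabulary = set()
    for text, category in labeled_docs:
        doc_counts[category] += 1
        for word in text.lower().split():
            word_counts[category][word] += 1
            vocabulary.add(word)
    return doc_counts, word_counts, vocabulary


def classify(text, model):
    """Return the category with the highest log posterior score."""
    doc_counts, word_counts, vocabulary = model
    total_docs = sum(doc_counts.values())
    best_category, best_score = None, -math.inf
    for category in doc_counts:
        # Log prior: share of training documents in this category.
        score = math.log(doc_counts[category] / total_docs)
        total_words = sum(word_counts[category].values())
        for word in text.lower().split():
            if word not in vocabulary:
                continue  # ignore words never seen in training
            # Independence assumption: each word adds its own log likelihood.
            # Add-one smoothing avoids zero probabilities (an assumed choice).
            smoothed = word_counts[category][word] + 1
            score += math.log(smoothed / (total_words + len(vocabulary)))
        if score > best_score:
            best_category, best_score = category, score
    return best_category


# Hypothetical toy corpus for illustration only.
training = [
    ("shrimp crab lobster recipe", "cooking"),
    ("crab stew coconut recipe", "cooking"),
    ("topic model latent dirichlet", "text_mining"),
    ("document topic hierarchy model", "text_mining"),
]
model = train_naive_bayes(training)
print(classify("lobster stew recipe", model))
print(classify("latent topic model", model))
```

Because the per-word likelihoods are simply summed in log space, training and classification need only word counts, which is what makes the method practical for grouping documents as they arrive.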


2017 ◽  
Vol 21 (6) ◽  
pp. 480-497 ◽  
Author(s):  
Anna Potocki ◽  
Christine Ros ◽  
Nicolas Vibert ◽  
Jean-François Rouet

2016 ◽  
Vol 34 (1) ◽  
pp. 64-86 ◽  
Author(s):  
Jing Chen ◽  
Tian Tian Wang ◽  
Quan Lu

Purpose – This paper proposes a novel within-document analysis tool (DAT), the topic hierarchy and context-based document analysis tool (THC-DAT), which enables users to interactively analyze any multi-topic document based on fine-grained, hierarchical topics automatically extracted from it. THC-DAT uses the hierarchical latent Dirichlet allocation method and takes context information into account so that it can reveal the relationships between latent topics and related texts in a document. Design/methodology/approach – The methodology is a case study. The authors first reviewed the related literature, then applied a general “build and test” research model. After explaining the model, interface and functions of THC-DAT, they present a case study in which a scholarly paper is analyzed with the tool. Findings – THC-DAT organizes and serves document topics and texts hierarchically and by context, which overcomes the drawbacks of traditional DATs. Its navigation, browse, search and comparison functions enable users to read, search and analyze multi-topic documents efficiently and effectively. Practical implications – The tool can improve document organization and services in digital libraries or e-readers by helping users to interactively read, search and analyze documents efficiently and effectively, to learn about unfamiliar topics through exploration with little cognitive burden, or to deepen their understanding of a document. Originality/value – This paper designs a tool, THC-DAT, that analyzes documents in a topic hierarchy and context-based (THC) way. It contributes to overcoming the coarse-grained analysis drawbacks of existing within-document analysis tools.

