HEURISTIC CLASSIFICATION OF OFFICE DOCUMENTS

Document Processing Systems (DPSs) support office workers to manage information. Document classification is a major function of DPSs. By analyzing a document’s layout and conceptual structures, we present in this paper a sample-based approach to document classification. We represent a document’s layout structure by an ordered labeled tree through a procedure known as nested segmentation and represent the document’s conceptual structure by a set of attribute type pairs. The layout similarities between the document to be classified and sample documents are determined by a previously developed approximate tree matching toolkit. The conceptual similarities between the documents are determined by analyzing their contents and by calculating the degree of conceptual closeness. The document type is identified by computing both the layout and conceptual similarities between the document to be classified and the samples in the document sample base. Some experimental results are presented, which demonstrate the effectiveness of the proposed techniques.

Download Full-text

Automated Classification of Industry and Occupation Codes Using Document Classification Method

Neural Information Processing - Lecture Notes in Computer Science ◽

10.1007/978-3-540-30499-9_127 ◽

2004 ◽

pp. 827-833

Author(s):

Heui Seok Lim ◽

Hyeoncheol Kim

Keyword(s):

Document Classification ◽

Classification Method ◽

Automated Classification

Download Full-text

A heuristic classification of woody plants based on contrasting shade and drought strategies

Tree Physiology ◽

10.1093/treephys/tpy146 ◽

2019 ◽

Vol 39 (5) ◽

pp. 767-781 ◽

Cited By ~ 2

Author(s):

Liang Wei ◽

Chonggang Xu ◽

Steven Jansen ◽

Hang Zhou ◽

Bradley O Christoffersen ◽

...

Keyword(s):

Woody Plants ◽

Heuristic Classification

Download Full-text

Stone tools and conceptual structure

Behavioral and Brain Sciences ◽

10.1017/s0140525x00038127 ◽

1995 ◽

Vol 18 (1) ◽

pp. 202-203

Author(s):

James Steele

Keyword(s):

Cognitive Ability ◽

Stone Tools ◽

Conceptual Structure ◽

Stone Tool ◽

Alternative Account ◽

Premotor Area ◽

Conceptual Structures

AbstractUnderstanding how conceptual structures inform stone tool production and use would help us resolve the issue of a pongid-hominid dichotomy in brain organisation and cognitive ability. Evidence from ideational apraxia suggests that the planning of linguistic and manipulative behaviours is not colocalized in homologous circuits. An alternative account in terms of the evolutionary expansion of the whole prefrontal-premotor area may be more plausible.

Download Full-text

Relational public administration: a synthesis and heuristic classification of relational approaches

Public Management Review ◽

10.1080/14719037.2019.1632921 ◽

2019 ◽

Vol 22 (9) ◽

pp. 1324-1346 ◽

Cited By ~ 6

Author(s):

Koen Bartels ◽

Nick Turnbull

Keyword(s):

Public Administration ◽

Heuristic Classification

Download Full-text

Heuristic classification of physical theories based on quantum correlations

The European Physical Journal Plus ◽

10.1140/epjp/i2013-13065-5 ◽

2013 ◽

Vol 128 (6) ◽

Cited By ~ 2

Author(s):

M. Ferrero ◽

J. L. Sánchez-Gómez

Keyword(s):

Quantum Correlations ◽

Heuristic Classification

Download Full-text

A Study into Math Document Classification using Deep Learning

10.5121/csit.2020.101702 ◽

2020 ◽

Author(s):

Fatimah Alshamari ◽

Abdou Youssef

Keyword(s):

Deep Learning ◽

Classification Performance ◽

Document Classification ◽

Design Parameters ◽

Technological Advancement ◽

Scientific Publications ◽

Specific Domain ◽

Decision Choices ◽

And Mathematics

Document classification is a fundamental task for many applications, including document annotation, document understanding, and knowledge discovery. This is especially true in STEM fields where the growth rate of scientific publications is exponential, and where the need for document processing and understanding is essential to technological advancement. Classifying a new publication into a specific domain based on the content of the document is an expensive process in terms of cost and time. Therefore, there is a high demand for a reliable document classification system. In this paper, we focus on classification of mathematics documents, which consist of English text and mathematics formulas and symbols. The paper addresses two key questions. The first question is whether math-document classification performance is impacted by math expressions and symbols, either alone or in conjunction with the text contents of documents. Our investigations show that Text-Only embedding produces better classification results. The second question we address is the optimization of a deep learning (DL) model, the LSTM combined with one dimension CNN, for math document classification. We examine the model with several input representations, key design parameters and decision choices, and choices of the best input representation for math documents classification.

Download Full-text

Structural systematization - the essential stage in building consistent classification of libraries

Scientific and Technical Libraries ◽

10.33186/1027-3689-2018-8-20-35 ◽

2018 ◽

pp. 20-35

Author(s):

Elena Poltavskaya

Keyword(s):

Conceptual Framework ◽

Public Libraries ◽

Intermediate Stage ◽

Social Institution ◽

Conceptual Structure ◽

Natural Classification ◽

Social Mission ◽

Conceptual Schemes ◽

The Ideal

The need for structural systematization to reveal and compare the conceptual framework for library forms separated into the theoretical type reflected in the ideal construct of “the Stolyarov’s library” is substantiated. The library form structure is determined in a vicarious manner through conceptual schemes. The concepts that correspond to appropriate library forms are represented as logical systems (as if the library is being established in reality) and through the schemes. The groups of the library type four elements reflect the conceptual schemes: libraries as a social institution (corresponds to public libraries) and personal libraries (individually and family used libraries). Using conceptual schemes for systematization enables to divide all the libraries, according to their structure, into two groups that differ significantly in their social mission (serving communities, or the society; and serving individuals, or individual families). Differentiating existent libraries by their conceptual structure would further enable to design a general and consistent hierarchical library classification. Structural systematization is the essential intermediate stage when developing natural classification.

Download Full-text

FrameNet's Frames vs. Levin's Verb Classes

Proceedings of the Annual Meeting of the Berkeley Linguistics Society ◽

10.3765/bls.v28i1.3816 ◽

2002 ◽

Vol 28 (1) ◽

pp. 27 ◽

Cited By ~ 9

Author(s):

Collin F. Baker ◽

Josef Ruppenhofer

Keyword(s):

Semantic Similarity ◽

Preliminary Investigation ◽

Verb Classes ◽

Conceptual Structures ◽

Semantic Frames ◽

English Verb

The classification of verbs in Levin's (1993) English Verb Classes and Alternations: A preliminary Investigation, on the basis of both intuitive semantic grouping and their participation in valence alternations, is often used by the NLP community as evidence of the semantic similarity of verbs (Jing & McKeown 1998; Lapata & Brew 1999; Kohl et al. 1998). In this paper, we compare the Levin classification with the work of the FrameNet project (Fillmore & Baker 2001), where words (not just verbs) are grouped according to the conceptual structures (frames) that underlie them and their combinatorial patterns are inductively derived from corpus evidence. This means that verbs grouped together in FrameNet (FN) might be semantically similar but have different (or no) alternations, and that verbs which share the same alternation might be represented in two different semantic frames.

Download Full-text

Document Classification of Filipino Online Scam Incident Text using Data Mining Techniques

2019 19th International Symposium on Communications and Information Technologies (ISCIT) ◽

10.1109/iscit.2019.8905242 ◽

2019 ◽

Author(s):

Eddie Bouy B. Palad ◽

Marivic S. Tangkeko ◽

Lissa Andrea K. Magpantay ◽

Glenn L. Sipin

Keyword(s):

Data Mining ◽

Document Classification ◽

Data Mining Techniques ◽

Using Data

Download Full-text

Integrating constructional semantics and conceptual metaphor

Constructions and Frames ◽

10.1075/cf.8.2.02sul ◽

2016 ◽

Vol 8 (2) ◽

pp. 141-165 ◽

Cited By ~ 3

Author(s):

Karen Sullivan

Keyword(s):

Cognitive Linguistics ◽

Conceptual Metaphor ◽

Conceptual Structure ◽

Unified Model ◽

Cognitive Grammar ◽

Frame Semantics ◽

Conceptual Metaphor Theory ◽

Conceptual Structures ◽

Metaphoric Language ◽

Metaphor Theory

Conceptual Metaphor Theory (CMT) aims to represent the conceptual structure of metaphors rather than the structure of metaphoric language. The theory does not explain which aspects of metaphoric language evoke which conceptual structures, for example. However, other theories within cognitive linguistics may be better suited to this task. These theories, once integrated, should make building a unified model of both the conceptual and linguistic aspects of metaphor possible. First, constructional approaches to syntax provide an explanation of how particular constructional slots are associated with different functions in evoking metaphor. Cognitive Grammar is especially effective in this regard. Second, Frame Semantics helps explain how the words or phrases that fill the relevant constructional slots evoke the source and target domains of metaphor. Though these theories do not yet integrate seamlessly, their combination already offers explanatory benefits, such as allowing generalizations across metaphoric and non-metaphoric language, and identifying the words that play a role in evoking metaphors, for example.

Download Full-text