semantic visualization
Recently Published Documents


TOTAL DOCUMENTS

27
(FIVE YEARS 5)

H-INDEX

6
(FIVE YEARS 1)

Author(s):  
Kiran Fahd ◽  
Sitalakshmi Venkatraman

AbstractScholarly communication of knowledge is predominantly document-based in digital repositories, and researchers find it tedious to automatically capture and process the semantics among related articles. Despite the present digital era of big data, there is a lack of visual representations of the knowledge present in scholarly articles, and a time-saving approach for a literature search and visual navigation is warranted. The majority of knowledge display tools cannot cope with current big data trends and pose limitations in meeting the requirements of automatic knowledge representation, storage, and dynamic visualization. To address this limitation, the main aim of this paper is to model the visualization of unstructured data and explore the feasibility of achieving visual navigation for researchers to gain insight into the knowledge hidden in scientific articles of digital repositories. Contemporary topics of research and practice, including modifiable risk factors leading to a dramatic increase in Alzheimer’s disease and other forms of dementia, warrant deeper insight into the evidence-based knowledge available in the literature. The goal is to provide researchers with a visual-based easy traversal through a digital repository of research articles. This paper takes the first step in proposing a novel integrated model using knowledge maps and next-generation graph datastores to achieve a semantic visualization with domain-specific knowledge, such as dementia risk factors. The model facilitates a deep conceptual understanding of the literature by automatically establishing visual relationships among the extracted knowledge from the big data resources of research articles. It also serves as an automated tool for a visual navigation through the knowledge repository for faster identification of dementia risk factors reported in scholarly articles. Further, it facilitates a semantic visualization and domain-specific knowledge discovery from a large digital repository and their associations. In this study, the implementation of the proposed model in the Neo4j graph data repository, along with the results achieved, is presented as a proof of concept. Using scholarly research articles on dementia risk factors as a case study, automatic knowledge extraction, storage, intelligent search, and visual navigation are illustrated. The implementation of contextual knowledge and its relationship for a visual exploration by researchers show promising results in the knowledge discovery of dementia risk factors. Overall, this study demonstrates the significance of a semantic visualization with the effective use of knowledge maps and paves the way for extending visual modeling capabilities in the future.


2021 ◽  
Vol 2 ◽  
pp. 1-7
Author(s):  
Evangelos Papadias ◽  
Margarita Kokla ◽  
Eleni Tomai

Abstract. A growing body of geospatial research has shifted the focus from fully structured to semistructured and unstructured content written in natural language. Natural language texts provide a wealth of knowledge about geospatial concepts, places, events, and activities that needs to be extracted and formalized to support semantic annotation, knowledge-based exploration, and semantic search. The paper presents a web-based prototype for the extraction of geospatial entities and concepts, and the subsequent semantic visualization and interactive exploration of the extraction results. A lightweight ontology anchored in natural language guides the interpretation of natural language texts and the extraction of relevant domain knowledge. The approach is applied on three heterogeneous sources which provide a wealth of spatial concepts and place names.


2021 ◽  
Author(s):  
Jingxuan Tu ◽  
Marc Verhagen ◽  
Brent Cochran ◽  
James Pustejovsky

2021 ◽  
pp. 762-778
Author(s):  
Delvin Ce Zhang ◽  
Hady W. Lauw

Author(s):  
Tuan M. V. Le ◽  
Hady W. Lauw

Semantic visualization integrates topic modeling and visualization, such that every document is associated with a topic distribution as well as visualization coordinates on a low-dimensional Euclidean space. We address the problem of semantic visualization for short texts. Such documents are increasingly common, including tweets, search snippets, news headlines, or status updates. Due to their short lengths, it is difficult to model semantics as the word co-occurrences in such a corpus are very sparse. Our approach is to incorporate auxiliary information, such as word embeddings from a larger corpus, to supplement the lack of co-occurrences. This requires the development of a novel semantic visualization model that seamlessly integrates visualization coordinates, topic distributions, and word vectors. We propose a model called GaussianSV, which outperforms pipelined baselines that derive topic models and visualization coordinates as disjoint steps, as well as semantic visualization baselines that do not consider word embeddings.


2016 ◽  
Vol 55 ◽  
pp. 1091-1133 ◽  
Author(s):  
Tuan M. V. Le ◽  
Hady W. Lauw

Visualization of high-dimensional data, such as text documents, is useful to map out the similarities among various data points. In the high-dimensional space, documents are commonly represented as bags of words, with dimensionality equal to the vocabulary size. Classical approaches to document visualization directly reduce this into visualizable two or three dimensions. Recent approaches consider an intermediate representation in topic space, between word space and visualization space, which preserves the semantics by topic modeling. While aiming for a good fit between the model parameters and the observed data, previous approaches have not considered the local consistency among data instances. We consider the problem of semantic visualization by jointly modeling topics and visualization on the intrinsic document manifold, modeled using a neighborhood graph. Each document has both a topic distribution and visualization coordinate. Specifically, we propose an unsupervised probabilistic model, called SEMAFORE, which aims to preserve the manifold in the lower-dimensional spaces through a neighborhood regularization framework designed for the semantic visualization task. To validate the efficacy of SEMAFORE, our comprehensive experiments on a number of real-life text datasets of news articles and Web pages show that the proposed methods outperform the state-of-the-art baselines on objective evaluation metrics.


Sign in / Sign up

Export Citation Format

Share Document