Exploring Earth Science Applications using Word Embeddings

Mapping Intimacies ◽

10.5194/egusphere-egu2020-9966 ◽

2020 ◽

Author(s):

Derek Koehl ◽

Carson Davis ◽

Rahul Ramachandran ◽

Udaysankar Nair ◽

Manil Maskey

Keyword(s):

Earth Science ◽

Controlled Vocabulary ◽

Word Embeddings ◽

Faceted Search ◽

Semantic Relationships ◽

Domain Specific ◽

The Earth ◽

Increase In Accuracy ◽

Fully Connected ◽

Analogy Prediction

Word embedding are numeric representations of text which capture meanings and semantic relationships in text. Embeddings can be constructed using different methods such as One Hot encoding, Frequency-based or Prediction-based approaches. Prediction-based approaches such as&#160; Word2Vec, can be used to generate word embeddings that can capture the underlying semantics and word relationships in a corpus. Word2Vec embeddings generated from domain specific corpus have been shown in studies to both predict relationships and augment word vectors to improve classifications. We describe results from two different experiments utilizing word embeddings for Earth science constructed from a corpus of over 20,000 journal papers using Word2Vec.&#160;The first experiment explores the analogy prediction performance of word embeddings built from the Earth science journal corpus and trained using domain-specific vocabulary. Our results demonstrate that the accuracy of domain-specific word embeddings in predicting Earth science analogy questions outperforms the ability of general corpus embedding to predict general analogy questions. While the results are as anticipated,&#160; the substantial increase in accuracy, particularly in the lexicographical domain was encouraging. The results point to the need for developing a comprehensive Earth science analogy test set that covers the full breadth of lexicographical and encyclopedic categories for validating word embeddings.The second experiment utilizes the word embeddings to augment metadata keyword classifications. Metadata describing NASA datasets have science keywords that are manually assigned which can lead to errors and inconsistencies. These science keywords are controlled vocabulary and are used to aid data discovery via faceted search and relevancy ranking. Given the small size of the number of metadata records with proper description and keywords, word embeddings were used for augmentation. A fully connected neural network was trained to suggest keywords given a description text. This approach provided the best accuracy at ~76% as compared to other methods tested.

Download Full-text

ES2Vec: Earth Science Metadata Keyword Assignment using Domain-Specific Word Embeddings

2020 SoutheastCon ◽

10.1109/southeastcon44009.2020.9249743 ◽

2020 ◽

Author(s):

Muthukumaran Ramasubramanian ◽

Hassan Muhammad ◽

Iksha Gurung ◽

Manil Maskey ◽

Rahul Ramachandran

Keyword(s):

Earth Science ◽

Word Embeddings ◽

Domain Specific

Download Full-text

SYSTEMATIC COMPOSITION OF MONOGRAPHICAL COLLECTIONS OF THE EARTH SCIENCE MUSEUM AT MOSCOW STATE UNIVERSITY

THE LIFE OF THE EARTH ◽

10.29003/m832.0514-7468.2018_41_4/464-471 ◽

2019 ◽

Vol 41 (4) ◽

pp. 464-471

Author(s):

Natalia Krupina ◽

Alla Prisyazhnaya

Keyword(s):

Earth Science ◽

Science Museum ◽

State University ◽

The Earth ◽

Systematic Composition ◽

Moscow State University

Download Full-text

Earthquake response strategies for Utah Geological and Mineral Survey and the earth-science community

10.34191/ofr-115 ◽

1988 ◽

Keyword(s):

Earth Science ◽

Earthquake Response ◽

Response Strategies ◽

The Earth ◽

Science Community

Download Full-text

THE EARTH SCIENCE LITERACY INITIATIVE (ESLI): A HISTORICAL PERSPECTIVE ON THE COMMUNITY-DRIVEN EFFORT TO IDENTIFY THE BIG IDEAS IN THE EARTH SCIENCES

10.1130/abs/2018am-320055 ◽

2018 ◽

Author(s):

Nicole LaDue ◽

Keyword(s):

Earth Sciences ◽

Historical Perspective ◽

Science Literacy ◽

Earth Science ◽

The Earth ◽

Big Ideas ◽

Literacy Initiative ◽

Earth Science Literacy

Download Full-text

Development and Evaluation of Novel Ophthalmology Domain-Specific Neural Word Embeddings to Predict Visual Prognosis

International Journal of Medical Informatics ◽

10.1016/j.ijmedinf.2021.104464 ◽

2021 ◽

pp. 104464

Author(s):

Sophia Wang ◽

Benjamin Tseng ◽

Tina Hernandez-Boussard

Keyword(s):

Word Embeddings ◽

Visual Prognosis ◽

Domain Specific

Download Full-text

THE EARTH SCIENCE CURRICULUM PROJECT

Journal of Geological Education ◽

10.5408/0022-1368-12.2.64 ◽

1964 ◽

Vol 12 (2) ◽

pp. 64-68

Author(s):

ROBERT L. HELLER

Keyword(s):

Science Curriculum ◽

Earth Science ◽

The Earth ◽

Curriculum Project

Download Full-text

In-Situ Cosmogenic 14C: Production and Examples of its Unique Applications in Studies of Terrestrial and Extraterrestrial Processes

Radiocarbon ◽

10.1017/s0033822200041394 ◽

2001 ◽

Vol 43 (2B) ◽

pp. 731-742 ◽

Cited By ~ 8

Author(s):

D Lal ◽

A J T Jull

Keyword(s):

Half Life ◽

Earth Science ◽

Chemical Properties ◽

Radioactive Isotopes ◽

Earth’S Atmosphere ◽

The Earth ◽

Wide Range ◽

History Of ◽

Earth's Atmosphere

Nuclear interactions of cosmic rays produce a number of stable and radioactive isotopes on the earth (Lai and Peters 1967). Two of these, 14C and 10Be, find applications as tracers in a wide variety of earth science problems by virtue of their special combination of attributes: 1) their source functions, 2) their half-lives, and 3) their chemical properties. The radioisotope, 14C (half-life = 5730 yr) produced in the earth's atmosphere was the first to be discovered (Anderson et al. 1947; Libby 1952). The next longer-lived isotope, also produced in the earth's atmosphere, 10Be (half-life = 1.5 myr) was discovered independently by two groups within a decade (Arnold 1956; Goel et al. 1957; Lal 1991a). Both the isotopes are produced efficiently in the earth's atmosphere, and also in solids on the earth's surface. Independently and jointly they serve as useful tracers for characterizing the evolutionary history of a wide range of materials and artifacts. Here, we specifically focus on the production of 14C in terrestrial solids, designated as in-situ-produced 14C (to differentiate it from atmospheric 14C, initially produced in the atmosphere). We also illustrate the application to several earth science problems. This is a relatively new area of investigations, using 14C as a tracer, which was made possible by the development of accelerator mass spectrometry (AMS). The availability of the in-situ 14C variety has enormously enhanced the overall scope of 14C as a tracer (singly or together with in-situ-produced 10Be), which eminently qualifies it as a unique tracer for studying earth sciences.

Download Full-text

Ensuring the Quality of Research Objects in the Earth Science Domain

2017 IEEE 13th International Conference on e-Science (e-Science) ◽

10.1109/escience.2017.62 ◽

2017 ◽

Author(s):

Andres Garcia-Silva ◽

Raul Palma ◽

Jose Manuel Gomez-Perez

Keyword(s):

Earth Science ◽

The Earth ◽

Science Domain ◽

Research Objects ◽

Quality Of Research

Download Full-text

Visual Exploration of Semantic Relationships in Neural Word Embeddings

IEEE Transactions on Visualization and Computer Graphics ◽

10.1109/tvcg.2017.2745141 ◽

2018 ◽

Vol 24 (1) ◽

pp. 553-562 ◽

Cited By ~ 26

Author(s):

Shusen Liu ◽

Peer-Timo Bremer ◽

Jayaraman J. Thiagarajan ◽

Vivek Srikumar ◽

Bei Wang ◽

...

Keyword(s):

Visual Exploration ◽

Word Embeddings ◽

Semantic Relationships

Download Full-text

LIVE PLANTS AS AN ADDITIONAL BOTANICAL COMPONENTOF THE THEMATIC EXPOSITION IN THE EARTH SCIENCE MUSEUM AT MOSCOW STATE UNIVERSITY

THE LIFE OF THE EARTH ◽

10.29003/m1777.0514-7468.2020_42_4/478-484 ◽

2020 ◽

Vol 42 (4) ◽

pp. 478-484

Author(s):

Kirill Golikov ◽

Ekaterina LAPTEVA ◽

A. SOCHIVKO

Keyword(s):

High Altitude ◽

Plant Communities ◽

Earth Science ◽

Life Forms ◽

Science Museum ◽

State University ◽

Structural Components ◽

The Earth ◽

The World ◽

Moscow State University

The article discusses the use of live plants as the botanical exposition component supplement of the “Natural areas” (hall № 17 “Natural zonality and its components” and № 20 “Desert, subtropical, tropical countries, high-altitude zone”) and “Physico-georaphic regions” (hall № 24 “Continents and parts of the world”) departments in order to visualize information presented in the Earth Science Museum. Demonstration of plants originating from different regions of the world representing different life forms and being structural components of various plant communities allows to visually characterizing thematic aspects of an exposition. That in turn reveal such principles of systematic nature organization as ecobiomorphic and phytocenotic.

Download Full-text