Graphical Keyword Service for Research Papers with Text-Mining Method

The study aimed at analyzing the keywords of the oil exploration research papers abstracts in 2012 and 2013 and using the random forests model to make the classification analysis in order to find the importance and similarities of 2012 and 2013 research trends. The contribution of the study included the following two points. First, the study used the text mining method in order to explore the content of oil exploration research paper abstracts. Second, the study applied the AdaBoost classification analysis to explore the relationship of the keywords between the two years’ keywords.

Download Full-text

Similarity Detection Using Latent Semantic Analysis Algorithm

International Journal of Emerging Research in Management and Technology ◽

10.23956/ijermt.v6i8.124 ◽

2018 ◽

Vol 6 (8) ◽

pp. 102

Author(s):

Priyanka R. Patil ◽

Shital A. Patil

Keyword(s):

Latent Semantic Analysis ◽

Latent Dirichlet Allocation ◽

Semantic Analysis ◽

Mining Method ◽

Research Papers ◽

Information Measures ◽

Automated Software ◽

Day By Day ◽

Ways Of Life ◽

Dirichlet Allocation

Similarity View is an application for visually comparing and exploring multiple models of text and collection of document. Friendbook finds ways of life of clients from client driven sensor information, measures the closeness of ways of life amongst clients, and prescribes companions to clients if their ways of life have high likeness. Roused by demonstrate a clients day by day life as life records, from their ways of life are separated by utilizing the Latent Dirichlet Allocation Algorithm. Manual techniques can't be utilized for checking research papers, as the doled out commentator may have lacking learning in the exploration disciplines. For different subjective views, causing possible misinterpretations. An urgent need for an effective and feasible approach to check the submitted research papers with support of automated software. A method like text mining method come to solve the problem of automatically checking the research papers semantically. The proposed method to finding the proper similarity of text from the collection of documents by using Latent Dirichlet Allocation (LDA) algorithm and Latent Semantic Analysis (LSA) with synonym algorithm which is used to find synonyms of text index wise by using the English wordnet dictionary, another algorithm is LSA without synonym used to find the similarity of text based on index. LSA with synonym rate of accuracy is greater when the synonym are consider for matching.

Download Full-text

Investigating Diseases and Chemicals in COVID-19 Literature with Text Mining (Preprint)

10.2196/preprints.21503 ◽

2020 ◽

Author(s):

Amir Karami ◽

Brandon Bookstaver ◽

Melissa Nolan

Keyword(s):

Text Mining ◽

Literature Review ◽

Topic Modeling ◽

Large Scale ◽

Clinical Manifestations ◽

International Health ◽

Research Papers ◽

Strategic Plans ◽

Funding Agencies ◽

The Relationship

BACKGROUND The COVID-19 pandemic has impacted nearly all aspects of life and has posed significant threats to international health and the economy. Given the rapidly unfolding nature of the current pandemic, there is an urgent need to streamline literature synthesis of the growing scientific research to elucidate targeted solutions. While traditional systematic literature review studies provide valuable insights, these studies have restrictions, including analyzing a limited number of papers, having various biases, being time-consuming and labor-intensive, focusing on a few topics, incapable of trend analysis, and lack of data-driven tools. OBJECTIVE This study fills the mentioned restrictions in the literature and practice by analyzing two biomedical concepts, clinical manifestations of disease and therapeutic chemical compounds, with text mining methods in a corpus containing COVID-19 research papers and find associations between the two biomedical concepts. METHODS This research has collected papers representing COVID-19 pre-prints and peer-reviewed research published in 2020. We used frequency analysis to find highly frequent manifestations and therapeutic chemicals, representing the importance of the two biomedical concepts. This study also applied topic modeling to find the relationship between the two biomedical concepts. RESULTS We analyzed 9,298 research papers published through May 5, 2020 and found 3,645 disease-related and 2,434 chemical-related articles. The most frequent clinical manifestations of disease terminology included COVID-19, SARS, cancer, pneumonia, fever, and cough. The most frequent chemical-related terminology included Lopinavir, Ritonavir, Oxygen, Chloroquine, Remdesivir, and water. Topic modeling provided 25 categories showing relationships between our two overarching categories. These categories represent statistically significant associations between multiple aspects of each category, some connections of which were novel and not previously identified by the scientific community. CONCLUSIONS Appreciation of this context is vital due to the lack of a systematic large-scale literature review survey and the importance of fast literature review during the current COVID-19 pandemic for developing treatments. This study is beneficial to researchers for obtaining a macro-level picture of literature, to educators for knowing the scope of literature, to journals for exploring most discussed disease symptoms and pharmaceutical targets, and to policymakers and funding agencies for creating scientific strategic plans regarding COVID-19.

Download Full-text

TREASURE: Text Mining Algorithm Based On Affinity Analysis and Set Intersection to Find the Action of Tuberculosis Drugs against Other Pathogens

Applied Sciences ◽

10.3390/app11156834 ◽

2021 ◽

Vol 11 (15) ◽

pp. 6834

Author(s):

Pradeepa Sampath ◽

Nithya Shree Sridhar ◽

Vimal Shanmuganathan ◽

Yangsun Lee

Keyword(s):

Text Mining ◽

Causes Of Death ◽

Intersection Property ◽

Research Papers ◽

Bacterial Diseases ◽

Mining Algorithm ◽

Set Intersection ◽

The World ◽

Medical Researchers

Tuberculosis (TB) is one of the top causes of death in the world. Though TB is known as the world’s most infectious killer, it can be treated with a combination of TB drugs. Some of these drugs can be active against other infective agents, in addition to TB. We propose a framework called TREASURE (Text mining algoRithm basEd on Affinity analysis and Set intersection to find the action of tUberculosis dRugs against other pathogEns), which particularly focuses on the extraction of various drug–pathogen relationships in eight different TB drugs, namely pyrazinamide, moxifloxacin, ethambutol, isoniazid, rifampicin, linezolid, streptomycin and amikacin. More than 1500 research papers from PubMed are collected for each drug. The data collected for this purpose are first preprocessed, and various relation records are generated for each drug using affinity analysis. These records are then filtered based on the maximum co-occurrence value and set intersection property to obtain the required inferences. The inferences produced by this framework can help the medical researchers in finding cures for other bacterial diseases. Additionally, the analysis presented in this model can be utilized by the medical experts in their disease and drug experiments.

Download Full-text

Complaint management model of manufacturing products using text mining and potential failure identification

The TQM Journal ◽

10.1108/tqm-05-2021-0145 ◽

2021 ◽

Vol ahead-of-print (ahead-of-print) ◽

Author(s):

Ririn Diar Astanti ◽

Ivana Carissa Sutanto ◽

The Jin Ai

Keyword(s):

Quality Management ◽

Text Mining ◽

Failure Mode ◽

Management System ◽

Main Part ◽

Mining Method ◽

Content Type ◽

Complaint Management ◽

Potential Failure ◽

Failure Identification

PurposeThis paper aims to propose a framework on complaint management system for quality management by applying the text mining method and potential failure identification that can support organization learning (OL). Customer complaints in the form of email text is the input of the framework, while the most frequent complaints are visualized using a Pareto diagram. The company can learn from this Pareto diagram and take action to improve their process.Design/methodology/approachThe first main part of the framework is creating a defect database from potential failure identification, which is the initial part of the failure mode and effect analysis technique. The second main part is the text mining of customer email complaints. The last part of the framework is matching the result of text mining with the defect database and presenting in the form of a Pareto diagram. After the framework is proposed, a case study is conducted to illustrate the applicability of the proposed method.FindingsBy using the defect database, the framework can interpret the customer email complaints into the list of most defect complained by customer using a Pareto diagram. The results of the Pareto diagram, based on the results of text mining of consumer complaints via email, can be used by a company to learn from complaint and to analyze the potential failure mode. This analysis helps company to take anticipatory action for avoiding potential failure mode happening in the future.Originality/valueThe framework on complaint management system for quality management by applying the text mining method and potential failure identification is proposed for the first time in this paper.

Download Full-text

A Case Study of a Text Mining Method for Discovering Evolutionary Patterns of Mobile Phone in Korea

Journal of the Korea Society of Computer and Information ◽

10.9708/jksci.2015.20.2.029 ◽

2015 ◽

Vol 20 (2) ◽

pp. 29-45

Author(s):

Byung-Won On

Keyword(s):

Text Mining ◽

Mobile Phone ◽

Mining Method ◽

Evolutionary Patterns

Download Full-text

SUPPORTING AIR TRANSPORT POLICIES USING BIG DATA ANALYTICS: A DESCRIPTIVE APPROACH BASED EMERGING TREND ANALYSIS

Journal of Air Transport Studies ◽

10.38008/jats.v8i1.40 ◽

2017 ◽

Vol 8 (1) ◽

pp. 51-72

Author(s):

Jin-seo Park

Keyword(s):

Qualitative Research ◽

Big Data ◽

Text Mining ◽

Research Methods ◽

Qualitative Research Methods ◽

Research Papers ◽

Emerging Trends ◽

The Future ◽

Descriptive Approach ◽

Core Issues

Qualitative research methods based on literature review or expert judgement have been used to find core issues, analyze emerging trends and discover promising areas for the future. Deriving results from large amounts of information under this approach is both costly and time consuming. Besides, there is a risk that the results may be influenced by the subjective opinion of experts. In order to make up for such weaknesses, the analysis paradigm for choosing future emerging trend is undergoing a shift toward mplementing qualitative research methods along with quantitative research methods like text mining in a mutually complementary manner. The hange used to implement recent studies is being witnessed in various areas such as the steel industry, the information and communications technology industry, the construction industry in architectural engineering and so on. This study focused on retrieving aviation-related core issues and the promising areas for the future from research papers pertaining to overall aviation areas through text mining method, which is one of the big data analysis techniques. This study has limitations in that its analysis for retrieving the aviation-related core issues and promising fields was restricted to research papers containing the keyword "aviation." However, it has significance in that it prepared a quantitative analysis model for continuously monitoring the derived core issues and emerging trends regarding the promising areas for the future in the aviation industry through the application of a big data-based descriptive approach.

Download Full-text