Natural Language Processing Methods for Acoustic and Landmark Event-Based Features in Speech-Based Depression Detection

2020 ◽  
Vol 14 (2) ◽  
pp. 435-448
Author(s):  
Zhaocheng Huang ◽  
Julien Epps ◽  
Dale Joachim ◽  
Vidhyasaharan Sethu
2020 ◽  
Author(s):  
Vadim V. Korolev ◽  
Artem Mitrofanov ◽  
Kirill Karpov ◽  
Valery Tkachenko

The main advantage of modern natural language processing methods is the possibility of turning an amorphous human-readable task into a strict mathematical form. This makes it possible to extract chemical data and insights from articles and to find new semantic relations. We propose a universal engine for processing chemical and biological texts. We successfully tested it on various use cases and applied it to the search for a therapeutic agent for COVID-19 by analyzing the PubMed archive.
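
As a rough illustration of the kind of insight extraction described above, the sketch below ranks candidate therapeutic agents by how often they co-occur with a disease mention within abstract sentences. The drug lexicon, the sample abstracts, and the co-occurrence heuristic are all illustrative assumptions, not the authors' engine.

```python
# Minimal sketch: rank candidate therapeutic agents by how often they
# co-occur with a target disease in PubMed abstract sentences.
# DRUG_LEXICON, TARGET, and the abstracts are toy assumptions.
import re
from collections import Counter

DRUG_LEXICON = {"remdesivir", "lopinavir", "ritonavir", "chloroquine"}  # toy list
TARGET = "covid-19"

def candidate_agents(abstracts):
    """Count drug mentions in sentences that also mention the target disease."""
    counts = Counter()
    for abstract in abstracts:
        for sentence in re.split(r"(?<=[.!?])\s+", abstract.lower()):
            if TARGET in sentence:
                counts.update(drug for drug in DRUG_LEXICON if drug in sentence)
    return counts.most_common()

abstracts = [
    "Remdesivir showed in vitro activity against COVID-19.",
    "Chloroquine was evaluated in COVID-19 patients.",
]
print(candidate_agents(abstracts))  # ranked candidate agents
```

A real engine would replace the fixed lexicon with chemical named entity recognition, but the ranking-by-co-occurrence idea carries over.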


2019 ◽  
Author(s):  
Auss Abbood ◽  
Alexander Ullrich ◽  
Rüdiger Busche ◽  
Stéphane Ghozzi

According to the World Health Organization (WHO), around 60% of all outbreaks are detected using informal sources. In many public health institutes, including the WHO and the Robert Koch Institute (RKI), dedicated groups of epidemiologists sift through numerous articles and newsletters to detect relevant events. This media screening is one important part of event-based surveillance (EBS). Reading the articles, discussing their relevance, and putting key information into a database is a time-consuming process. To support EBS, but also to gain insights into what makes an article and the event it describes relevant, we developed a natural-language-processing framework for automated information extraction and relevance scoring. First, we scraped relevant sources for EBS as done at RKI (WHO Disease Outbreak News and ProMED) and automatically extracted the articles’ key data: disease, country, date, and confirmed-case count. For this, we performed named entity recognition in two steps: EpiTator, an open-source epidemiological annotation tool, suggested many different possibilities for each field; we then trained a naive Bayes classifier to find the single most likely one, using RKI’s EBS database as labels. Then, for relevance scoring, we defined two classes to which any article might belong: the article is relevant if it is in the EBS database and irrelevant otherwise. We compared the performance of different classifiers, using document and word embeddings. Two of the tested algorithms stood out: the multilayer perceptron performed best overall, with a precision of 0.19, recall of 0.50, specificity of 0.89, F1 of 0.28, and the highest tested index balanced accuracy of 0.46. The support-vector machine, on the other hand, had the highest recall (0.88), which can be of greater interest to epidemiologists. Finally, we integrated these functionalities into a web application called EventEpi, where relevant sources are automatically analyzed and put into a database. The user can also provide any URL or text, which will be analyzed in the same way and added to the database. Each of these steps could be improved, in particular with larger labeled datasets and fine-tuning of the learning algorithms. The overall framework, however, already works well and can be used in production, promising improvements in EBS. The source code is publicly available at https://github.com/aauss/EventEpi.
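
For a concrete sense of the relevance-scoring step, here is a minimal sketch of training a multilayer perceptron to separate relevant from irrelevant articles, mirroring the setup described above. TF-IDF vectors stand in for the document and word embeddings used in the paper, and the texts and labels are toy placeholders rather than entries from RKI's EBS database.

```python
# Hedged sketch of the relevance-scoring step: an MLP trained to
# separate EBS-relevant articles from irrelevant ones. TF-IDF vectors
# substitute for the paper's embeddings; texts and labels are toys.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.neural_network import MLPClassifier
from sklearn.pipeline import make_pipeline

texts = [
    "Cholera outbreak reported in region X with 120 confirmed cases.",
    "New museum exhibition opens downtown next week.",
]
labels = [1, 0]  # 1 = in the EBS database (relevant), 0 = irrelevant

model = make_pipeline(
    TfidfVectorizer(ngram_range=(1, 2)),
    MLPClassifier(hidden_layer_sizes=(64,), max_iter=500, random_state=0),
)
model.fit(texts, labels)
print(model.predict_proba(["Measles cases rising in province Y"]))
```

Swapping `MLPClassifier` for `sklearn.svm.SVC(probability=True)` would give the high-recall alternative the abstract mentions.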


2019 ◽  
Vol 2 (8) ◽  
pp. e1910399
Author(s):  
Meliha Skaljic ◽  
Ihsaan H. Patel ◽  
Amelia M. Pellegrini ◽  
Victor M. Castro ◽  
Roy H. Perlis ◽  
...  

Energies ◽  
2019 ◽  
Vol 12 (17) ◽  
pp. 3258 ◽  
Author(s):  
Bai ◽  
Sun ◽  
Zang ◽  
Zhang ◽  
Shen ◽  
...  

Power dispatching systems currently receive massive, complicated, and irregular monitoring alarms during operation, which prevents controllers from making accurate judgments about alarm events that occur within a short period of time. To address the low efficiency of handling monitoring alarm information, this paper proposes a method based on natural language processing (NLP) and a hybrid model that combines a long short-term memory (LSTM) network and a convolutional neural network (CNN) for the identification of grid monitoring alarm events. First, the characteristics of the alarm information text were analyzed and summarized, and the text was preprocessed accordingly. Then, the monitoring alarm information was vectorized based on the Word2vec model. Finally, a monitoring alarm event identification model combining LSTM and CNN was built around the characteristics of the alarm information. The feasibility and effectiveness of the method were verified by comparison with multiple identification models.
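
The sketch below shows one plausible shape for such a hybrid LSTM-CNN classifier over vectorized alarm text, assuming messages have already been tokenized into integer indices (e.g. against a Word2vec vocabulary). The vocabulary size, sequence length, class count, and layer arrangement are assumptions, not the paper's configuration.

```python
# Illustrative hybrid LSTM-CNN text classifier for alarm events.
# All sizes below are assumed, not taken from the paper.
from tensorflow.keras import layers, models

VOCAB_SIZE = 5000   # size of the alarm-text vocabulary (assumed)
MAX_LEN = 50        # alarm messages padded/truncated to this length (assumed)
NUM_EVENTS = 10     # number of alarm event classes (assumed)

model = models.Sequential([
    layers.Input(shape=(MAX_LEN,)),
    layers.Embedding(VOCAB_SIZE, 128),   # could be initialized from Word2vec vectors
    layers.Conv1D(64, kernel_size=3, activation="relu"),  # local n-gram features
    layers.MaxPooling1D(pool_size=2),
    layers.LSTM(64),                      # sequence-level context across the message
    layers.Dense(NUM_EVENTS, activation="softmax"),
])
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
model.summary()
```

Running the convolution before the LSTM is one common arrangement for such hybrids; the paper may combine the two branches differently.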


Author(s):  
Maitri Patel ◽  
Dr. Hemant D. Vasava

Data, information, and knowledge move and grow rapidly in today's world, and almost any kind of information can be found on the Internet. This is useful for the academic world as well, but alongside it plagiarism is highly common: fraudulently using someone's original work and failing to acknowledge them degrades the originality of work, and teachers or professors often cannot identify the plagiarized material they are given. Higher-education systems therefore use various tools to compare documents. Our idea is to match a number of different documents, such as student assignments, against each other to find out whether students copied each other's work, and also to compare the ideal answer sheet for a particular subject examination with students' test sheets, ranking them by similarity. Both approaches amount to the same task: comparing documents. Many methods for identifying plagiarism already exist, so we can compare them and extend them where needed.
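
A minimal sketch of the proposed comparison follows, assuming TF-IDF vectors and cosine similarity as the similarity measure (the abstract does not fix a specific method); the ideal answer and submissions are invented examples.

```python
# Sketch: rank student answers by cosine similarity to an ideal answer
# sheet. The documents and the TF-IDF/cosine choice are assumptions.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

ideal_answer = "Photosynthesis converts light energy into chemical energy."
submissions = {
    "student_a": "Photosynthesis turns light energy into chemical energy.",
    "student_b": "Mitochondria are the powerhouse of the cell.",
}

vectorizer = TfidfVectorizer()
matrix = vectorizer.fit_transform([ideal_answer, *submissions.values()])
scores = cosine_similarity(matrix[0:1], matrix[1:]).ravel()

# Rank submissions from most to least similar to the ideal answer.
for name, score in sorted(zip(submissions, scores), key=lambda x: -x[1]):
    print(f"{name}: {score:.2f}")
```

The same pairwise scoring applied between student assignments, rather than against a reference answer, covers the copy-detection use case.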


2015 ◽  
Vol 23 (3) ◽  
pp. 695 ◽  
Author(s):  
Arnaldo Candido Junior ◽  
Célia Magalhães ◽  
Helena Caseli ◽  
Régis Zangirolami

This article aims to evaluate the application of two efficient automatic keyword-extraction methods, used by the Corpus Linguistics and Natural Language Processing communities to generate keywords from literary texts: WordSmith Tools and Latent Dirichlet Allocation (LDA). The two tools chosen for this work have their own specificities and different extraction techniques, which led us to a performance-oriented analysis. Our goal, then, was to understand how each method works and to evaluate its application to literary texts. To this end, we used human analysis by annotators with knowledge of the domain of the texts. The LDA method was used to extract keywords through its integration with Portal Min@s: Corpora de Fala e Escrita, a general corpus-processing system designed for different lines of Corpus Linguistics research. The results of the experiment confirm the effectiveness of WordSmith Tools and LDA in extracting keywords from a literary corpus, and also indicate that human analysis of the lists at a stage prior to the experiments is needed to complement the automatically generated list, cross-referencing the WordSmith Tools and LDA results. They further indicate that the human analyst's linguistic intuition about the lists generated separately by the two methods used in this study was more favorable to using the WordSmith Tools keyword list.
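
As a hedged illustration of the LDA side of this comparison, the sketch below fits a topic model on a toy corpus and reads off the highest-weight words per topic as candidate keywords; the topic count and corpus are assumptions, not the Portal Min@s configuration.

```python
# Sketch: LDA-based keyword extraction. Fit a topic model on a (toy)
# corpus and take the top-weighted words per topic as candidate keywords.
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.decomposition import LatentDirichletAllocation

corpus = [
    "the sea and the ship and the storm",
    "the garden and the flowers in spring",
    "the captain sailed the ship across the sea",
]

vectorizer = CountVectorizer(stop_words="english")
counts = vectorizer.fit_transform(corpus)
lda = LatentDirichletAllocation(n_components=2, random_state=0).fit(counts)

terms = vectorizer.get_feature_names_out()
for i, topic in enumerate(lda.components_):
    top = [terms[j] for j in topic.argsort()[-5:][::-1]]  # top 5 words per topic
    print(f"topic {i}: {top}")
```

WordSmith Tools, by contrast, derives keywords from keyness statistics against a reference corpus, which is why the article treats the two methods as complementary.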

