A Novel Approach to Paraphrase Hindi Sentences using Natural Language Processing

Author(s):  
Nandini Sethi ◽  
Prateek Agrawal ◽  
Vishu Madaan ◽  
Sanjay Kumar Singh


2018 ◽  
Author(s):  
Massimo Stella

This technical report outlines the mechanisms and potential applications of SentiMental, a suite of natural language processing algorithms designed and implemented by Massimo Stella, Complex Science Consulting. It briefly describes SentiMental's novel approach to sentiment and emotional analysis, which directly harnesses the whole structure of the mental lexicon rather than relying on affect norms. It also outlines the direct emotional profiling and the visualisations currently implemented in version 0.1 of SentiMental, and discusses features under development and current limitations. This technical report is not meant as a publication: the author holds full copyright, and any reproduction of parts of this report must be authorised by the copyright holder. SentiMental is a work in progress, so do not hesitate to get in touch with the author with any feedback.
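
The idea of profiling emotions through the structure of a word-association network, rather than per-word affect norms, can be sketched as follows. This is a hypothetical illustration, not SentiMental's actual implementation: the mini-lexicon, the seed emotions, and the one-hop propagation rule are all invented for this example.

```python
# Hypothetical sketch: emotional profiling via the structure of a mental
# lexicon (a word-association network) instead of per-word affect norms.
# The association edges and emotion seeds below are invented.

from collections import Counter

# Edges of a toy free-association network (undirected).
ASSOCIATIONS = [
    ("joy", "smile"), ("smile", "friend"), ("friend", "trust"),
    ("fear", "dark"), ("dark", "night"), ("anger", "fight"),
]

# Seed words with known emotions; other words inherit from neighbours.
SEEDS = {"joy": "happiness", "fear": "fear", "anger": "anger"}

def build_graph(edges):
    graph = {}
    for a, b in edges:
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set()).add(a)
    return graph

def emotional_profile(text, graph, seeds):
    """Count emotions evoked by words in `text`; a non-seed word takes
    the emotion of any seed word it is directly associated with."""
    profile = Counter()
    for word in text.lower().split():
        if word in seeds:
            profile[seeds[word]] += 1
        else:
            for neighbour in graph.get(word, ()):
                if neighbour in seeds:
                    profile[seeds[neighbour]] += 1
    return profile

graph = build_graph(ASSOCIATIONS)
print(emotional_profile("a smile in the dark", graph, SEEDS))
```

Here "smile" and "dark" carry no affect norms of their own; they evoke "happiness" and "fear" purely through their network neighbours "joy" and "fear".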


2014 ◽  
Vol 22 (1) ◽  
pp. 132-142 ◽  
Author(s):  
Ching-Heng Lin ◽  
Nai-Yuan Wu ◽  
Wei-Shao Lai ◽  
Der-Ming Liou

Abstract

Background and objective: Electronic medical records with encoded entries should enhance the semantic interoperability of document exchange. However, it remains a challenge to encode narrative concepts and to transform the coded concepts into a standard entry-level document. This study aimed to use a novel approach for the generation of entry-level interoperable clinical documents.

Methods: Using the HL7 clinical document architecture (CDA) as the example, we developed three pipelines to generate entry-level CDA documents. The first approach was a semi-automatic annotation pipeline (SAAP), the second was a natural language processing (NLP) pipeline, and the third merged the two. We randomly selected 50 test documents from the i2b2 corpora to evaluate the performance of the three pipelines.

Results: The 50 randomly selected test documents contained 9365 words, including 588 Observation terms and 123 Procedure terms. For the Observation terms, the merged pipeline had a significantly higher F-measure than the NLP pipeline (0.89 vs 0.80, p < 0.0001), but a similar F-measure to that of the SAAP (0.89 vs 0.87). For the Procedure terms, the F-measure did not differ significantly among the three pipelines.

Conclusions: The combination of a semi-automatic annotation approach and the NLP application appears to be a viable solution for generating entry-level interoperable clinical documents.
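
The merged-pipeline idea and the F-measure evaluation can be sketched in a few lines. This is an illustrative toy, not the paper's code: the gold-standard terms and the union-merge rule are invented assumptions for the example.

```python
# Illustrative sketch (not the paper's code): merging entry-level terms
# found by a semi-automatic annotation pass (SAAP) with terms found by
# an NLP pass, then scoring against a gold standard with F-measure.

def merge_pipelines(saap_terms, nlp_terms):
    """A simple merge rule: the union of both pipelines' extracted terms."""
    return set(saap_terms) | set(nlp_terms)

def f_measure(predicted, gold):
    predicted, gold = set(predicted), set(gold)
    true_positives = len(predicted & gold)
    if not predicted or not gold or true_positives == 0:
        return 0.0
    precision = true_positives / len(predicted)
    recall = true_positives / len(gold)
    return 2 * precision * recall / (precision + recall)

# Toy gold-standard Observation terms (invented for illustration).
gold = {"blood pressure", "heart rate", "temperature", "glucose"}
saap = {"blood pressure", "heart rate"}
nlp = {"heart rate", "glucose", "weight"}  # "weight" is a false positive

merged = merge_pipelines(saap, nlp)
print(round(f_measure(merged, gold), 2))  # → 0.75
```

The merged set recovers terms that either single pipeline missed, which is consistent with the merged pipeline's higher F-measure for Observation terms reported above.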


2017 ◽  
Vol 11 (03) ◽  
pp. 345-371
Author(s):  
Avani Chandurkar ◽  
Ajay Bansal

Since the inception of the World Wide Web, the amount of data on the Internet has become tremendous, making it difficult for users to navigate. As users struggle to find their way through this wealth of information, the need for an automated system that can extract the required information becomes urgent. This paper presents a Question Answering system to ease the process of information retrieval. Question Answering systems have existed for quite some time as a sub-field of information retrieval and natural language processing. The task of any Question Answering system is to seek an answer to a free-form factual question. The difficulty of pinpointing and verifying the precise answer makes question answering more challenging than the simple information retrieval done by search engines. The research objective of this paper is to develop a novel approach to Question Answering based on a composition of conventional approaches from Information Retrieval (IR) and Natural Language Processing (NLP). The focus is on using a structured and annotated knowledge base instead of an unstructured one. The knowledge base used here is DBpedia, and the final system is evaluated on the Text REtrieval Conference (TREC) 2004 question dataset.
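
The IR + NLP composition over a structured knowledge base can be sketched as follows. This is a minimal stand-in, not the paper's system: a tiny in-memory triple store replaces DBpedia, and the question patterns and triples are invented for illustration.

```python
# Minimal sketch of the IR + NLP composition over a structured KB,
# using a tiny in-memory triple store in place of DBpedia. The triples
# and question patterns below are invented for illustration.

import re

# (subject, property, value) triples, DBpedia-style but hand-written.
TRIPLES = [
    ("Mount_Everest", "elevation", "8848 m"),
    ("Mount_Everest", "location", "Nepal"),
    ("Amazon_River", "length", "6400 km"),
]

# NLP step: map simple factual question patterns to a KB property.
PATTERNS = [
    (re.compile(r"how (?:tall|high) is (.+)\?", re.I), "elevation"),
    (re.compile(r"how long is (.+)\?", re.I), "length"),
    (re.compile(r"where is (.+)\?", re.I), "location"),
]

def answer(question):
    for pattern, prop in PATTERNS:
        match = pattern.match(question.strip())
        if match:
            # IR step: retrieve the matching entity from the KB.
            entity = match.group(1).strip().replace(" ", "_")
            for subject, p, value in TRIPLES:
                if subject.lower() == entity.lower() and p == prop:
                    return value
    return None

print(answer("How tall is Mount Everest?"))  # → 8848 m
```

Because the knowledge base is structured and annotated, the precise answer is a single property lookup once the question's focus entity and target property are identified, rather than a passage-ranking problem.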


2017 ◽  
Vol 10 (13) ◽  
pp. 365
Author(s):  
Prafful Nath Mathur ◽  
Abhishek Dixit ◽  
Sakkaravarthi Ramanathan

This paper implements a novel approach to recommending jobs and colleges based on the résumés of freshly graduated students. Job postings are crawled from the web using a web crawler and stored in a customised database; college lists for post-graduate streams are also retrieved and stored in a database. Each student's résumé is stored and parsed using natural language processing methods to form a résumé model. Text mining algorithms are applied to this model to extract useful information (i.e., degree, technical skills, extracurricular skills, current location, and hobbies), which is then used to suggest matching jobs and colleges to the candidate.
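
The extraction-and-matching step can be sketched as below. This is a hedged illustration, not the paper's method: the skill vocabulary, the toy job postings, and the overlap-count ranking are all assumptions made for the example.

```python
# Hedged sketch of the matching step: skills are extracted from résumé
# text against a known skill vocabulary, then job postings are ranked
# by skill overlap. Vocabulary and postings are invented.

SKILL_VOCAB = {"python", "java", "sql", "machine learning", "excel"}

JOBS = {
    "Data Analyst": {"sql", "excel", "python"},
    "Backend Developer": {"java", "sql"},
    "ML Engineer": {"python", "machine learning"},
}

def extract_skills(resume_text, vocab=SKILL_VOCAB):
    """Naive extraction: keep vocabulary skills mentioned in the text."""
    text = resume_text.lower()
    return {skill for skill in vocab if skill in text}

def rank_jobs(skills, jobs=JOBS):
    """Rank postings by number of matching skills, best match first."""
    scored = [(len(required & skills), title) for title, required in jobs.items()]
    return [title for score, title in sorted(scored, reverse=True) if score > 0]

resume = "B.Tech graduate with projects in Python and machine learning."
print(rank_jobs(extract_skills(resume)))  # → ['ML Engineer', 'Data Analyst']
```

A production system would replace the substring lookup with the NLP-based résumé model described above, but the ranking-by-overlap idea carries over directly.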


Digital ◽  
2021 ◽  
Vol 1 (4) ◽  
pp. 198-215
Author(s):  
Dhiren A. Audich ◽  
Rozita Dara ◽  
Blair Nonnecke

Privacy policies play an important part in informing users about their privacy concerns by operating as memoranda of understanding (MOUs) between users and online service providers. Research suggests that these policies are infrequently read because they are often lengthy, written in jargon, and incomplete, making them difficult for most users to understand. Users are more likely to read short excerpts of privacy policies if those excerpts pertain directly to their concern. In this paper, a novel approach and a proof-of-concept tool are proposed that reduce the amount of privacy policy text a user has to read. The tool uses a domain ontology and natural language processing (NLP) to identify the key areas of a policy that users should read to address their concerns and take appropriate action. Using the ontology to locate key parts of privacy policies, average reading times were substantially reduced from 29–32 minutes to 45 seconds.
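
The ontology-guided reduction can be sketched as follows. This is an invented illustration, not the authors' tool: the concern ontology, its indicator terms, and the sample policy paragraphs are all assumptions for the example.

```python
# Illustrative sketch of ontology-guided reduction: a small concern
# ontology maps each user concern to indicator terms, and only policy
# paragraphs containing those terms are surfaced. The ontology and the
# policy text below are invented.

ONTOLOGY = {
    "data sharing": {"third party", "share", "disclose"},
    "data retention": {"retain", "retention", "delete"},
}

POLICY = [
    "We collect your email address when you register.",
    "We may share your data with third party advertisers.",
    "We retain your records for five years unless you request deletion.",
]

def relevant_paragraphs(concern, policy=POLICY, ontology=ONTOLOGY):
    """Return only the paragraphs containing terms linked to the concern."""
    terms = ontology.get(concern, set())
    return [p for p in policy if any(t in p.lower() for t in terms)]

print(relevant_paragraphs("data sharing"))
```

Instead of reading the full policy, the user concerned about data sharing sees one paragraph out of three, which is the same reading-time reduction effect the study measured.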

