Natural Language Processing in OTF Computing: Challenges and the Need for Interactive Approaches

Computers ◽  
2019 ◽  
Vol 8 (1) ◽  
pp. 22
Author(s):  
Frederik Bäumer ◽  
Joschka Kersting ◽  
Michaela Geierhos

The vision of On-the-Fly (OTF) Computing is to compose and provide software services ad hoc, based on requirement descriptions in natural language. Since non-technical users write their software requirements themselves and in unrestricted natural language, deficits such as inaccuracy and incompleteness occur. These deficits are usually addressed by natural language processing methods, which face special challenges in OTF Computing because maximum automation is the goal. In this paper, we present current automatic approaches for resolving inaccuracy and incompleteness in natural language requirement descriptions and elaborate on open challenges. In particular, we discuss the necessity of domain-specific resources and show why, despite far-reaching automation, an intelligent and guided integration of end users into the compensation process is required. In this context, we present our idea of a chatbot that integrates users into the compensation process depending on the given circumstances.

Author(s):  
Santosh Kumar Mishra ◽  
Rijul Dhir ◽  
Sriparna Saha ◽  
Pushpak Bhattacharyya

Image captioning is the process of generating a textual description of an image that aims to describe its salient parts. It is an important problem, as it involves both computer vision and natural language processing: computer vision is used for understanding images, and natural language processing is used for language modeling. A large body of work exists on image captioning for the English language. In this article, we develop a model for image captioning in the Hindi language. Hindi is the official language of India and the fourth most spoken language in the world, spoken in India and South Asia. To the best of our knowledge, this is the first attempt to generate image captions in Hindi. A dataset is manually created by translating the well-known MSCOCO dataset from English to Hindi. Finally, different types of attention-based architectures are developed for image captioning in Hindi; these attention mechanisms have never before been used for the Hindi language. The results of the proposed model are compared with several baselines in terms of BLEU scores, and they show that our model performs better than the others. Manual evaluation of the obtained captions in terms of adequacy and fluency also reveals the effectiveness of our proposed approach. Availability of resources: the code for the article is available at https://github.com/santosh1821cs03/Image_Captioning_Hindi_Language ; the dataset will be made available at http://www.iitp.ac.in/∼ai-nlp-ml/resources.html .
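The abstract does not detail its attention-based architectures; as a rough illustration only, one common building block in attention-based captioning is an additive (Bahdanau-style) attention step over image-region features. The sketch below is a generic version of that idea; all array shapes, weight names, and the function name are hypothetical and not taken from the paper:

```python
import numpy as np

def additive_attention(features, hidden, W_f, W_h, v):
    """One additive-attention step: score each image region against the
    decoder's hidden state, softmax the scores, and return the weighted
    sum of region features (the 'context vector')."""
    scores = np.tanh(features @ W_f + hidden @ W_h) @ v   # one score per region
    weights = np.exp(scores - scores.max())               # numerically stable softmax
    weights /= weights.sum()
    context = weights @ features                          # weighted sum of regions
    return context, weights

# Toy example: 49 regions (a 7x7 grid) with 64-dim features.
rng = np.random.default_rng(0)
features = rng.normal(size=(49, 64))
hidden = rng.normal(size=(32,))
W_f = rng.normal(size=(64, 16))
W_h = rng.normal(size=(32, 16))
v = rng.normal(size=(16,))

context, weights = additive_attention(features, hidden, W_f, W_h, v)
```

In a full captioning model this step would run once per generated word, with the context vector fed into the decoder; the sketch shows only the scoring-and-pooling core.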


2017 ◽  
Vol 6 (5) ◽  
pp. 281
Author(s):  
Rishabh Shah ◽  
Siddhant Lahoti ◽  
K. Lavanya

2021 ◽  
Author(s):  
Minoru Yoshida ◽  
Kenji Kita

Both words and numerals are tokens found in almost all documents, but they have different properties. However, relatively little attention has been paid to numerals found in texts, and many systems have treated the numbers found in a document in ad-hoc ways, such as regarding them as mere strings in the same way as words, normalizing them to zeros, or simply ignoring them. The recent growth of natural language processing (NLP) research areas has changed this situation, and more and more attention has been paid to numeracy in documents. In this survey, we provide a quick overview of the history and recent advances of research on mining the relations between numerals and words found in text data.
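The ad-hoc treatments the survey lists (treating numerals as plain strings, normalizing them to zeros, or ignoring them) can be made concrete with a small sketch. The function name and strategy labels below are illustrative, not from the survey:

```python
import re

def normalize_numerals(tokens, strategy="zero"):
    """Apply one of the ad-hoc numeral treatments to a token list:
    'keep'  - treat numerals as ordinary word strings,
    'zero'  - replace every digit with 0 (e.g. 1234.5 -> 0000.0),
    'drop'  - remove numeral tokens entirely."""
    out = []
    for tok in tokens:
        if re.fullmatch(r"\d+(?:\.\d+)?", tok):   # a plain integer or decimal
            if strategy == "zero":
                out.append(re.sub(r"\d", "0", tok))
            elif strategy == "drop":
                continue
            else:                                  # "keep"
                out.append(tok)
        else:
            out.append(tok)
    return out
```

Each strategy loses different information: 'zero' keeps the shape of the number (digit count, decimal point) while discarding its magnitude, which is one reason recent work on numeracy moves beyond these heuristics.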


2018 ◽  
Vol 12 (02) ◽  
pp. 237-260
Author(s):  
Weifeng Xu ◽  
Dianxiang Xu ◽  
Abdulrahman Alatawi ◽  
Omar El Ariss ◽  
Yunkai Liu

The unigram is a fundamental element of the n-gram in natural language processing. However, unigrams collected from a natural language corpus are unsuitable for solving problems in the domain of computer programming languages. In this paper, we analyze the properties of unigrams collected from an ultra-large source code repository. Specifically, we have collected 1.01 billion unigrams from 0.7 million open source projects hosted at GitHub.com. By analyzing these unigrams, we have discovered statistical properties regarding (1) how developers name variables, methods, and classes, and (2) how developers choose abbreviations. We describe a probabilistic model that relies on these properties for solving a well-known problem in source code analysis: how to expand a given abbreviation to its original intended word. Our empirical study shows that using unigrams extracted from a source code repository outperforms using a natural language corpus by 21% when solving this domain-specific problem.
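The paper's probabilistic model is not specified in the abstract. As a minimal sketch of one plausible component, an abbreviation can be scored against candidate expansions by unigram frequency among words that contain the abbreviation as a subsequence; the unigram counts and helper names below are hypothetical:

```python
from collections import Counter

def is_subsequence(abbr, word):
    """True if the characters of abbr appear in word, in order."""
    it = iter(word)
    return all(ch in it for ch in abbr)   # 'ch in it' consumes the iterator

def expand(abbr, unigrams):
    """Pick the most frequent unigram that starts with the abbreviation's
    first letter and contains it as a subsequence."""
    candidates = {w: c for w, c in unigrams.items()
                  if w.startswith(abbr[0]) and is_subsequence(abbr, w)}
    if not candidates:
        return None
    return max(candidates, key=candidates.get)

# Toy unigram counts standing in for counts mined from source code.
unigrams = Counter({"message": 120, "manager": 80, "msg": 5, "method": 60})
```

A real model would combine such frequencies with the paper's naming and abbreviation properties (e.g. how developers truncate or remove vowels), but the frequency-weighted subsequence match illustrates the basic shape of the task.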


Author(s):  
Matthew W. Crocker

Traditional approaches to natural language processing (NLP) can be considered construction-based. That is to say, they employ surface-oriented, language-specific rules, whether in the form of an Augmented Transition Network (ATN), a logic grammar, or some other grammar/parsing formalism. The problems with such approaches have always been apparent: they involve large sets of often ad hoc rules, and their adequacy with respect to the grammar of the language is difficult to ensure.


2021 ◽  
Vol 113 ◽  
pp. 103665
Author(s):  
Timothy L. Chen ◽  
Max Emerling ◽  
Gunvant R. Chaudhari ◽  
Yeshwant R. Chillakuru ◽  
Youngho Seo ◽  
...  

TEKNO ◽  
2019 ◽  
Vol 29 (2) ◽  
pp. 129
Author(s):  
Yohanes Dhimas Firman Syahputra ◽  
Syaad Patmanthara ◽  
Heru Wahyu Herwanto

The result of developing an artificial-intelligence application in the form of a chatbot to help companies educate their customers using a natural language processing (NLP) system was obtained through a system development method. This chatbot, intended to help companies educate customers with NLP, was developed so that a computer can perform certain tasks as a human would, such as a chat robot (chatbot): a system that adopts human knowledge into a computer so that the computer can hold conversations with users. The chatbot's ability to answer questions is determined by the size of its dataset, so the answer data should be enlarged so that it understands more customer questions. Based on the chatbot development trials that were carried out, the score obtained was 88.94%. Based on the feasibility category table, the chatbot developed in this study can be declared "very feasible" for use.


Author(s):  
Arkodeep Biswas ◽  
Ajay Kaushik

The objective of this paper is to build a web application based on a virtual voice and chat assistant. The current study focuses specifically on the development of a voice and text/chat bot. It is being built especially for people who feel depressed, and it encourages them to talk open-mindedly, which in turn pacifies them. As the name of the application suggests, it is an application to pacify people and make them as happy as a cat would be with its mother (the reason a cat purrs). We will be using Dialogflow for the application design, and machine learning, as a part of artificial intelligence, for natural language processing (NLP), one of the easiest ways to use machine-learning libraries. At the back end we will be using a database to store the communication history between the user and the bot. This application will only work on devices with web operating system version 5.0 and above.

