Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures (Extended Abstract)

Automatic image description generation is a challenging problem that has recently received a large amount of interest from the computer vision and natural language processing communities. In this survey, we classify the known approaches based on how they conceptualise this problem and provide a review of existing models, highlighting their advantages and disadvantages. Moreover, we give an overview of the benchmark image-text datasets and the evaluation measures that have been developed to assess the quality of machine-generated descriptions. Finally we explore future directions in the area of automatic image description.

Download Full-text

Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures

Journal of Artificial Intelligence Research ◽

10.1613/jair.4900 ◽

2016 ◽

Vol 55 ◽

pp. 409-442 ◽

Cited By ~ 73

Author(s):

Raffaella Bernardi ◽

Ruket Cakici ◽

Desmond Elliott ◽

Aykut Erdem ◽

Erkut Erdem ◽

...

Keyword(s):

Language Processing ◽

Image Description ◽

Future Directions ◽

Evaluation Measures ◽

Advantages And Disadvantages ◽

Representational Space ◽

Retrieval Problem ◽

Image Descriptions ◽

Generation Problem

Automatic description generation from natural images is a challenging problem that has recently received a large amount of interest from the computer vision and natural language processing communities. In this survey, we classify the existing approaches based on how they conceptualize this problem, viz., models that cast description as either generation problem or as a retrieval problem over a visual or multimodal representational space. We provide a detailed review of existing models, highlighting their advantages and disadvantages. Moreover, we give an overview of the benchmark image datasets and the evaluation measures that have been developed to assess the quality of machine-generated image descriptions. Finally we extrapolate future directions in the area of automatic image description generation.

Download Full-text

RussianLanguage Thesauri: Automated Construction and Application For Natural Language Processing Tasks

Modeling and Analysis of Information Systems ◽

10.18255/1818-1015-2018-4-435-458 ◽

2018 ◽

Vol 25 (4) ◽

pp. 435-458

Author(s):

Nadezhda S. Lagutina ◽

Ksenia V. Lagutina ◽

Aleksey S. Adrianov ◽

Ilya V. Paramonov

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Russian Language ◽

Labor Costs ◽

Linguistic Resources ◽

Advantages And Disadvantages ◽

Text Corpora ◽

Linguistic Methods

The paper reviews the existing Russian-language thesauri in digital form and methods of their automatic construction and application. The authors analyzed the main characteristics of open access thesauri for scientific research, evaluated trends of their development, and their effectiveness in solving natural language processing tasks. The statistical and linguistic methods of thesaurus construction that allow to automate the development and reduce labor costs of expert linguists were studied. In particular, the authors considered algorithms for extracting keywords and semantic thesaurus relationships of all types, as well as the quality of thesauri generated with the use of these tools. To illustrate features of various methods for constructing thesaurus relationships, the authors developed a combined method that generates a specialized thesaurus fully automatically taking into account a text corpus in a particular domain and several existing linguistic resources. With the proposed method, experiments were conducted with two Russian-language text corpora from two subject areas: articles about migrants and tweets. The resulting thesauri were assessed by using an integrated assessment developed in the previous authors’ study that allows to analyze various aspects of the thesaurus and the quality of the generation methods. The analysis revealed the main advantages and disadvantages of various approaches to the construction of thesauri and the extraction of semantic relationships of different types, as well as made it possible to determine directions for future study.

Download Full-text

A Hindi Image Caption Generation Framework Using Deep Learning

ACM Transactions on Asian and Low-Resource Language Information Processing ◽

10.1145/3432246 ◽

2021 ◽

Vol 20 (2) ◽

pp. 1-19

Author(s):

Santosh Kumar Mishra ◽

Rijul Dhir ◽

Sriparna Saha ◽

Pushpak Bhattacharyya

Keyword(s):

Computer Vision ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

English Language ◽

Image Captioning ◽

Textual Description ◽

Proposed Model ◽

Hindi Language ◽

The Given

Image captioning is the process of generating a textual description of an image that aims to describe the salient parts of the given image. It is an important problem, as it involves computer vision and natural language processing, where computer vision is used for understanding images, and natural language processing is used for language modeling. A lot of works have been done for image captioning for the English language. In this article, we have developed a model for image captioning in the Hindi language. Hindi is the official language of India, and it is the fourth most spoken language in the world, spoken in India and South Asia. To the best of our knowledge, this is the first attempt to generate image captions in the Hindi language. A dataset is manually created by translating well known MSCOCO dataset from English to Hindi. Finally, different types of attention-based architectures are developed for image captioning in the Hindi language. These attention mechanisms are new for the Hindi language, as those have never been used for the Hindi language. The obtained results of the proposed model are compared with several baselines in terms of BLEU scores, and the results show that our model performs better than others. Manual evaluation of the obtained captions in terms of adequacy and fluency also reveals the effectiveness of our proposed approach. Availability of resources : The codes of the article are available at https://github.com/santosh1821cs03/Image_Captioning_Hindi_Language ; The dataset will be made available: http://www.iitp.ac.in/∼ai-nlp-ml/resources.html .

Download Full-text

Sentiment of App with Word Vectors

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.f1416.0986s319 ◽

2019 ◽

Vol 8 (6S3) ◽

pp. 2156-2159

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Sentiment Analysis ◽

Language Processing ◽

Text Data ◽

Vector Representations ◽

Text Sentiment Analysis

Vector representations for language have been shown to be useful in a number of Natural Language Processing tasks. In this paper, we aim to investigate the effectiveness of word vector representations for the problem of Sentiment Analysis. In particular, we target three sub-tasks namely sentiment words extraction, polarity of sentiment words detection, and text sentiment prediction. We investigate the effectiveness of vector representations over different text data and evaluate the quality of domain-dependent vectors. Vector representations has been used to compute various vector-based features and conduct systematically experiments to demonstrate their effectiveness. Using simple vector based features can achieve better results for text sentiment analysis of APP.

Download Full-text

Explainability in Time Series Forecasting, Natural Language Processing, and Computer Vision

Explainable Artificial Intelligence: An Introduction to Interpretable Machine Learning ◽

10.1007/978-3-030-83356-5_7 ◽

2021 ◽

pp. 261-302

Author(s):

Uday Kamath ◽

John Liu

Keyword(s):

Computer Vision ◽

Time Series ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Time Series Forecasting

Download Full-text

Automatic Identification of Information Quality Metrics in Health News Stories

Frontiers in Public Health ◽

10.3389/fpubh.2020.515347 ◽

2020 ◽

Vol 8 ◽

Author(s):

Majed Al-Jefri ◽

Roger Evans ◽

Joon Lee ◽

Pietro Ghezzi

Keyword(s):

Machine Learning ◽

Health Care ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Information Quality ◽

Evaluation Process ◽

Health News ◽

News Stories

Objective: Many online and printed media publish health news of questionable trustworthiness and it may be difficult for laypersons to determine the information quality of such articles. The purpose of this work was to propose a methodology for the automatic assessment of the quality of health-related news stories using natural language processing and machine learning.Materials and Methods: We used a database from the website HealthNewsReview.org that aims to improve the public dialogue about health care. HealthNewsReview.org developed a set of criteria to critically analyze health care interventions' claims. In this work, we attempt to automate the evaluation process by identifying the indicators of those criteria using natural language processing-based machine learning on a corpus of more than 1,300 news stories. We explored features ranging from simple n-grams to more advanced linguistic features and optimized the feature selection for each task. Additionally, we experimented with the use of pre-trained natural language model BERT.Results: For some criteria, such as mention of costs, benefits, harms, and “disease-mongering,” the evaluation results were promising with an F1 measure reaching 81.94%, while for others the results were less satisfactory due to the dataset size, the need of external knowledge, or the subjectivity in the evaluation process.Conclusion: These used criteria are more challenging than those addressed by previous work, and our aim was to investigate how much more difficult the machine learning task was, and how and why it varied between criteria. For some criteria, the obtained results were promising; however, automated evaluation of the other criteria may not yet replace the manual evaluation process where human experts interpret text senses and make use of external knowledge in their assessment.

Download Full-text

Identifying Heart Failure Symptoms and Poor Self-Management in Home Healthcare: A Natural Language Processing Study

10.3233/shti210653 ◽

2021 ◽

Author(s):

Sena Chae ◽

Jiyoun Song ◽

Marietta Ojo ◽

Maxim Topaz

Keyword(s):

Heart Failure ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Symptom Management ◽

Home Healthcare ◽

Self Management ◽

Clinical Notes ◽

Patients With Heart Failure

The goal of this natural language processing (NLP) study was to identify patients in home healthcare with heart failure symptoms and poor self-management (SM). The preliminary lists of symptoms and poor SM status were identified, NLP algorithms were used to refine the lists, and NLP performance was evaluated using 2.3 million home healthcare clinical notes. The overall precision to identify patients with heart failure symptoms and poor SM status was 0.86. The feasibility of methods was demonstrated to identify patients with heart failure symptoms and poor SM documented in home healthcare notes. This study facilitates utilizing key symptom information and patients’ SM status from unstructured data in electronic health records. The results of this study can be applied to better individualize symptom management to support heart failure patients’ quality-of-life.

Download Full-text

Natural Language Processing

Annual Review of Applied Linguistics ◽

10.1017/s0267190500001446 ◽

1996 ◽

Vol 16 ◽

pp. 70-85 ◽

Cited By ~ 5

Author(s):

Thomas C. Rindflesch

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Computational Linguistics ◽

Language Processing ◽

Domain Knowledge ◽

The State ◽

Point Of View ◽

Computer Applications ◽

Significant Progress ◽

Future Directions

Work in computational linguistics began very soon after the development of the first computers (Booth, Brandwood and Cleave 1958), yet in the intervening four decades there has been a pervasive feeling that progress in computer understanding of natural language has not been commensurate with progress in other computer applications. Recently, a number of prominent researchers in natural language processing met to assess the state of the discipline and discuss future directions (Bates and Weischedel 1993). The consensus of this meeting was that increased attention to large amounts of lexical and domain knowledge was essential for significant progress, and current research efforts in the field reflect this point of view.

Download Full-text

Computer Vision and Natural Language Processing

ACM Computing Surveys ◽

10.1145/3009906 ◽

2017 ◽

Vol 49 (4) ◽

pp. 1-44 ◽

Cited By ~ 11

Author(s):

Peratham Wiriyathammabhum ◽

Douglas Summers-Stay ◽

Cornelia Fermüller ◽

Yiannis Aloimonos

Keyword(s):

Computer Vision ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing

Download Full-text

The Concept of Integrating Artificial Intelligence Technologies Into Human Resources in a Digital Paradigm

Management of the personnel and intellectual resources in Russia ◽

10.12737/2305-7807-2020-5-9 ◽

2020 ◽

Vol 9 (2) ◽

pp. 5-9

Author(s):

Oksana Chulanova

Keyword(s):

Artificial Intelligence ◽

Computer Vision ◽

Natural Language Processing ◽

Decision Support ◽

Speech Recognition ◽

Human Resources ◽

Natural Language ◽

Language Processing

The article discusses the capabilities of artificial intelligence technologies - technologies based on the use of artificial intelligence, including natural language processing, intellectual decision support, computer vision, speech recognition and synthesis, and promising methods of artificial intelligence. The results of the author's study and the analysis of artificial intelligence technologies and their capabilities for optimizing work with staff are presented. A study conducted by the author allowed us to develop an author's concept of integrating artificial intelligence technologies into work with personnel in the digital paradigm.

Download Full-text