Caption to Voice Bot for Assistive Vision

Nandita S

doi:10.22214/ijraset.2021.35244

Caption to Voice Bot for Assistive Vision

International Journal for Research in Applied Science and Engineering Technology ◽

10.22214/ijraset.2021.35244 ◽

2021 ◽

Vol 9 (VI) ◽

pp. 1304-1308

Author(s):

Nandita S

Keyword(s):

Artificial Intelligence ◽

Natural Language ◽

Language Processing ◽

Visually Impaired ◽

Rapid Development ◽

Read Aloud ◽

Research Groups ◽

Large Component ◽

Artificial Intelligence Research ◽

Image Caption

Over the last few years, with the rapid development of artificial intelligence, the generation of the caption of images has progressively caught the considerable interest of several artificial intelligence research groups and has become a fascinating and tedious mission. A large component of scene comprehension, which encompasses the knowledge of computer vision and natural language processing, is image caption, which automatically produces natural language explanations according to the content observed in an image. The applications of such an image caption are substantial and noteworthy. The prime intention of the project is to build an object detection and captioning module that produces captions from the features extracted from the input images fed to the module in the form of audio and interface it with a virtual text reader, a read-aloud technology. Additionally, both these features can be accomplished using live images. The module as a whole helps the visually impaired identify objects and their positions.

Download Full-text

An Overview of Image Caption Generation Methods

Computational Intelligence and Neuroscience ◽

10.1155/2020/3062706 ◽

2020 ◽

Vol 2020 ◽

pp. 1-13 ◽

Cited By ~ 2

Author(s):

Haoran Wang ◽

Yue Zhang ◽

Xiaosheng Yu

Keyword(s):

Artificial Intelligence ◽

Computer Vision ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Rapid Development ◽

Evaluation Criteria ◽

Arduous Task ◽

Image Caption Generation ◽

Image Caption

In recent years, with the rapid development of artificial intelligence, image caption has gradually attracted the attention of many researchers in the field of artificial intelligence and has become an interesting and arduous task. Image caption, automatically generating natural language descriptions according to the content observed in an image, is an important part of scene understanding, which combines the knowledge of computer vision and natural language processing. The application of image caption is extensive and significant, for example, the realization of human-computer interaction. This paper summarizes the related methods and focuses on the attention mechanism, which plays an important role in computer vision and is recently widely used in image caption generation tasks. Furthermore, the advantages and the shortcomings of these methods are discussed, providing the commonly used datasets and evaluation criteria in this field. Finally, this paper highlights some open challenges in the image caption task.

Download Full-text

An interactive dashboard to track themes, development maturity, and global equity in clinical artificial intelligence research

10.1101/2021.11.23.21266758 ◽

2021 ◽

Author(s):

Joe Zhang ◽

Stephen Whebell ◽

Jack Gallifant ◽

Sanjay Budhdeo ◽

Heather Mattie ◽

...

Keyword(s):

Artificial Intelligence ◽

Natural Language Processing ◽

Natural Language ◽

Real Time ◽

Language Processing ◽

Artificial Intelligence Research ◽

End To End ◽

Monitoring Progress

The global clinical artificial intelligence (AI) research landscape is constantly evolving, with heterogeneity across specialties, disease areas, geographical representation, and development maturity. Continual assessment of this landscape is important for monitoring progress. Taking advantage of developments in natural language processing (NLP), we produce an end-to-end NLP pipeline to automate classification and characterization of all original clinical AI research on MEDLINE, outputting real-time results to a public, interactive dashboard (https://aiforhealth.app/).

Download Full-text

Does higher education properly prepare graduates for the growing artificial intelligence market? Gaps identification using text mining

Human Systems Management ◽

10.3233/hsm-211179 ◽

2021 ◽

pp. 1-13

Author(s):

Lamiae Benhayoun ◽

Daniel Lang

Keyword(s):

Artificial Intelligence ◽

Natural Language Processing ◽

Text Mining ◽

Natural Language ◽

Language Processing ◽

Academic Training ◽

Market Requirements ◽

Job Advertisements ◽

The Individual

BACKGROUND: The renewed advent of Artificial Intelligence (AI) is inducing profound changes in the classic categories of technology professions and is creating the need for new specific skills. OBJECTIVE: Identify the gaps in terms of skills between academic training on AI in French engineering and Business Schools, and the requirements of the labour market. METHOD: Extraction of AI training contents from the schools’ websites and scraping of a job advertisements’ website. Then, analysis based on a text mining approach with a Python code for Natural Language Processing. RESULTS: Categorization of occupations related to AI. Characterization of three classes of skills for the AI market: Technical, Soft and Interdisciplinary. Skills’ gaps concern some professional certifications and the mastery of specific tools, research abilities, and awareness of ethical and regulatory dimensions of AI. CONCLUSIONS: A deep analysis using algorithms for Natural Language Processing. Results that provide a better understanding of the AI capability components at the individual and the organizational levels. A study that can help shape educational programs to respond to the AI market requirements.

Download Full-text

A Call to Action on Artificial Intelligence and Social Work Education: Lessons Learned from A Simulation Project Using Natural Language Processing

Journal of Teaching in Social Work ◽

10.1080/08841233.2020.1813234 ◽

2020 ◽

Vol 40 (5) ◽

pp. 501-518

Author(s):

Kenta Asakura ◽

Katherine Occhiuto ◽

Sarah Todd ◽

Cedar Leithead ◽

Robert Clapperton

Keyword(s):

Artificial Intelligence ◽

Natural Language Processing ◽

Social Work ◽

Natural Language ◽

Language Processing ◽

Social Work Education ◽

Lessons Learned ◽

Call To Action ◽

Work Education

Download Full-text

Research on the Application of NLP Artificial Intelligence Tools in University Natural Language Processing

IOP Conference Series Earth and Environmental Science ◽

10.1088/1755-1315/714/4/042018 ◽

2021 ◽

Vol 714 (4) ◽

pp. 042018

Author(s):

Aihong Yuan ◽

li Gao

Keyword(s):

Artificial Intelligence ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing

Download Full-text

Constructing Patent Maps Using Text Mining to Sustainably Detect Potential Technological Opportunities

Sustainability ◽

10.3390/su10103729 ◽

2018 ◽

Vol 10 (10) ◽

pp. 3729 ◽

Cited By ~ 5

Author(s):

Hei Wang ◽

Yung Chi ◽

Ping Hsin

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Rapid Development ◽

Word Sense Disambiguation ◽

Current Method ◽

Word Sense ◽

Competitive Advantages ◽

Patent Documents ◽

Technological Opportunities

With the advent of the knowledge economy, firms often compete for intellectual property rights. Being the first to acquire high-potential patents can assist firms in achieving future competitive advantages. To identify patents capable of being developed, firms often search for a focus by using existing patent documents. Because of the rapid development of technology, the number of patent documents is immense. A prominent topic among current firms is how to use this large number of patent documents to discover new business opportunities while avoiding conflicts with existing patents. In the search for technological opportunities, a crucial task is to present results in the form of an easily understood visualization. Currently, natural language processing can help in achieving this goal. In natural language processing, word sense disambiguation (WSD) is the problem of determining which “sense” (meaning) of a word is activated in a given context. Given a word and its possible senses, as defined by a dictionary, we classify the occurrence of a word in context into one or more of its sense classes. The features of the context (such as neighboring words) provide evidence for these classifications. The current method for patent document analysis warrants improvement in areas, such as the analysis of many dimensions and the development of recommendation methods. This study proposes a visualization method that supports semantics, reduces the number of dimensions formed by terms, and can easily be understood by users. Since polysemous words occur frequently in patent documents, we also propose a WSD method to decrease the calculated degrees of distortion between terms. An analysis of outlier distributions is used to construct a patent map capable of distinguishing similar patents. During the development of new strategies, the constructed patent map can assist firms in understanding patent distributions in commercial areas, thereby preventing patent infringement caused by the development of similar technologies. Subsequently, technological opportunities can be recommended according to the patent map, aiding firms in assessing relevant patents in commercial areas early and sustainably achieving future competitive advantages.

Download Full-text

A Brief Overview of Natural Language Processing and Artificial Intelligence

Natural Language Processing in Artificial Intelligence ◽

10.1201/9780367808495-8 ◽

2020 ◽

pp. 211-224

Author(s):

Sushree Bibhuprada B. Priyadarshini ◽

Amiya Bhusan Bagjadab ◽

Brojo Kishore Mishra

Keyword(s):

Artificial Intelligence ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing

Download Full-text

Wave2Vec: Vectorizing Electroencephalography Bio-Signal for Prediction of Brain Disease

International Journal of Environmental Research and Public Health ◽

10.3390/ijerph15081750 ◽

2018 ◽

Vol 15 (8) ◽

pp. 1750 ◽

Cited By ~ 4

Author(s):

Seonho Kim ◽

Jungjoon Kim ◽

Hong-Woo Chun

Keyword(s):

Artificial Intelligence ◽

Time Series ◽

Feature Selection ◽

Deep Learning ◽

Natural Language Processing ◽

Data Analysis ◽

Natural Language ◽

Real Number ◽

Real Time ◽

Language Processing

Interest in research involving health-medical information analysis based on artificial intelligence, especially for deep learning techniques, has recently been increasing. Most of the research in this field has been focused on searching for new knowledge for predicting and diagnosing disease by revealing the relation between disease and various information features of data. These features are extracted by analyzing various clinical pathology data, such as EHR (electronic health records), and academic literature using the techniques of data analysis, natural language processing, etc. However, still needed are more research and interest in applying the latest advanced artificial intelligence-based data analysis technique to bio-signal data, which are continuous physiological records, such as EEG (electroencephalography) and ECG (electrocardiogram). Unlike the other types of data, applying deep learning to bio-signal data, which is in the form of time series of real numbers, has many issues that need to be resolved in preprocessing, learning, and analysis. Such issues include leaving feature selection, learning parts that are black boxes, difficulties in recognizing and identifying effective features, high computational complexities, etc. In this paper, to solve these issues, we provide an encoding-based Wave2vec time series classifier model, which combines signal-processing and deep learning-based natural language processing techniques. To demonstrate its advantages, we provide the results of three experiments conducted with EEG data of the University of California Irvine, which are a real-world benchmark bio-signal dataset. After converting the bio-signals (in the form of waves), which are a real number time series, into a sequence of symbols or a sequence of wavelet patterns that are converted into symbols, through encoding, the proposed model vectorizes the symbols by learning the sequence using deep learning-based natural language processing. The models of each class can be constructed through learning from the vectorized wavelet patterns and training data. The implemented models can be used for prediction and diagnosis of diseases by classifying the new data. The proposed method enhanced data readability and intuition of feature selection and learning processes by converting the time series of real number data into sequences of symbols. In addition, it facilitates intuitive and easy recognition, and identification of influential patterns. Furthermore, real-time large-capacity data analysis is facilitated, which is essential in the development of real-time analysis diagnosis systems, by drastically reducing the complexity of calculation without deterioration of analysis performance by data simplification through the encoding process.

Download Full-text

KeyBoard-Less Online Shopping for the Visually Impaired Using Natural Language Processing and Face Recognition Mechanism

Smart Intelligent Computing and Applications - Smart Innovation, Systems and Technologies ◽

10.1007/978-981-32-9690-9_25 ◽

2019 ◽

pp. 253-260 ◽

Cited By ~ 2

Author(s):

Srija Rallabhandy ◽

Sireesha Rodda

Keyword(s):

Natural Language Processing ◽

Face Recognition ◽

Natural Language ◽

Language Processing ◽

Visually Impaired ◽

Online Shopping ◽

Recognition Mechanism

Download Full-text

Face Detection and Natural Language Processing System Using Artificial Intelligence

Lecture Notes in Networks and Systems - Inventive Communication and Computational Technologies ◽

10.1007/978-981-15-0146-3_73 ◽

2020 ◽

pp. 773-780

Author(s):

H. S. Avani ◽

Ayushi Turkar ◽

C. D. Divya

Keyword(s):

Artificial Intelligence ◽

Natural Language Processing ◽

Natural Language ◽

Face Detection ◽

Language Processing ◽

Processing System ◽

Natural Language Processing System

Download Full-text