Slovene and Croatian word embeddings in terms of gender occupational analogies

In recent years, the use of deep neural networks and dense vector embeddings for text representation have led to excellent results in the field of computational understanding of natural language. It has also been shown that word embeddings often capture gender, racial and other types of bias. The article focuses on evaluating Slovene and Croatian word embeddings in terms of gender bias using word analogy calculations. We compiled a list of masculine and feminine nouns for occupations in Slovene and evaluated the gender bias of fastText, word2vec and ELMo embeddings with different configurations and different approaches to analogy calculations. The lowest occupational gender bias was observed with the fastText embeddings. Similarly, we compared different fastText embeddings on Croatian occupational analogies.

Download Full-text

A Survey on Bias in Deep NLP

Applied Sciences ◽

10.3390/app11073184 ◽

2021 ◽

Vol 11 (7) ◽

pp. 3184

Author(s):

Ismael Garrido-Muñoz ◽

Arturo Montejo-Ráez ◽

Fernando Martínez-Santiago ◽

L. Alfonso Ureña-López

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Natural Language Processing ◽

Probability Distribution ◽

Natural Language ◽

Network Design ◽

Language Processing ◽

Deep Neural Networks ◽

Learning Processes ◽

Relevant Issue

Deep neural networks are hegemonic approaches to many machine learning areas, including natural language processing (NLP). Thanks to the availability of large corpora collections and the capability of deep architectures to shape internal language mechanisms in self-supervised learning processes (also known as “pre-training”), versatile and performing models are released continuously for every new network design. These networks, somehow, learn a probability distribution of words and relations across the training collection used, inheriting the potential flaws, inconsistencies and biases contained in such a collection. As pre-trained models have been found to be very useful approaches to transfer learning, dealing with bias has become a relevant issue in this new scenario. We introduce bias in a formal way and explore how it has been treated in several networks, in terms of detection and correction. In addition, available resources are identified and a strategy to deal with bias in deep NLP is proposed.

Download Full-text

Ontological Relation Classification Using WordNet, Word Embeddings and Deep Neural Networks

Modelling and Implementation of Complex Systems - Lecture Notes in Networks and Systems ◽

10.1007/978-3-030-58861-8_10 ◽

2020 ◽

pp. 136-148

Author(s):

Ahlem Chérifa Khadir ◽

Ahmed Guessoum ◽

Hassina Aliane

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Word Embeddings ◽

Relation Classification

Download Full-text

Improving the accuracy using pre-trained word embeddings on deep neural networks for Turkish text classification

Physica A Statistical Mechanics and its Applications ◽

10.1016/j.physa.2019.123288 ◽

2020 ◽

Vol 541 ◽

pp. 123288 ◽

Cited By ~ 3

Author(s):

Murat Aydoğan ◽

Ali Karci

Keyword(s):

Neural Networks ◽

Text Classification ◽

Deep Neural Networks ◽

Word Embeddings ◽

Turkish Text

Download Full-text

Empirical evaluation of multi-task learning in deep neural networks for natural language processing

Neural Computing and Applications ◽

10.1007/s00521-020-05268-w ◽

2020 ◽

Author(s):

Jianquan Li ◽

Xiaokang Liu ◽

Wenpeng Yin ◽

Min Yang ◽

Liqun Ma ◽

...

Keyword(s):

Neural Networks ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Deep Neural Networks ◽

Empirical Evaluation ◽

Task Learning

Download Full-text

Review Spam Detection Using Word Embeddings and Deep Neural Networks

IFIP Advances in Information and Communication Technology - Artificial Intelligence Applications and Innovations ◽

10.1007/978-3-030-19823-7_28 ◽

2019 ◽

pp. 340-350 ◽

Cited By ~ 3

Author(s):

Aliaksandr Barushka ◽

Petr Hajek

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Spam Detection ◽

Word Embeddings ◽

Review Spam

Download Full-text

Performance Comparison of Natural Language Processing Model Based on Deep Neural Networks

The Journal of Korean Institute of Communications and Information Sciences ◽

10.7840/kics.2019.44.7.1344 ◽

2019 ◽

Vol 44 (7) ◽

pp. 1344-1350

Author(s):

Taegyeom Lee ◽

Kyungseop Shin

Keyword(s):

Neural Networks ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Deep Neural Networks ◽

Performance Comparison ◽

Model Based

Download Full-text

Fake consumer review detection using deep neural networks integrating word embeddings and emotion mining

Neural Computing and Applications ◽

10.1007/s00521-020-04757-2 ◽

2020 ◽

Vol 32 (23) ◽

pp. 17259-17274 ◽

Cited By ~ 2

Author(s):

Petr Hajek ◽

Aliaksandr Barushka ◽

Michal Munk

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Word Embeddings ◽

Consumer Review ◽

Emotion Mining

Download Full-text

An Analysis of Machine Learning Algorithms and Deep Neural Networks for Email Spam Classification using Natural Language Processing

10.1109/soli54607.2021.9672398 ◽

2021 ◽

Author(s):

Md. Mohidul Hasan ◽

Syed Mahbubuz Zaman ◽

Md. Asif Talukdar ◽

Ayesha Siddika ◽

Md. Golam Rabiul Alam

Keyword(s):

Machine Learning ◽

Neural Networks ◽

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Deep Neural Networks ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Email Spam

Download Full-text

Learning Eligibility in Cancer Clinical Trials Using Deep Neural Networks

Applied Sciences ◽

10.3390/app8071206 ◽

2018 ◽

Vol 8 (7) ◽

pp. 1206 ◽

Cited By ~ 5

Author(s):

Aurelia Bustos ◽

Antonio Pertusa

Keyword(s):

Neural Networks ◽

Clinical Trials ◽

Deep Neural Networks ◽

Medical Knowledge ◽

Clinical Information ◽

Representation Learning ◽

Free Text ◽

Cancer Clinical Trials ◽

Word Embeddings ◽

New Treatments

Interventional cancer clinical trials are generally too restrictive, and some patients are often excluded on the basis of comorbidity, past or concomitant treatments, or the fact that they are over a certain age. The efficacy and safety of new treatments for patients with these characteristics are, therefore, not defined. In this work, we built a model to automatically predict whether short clinical statements were considered inclusion or exclusion criteria. We used protocols from cancer clinical trials that were available in public registries from the last 18 years to train word-embeddings, and we constructed a dataset of 6M short free-texts labeled as eligible or not eligible. A text classifier was trained using deep neural networks, with pre-trained word-embeddings as inputs, to predict whether or not short free-text statements describing clinical information were considered eligible. We additionally analyzed the semantic reasoning of the word-embedding representations obtained and were able to identify equivalent treatments for a type of tumor analogous with the drugs used to treat other tumors. We show that representation learning using deep neural networks can be successfully leveraged to extract the medical knowledge from clinical trial protocols for potentially assisting practitioners when prescribing treatments.

Download Full-text

Combining Word Embeddings and Deep Neural Networks for Job Offers and Resumes Classification in IT Recruitment Domain

International Journal of Advanced Computer Science and Applications ◽

10.14569/ijacsa.2021.0120774 ◽

2021 ◽

Vol 12 (7) ◽

Author(s):

Amine Habous ◽

El Habib Nfaoui

Keyword(s):

Neural Networks ◽

Deep Neural Networks ◽

Word Embeddings ◽

Job Offers

Download Full-text