scholarly journals Predicting Environmental Chemical Carcinogenicity using a Hybrid Machine-Learning Approach

2021 ◽  
Author(s):  
Sarita Limbu ◽  
Sivanesan Dakshanamurthy

Determining environmental chemical carcinogenicity is an urgent need as humans are increasingly exposed to these chemicals. In this study, we determined the carcinogenicity of wide variety real-life exposure chemicals in large scale. To determine chemical carcinogenicity, we have developed carcinogenicity prediction models based on the hybrid neural network (HNN) architecture. In the HNN model, we included new SMILES feature representation method, by modifying our previous 3D array representation of 1D SMILES simulated by the convolutional neural network (CNN). We used 653 molecular descriptors modeled by feed forward neural network (FFNN), and SMILES as chemical features to train the models. We have developed three types of machine learning models: binary classification models to predict chemical is a carcinogenic or non-carcinogenic, multiclass classification models to predict severity of the chemical carcinogenicity, and regression models to predict median toxic dose of the chemicals. Along with the hybrid neural network (HNN) model that we developed, Random Forest (RF), Bootstrap Aggregating (Bagging) and Adaptive Boosting (AdaBoost) methods were also used for binary and multiclass classification. Regression models were developed using HNN, RF, Support Vector Regressor (SVR), Gradient Boosting (GB), Kernel Ridge (KR), Decision Tree with AdaBoost (DT), KNeighbors (KN), and a consensus method. For binary classification, our HNN model predicted with an average accuracy of 74.33% and an average AUC of 0.806, for multiclass classification, the HNN model predicted with an average accuracy of 50.58% and an average micro-AUC of 0.68, and for regression model, the consensus method achieved R2 of 0.40. The predictive performance of our models based on a highly diverse chemicals is comparable to the literature reported models that included the similar and less diverse molecules. Our models can be used in identifying the potentially carcinogenic chemicals for a wide variety of chemical classes

Sentiment analysis, also known as Opinion Mining is one of the hottest topic Nowadays. in various social networking sites is one of the hottest topic and field nowadays. Here, we are using Twitter, the biggest web destinations for people to communicate with each other to perform the sentiment analysis and opinion mining by extracting the tweets by various users. The users can post brief text updates in twitter as it only allows 140 characters in one text message. Hashtags helps to search for tweets dealing with the specified subject. In previous researches, binary classification usually relies on the sentiment polarity(Positive , Negative and Neutral). The advantage is that multiple meaning of the same world might have different polarity, so it can be easily identified. In Multiclass classification, many tweets of one class are classified as if they belong to the others. The Neutral class presented the lowest precision in all the researches happened in this particular area. The set of tweets containing text and emoticon data will be classified into 13 classes. From each tweet, we extract different set of features using one hot encoding algorithm and use machine learning algorithms to perform classification. The entire tweets will be divided into training data sets and testing data sets. Training dataset will be pre-processed and classified using various Artificial Neural Network algorithms such as Reccurent Neural Network, Convolutional Neural Network etc. Moreover, the same procedure will be followed for the Text and Emoticon data. The developed model or system will be tested using the testing dataset. More precise and correct accuracy can be obtained or experienced using this multiclass classification of text and emoticons. 4 Key performance indicators will be used to evaluate the effectiveness of the corresponding approach.


2021 ◽  
Vol 4 (4) ◽  
pp. 299-310
Author(s):  
Vadim Yu. Skobtsov

The paper presents solutions to the actual problem of intelligent analysis of telemetry data from small satellites in order to detect its technical states. Neural network models based on modern deep learning architectures have been developed and investigated to solve the problem of binary classification of telemetry data. It makes possible to determine the normal and abnormal state of the small satellites or some of its subsystems. For the computer analysis, the data of the functioning of the small satellites navigation subsystem were used: a time series with a dimension of 121690 × 9. A comparative analysis was carried out of fully connected, onedimensional convolution and recurrent (GRU, LSTM) neural networks. We analyzed hybrid neural network models of various depths, which are sequential combinations of all three types of layers, including using the technology of adding residual connections of the ResNet family. Achieved results were compared with results of widespread neural network models AlexNet, LeNet, Inception, Xception, MobileNet, ResNet, and Yolo, modified for time series classification. The best result, in terms of classification accuracy at the stages of training, validation and testing, and the execution time of one training and validation epoch, were obtained by the developed hybrid neural network models of three types of layers: one-dimensional convolution, recurrent GRU and fully connected classification layers, using the technology of adding residual connections. In this case, the input data were normalized. The obtained classification accuracy at the training, validation and testing stages was 0.9821, 0.9665, 0.9690, respectively. The execution time of one learning and validation epoch was twelve seconds. At the same time, the modified Inception model showed the best alternative result in terms of accuracy: 0.9818, 0.9694, 0.9675. The execution time of one training and validation epoch was twenty seven seconds. That is, there was no increase in the classification accuracy when adapting the well-known neural network models used for image analysis. But the training and validation time in the case of the best Inception model increased by more than two times. Thus, proposed and analyzed hybrid neural network model showed the highest accuracy and minimum training and validation time in solving the considered problem according to compared with a number of developed and widely known and used deep neural network models.


Sensors ◽  
2020 ◽  
Vol 21 (1) ◽  
pp. 199
Author(s):  
Taekeun Hong ◽  
Jin-A Choi ◽  
Kiho Lim ◽  
Pankoo Kim

The classification and recommendation system for identifying social networking site (SNS) users’ interests plays a critical role in various industries, particularly advertising. Personalized advertisements help brands stand out from the clutter of online advertisements while enhancing relevance to consumers to generate favorable responses. Although most user interest classification studies have focused on textual data, the combined analysis of images and texts on user-generated posts can more precisely predict a consumer’s interests. Therefore, this research classifies SNS users’ interests by utilizing both texts and images. Consumers’ interests were defined using the Curlie directory, and various convolutional neural network (CNN)-based models and recurrent neural network (RNN)-based models were tested for our user interest classification system. In our hybrid neural network (NN) model, CNN-based classification models were used to classify images from users’ SNS postings while RNN-based classification models were used to classify textual data. The results of our extensive experiments show that the classification of users’ interests performed best when using texts and images together, at 96.55%, versus texts only, 41.38%, or images only, 93.1%. Our proposed system provides insights into personalized SNS advertising research and informs marketers on making (1) interest-based recommendations, (2) ranked-order recommendations, and (3) real-time recommendations.


2018 ◽  
Vol 38 (6) ◽  
Author(s):  
Binbin Wang ◽  
Li Xiao ◽  
Yang Liu ◽  
Jing Wang ◽  
Beihong Liu ◽  
...  

There is a disparity between the increasing application of digital retinal imaging to neonatal ocular screening and slowly growing number of pediatric ophthalmologists. Assistant tools that can automatically detect ocular disorders may be needed. In present study, we develop a deep convolutional neural network (DCNN) for automated classification and grading of retinal hemorrhage. We used 48,996 digital fundus images from 3770 newborns with retinal hemorrhage of different severity (grade 1, 2 and 3) and normal controls from a large cross-sectional investigation in China. The DCNN was trained for automated grading of retinal hemorrhage (multiclass classification problem: hemorrhage-free and grades 1, 2 and 3) and then validated for its performance level. The DCNN yielded an accuracy of 97.85 to 99.96%, and the area under the receiver operating characteristic curve was 0.989–1.000 in the binary classification of neonatal retinal hemorrhage (i.e., one classification vs. the others). The overall accuracy with regard to the multiclass classification problem was 97.44%. This is the first study to show that a DCNN can detect and grade neonatal retinal hemorrhage at high performance levels. Artificial intelligence will play more positive roles in ocular healthcare of newborns and children.


Author(s):  
P.L. Nikolaev

This article deals with method of binary classification of images with small text on them Classification is based on the fact that the text can have 2 directions – it can be positioned horizontally and read from left to right or it can be turned 180 degrees so the image must be rotated to read the sign. This type of text can be found on the covers of a variety of books, so in case of recognizing the covers, it is necessary first to determine the direction of the text before we will directly recognize it. The article suggests the development of a deep neural network for determination of the text position in the context of book covers recognizing. The results of training and testing of a convolutional neural network on synthetic data as well as the examples of the network functioning on the real data are presented.


Author(s):  
Navaamsini Boopalan ◽  
Agileswari K. Ramasamy ◽  
Farrukh Hafiz Nagi

Array sensors are widely used in various fields such as radar, wireless communications, autonomous vehicle applications, medical imaging, and astronomical observations fault diagnosis. Array signal processing is accomplished with a beam pattern which is produced by the signal's amplitude and phase at each element of array. The beam pattern can get rigorously distorted in case of failure of array element and effect its Signal to Noise Ratio (SNR) badly. This paper proposes on a Hybrid Neural Network layer weight Goal Attain Optimization (HNNGAO) method to generate a recovery beam pattern which closely resembles the original beam pattern with remaining elements in the array. The proposed HNNGAO method is compared with classic synthesize beam pattern goal attain method and failed beam pattern generated in MATLAB environment. The results obtained proves that the proposed HNNGAO method gives better SNR ratio with remaining working element in linear array compared to classic goal attain method alone. Keywords: Backpropagation; Feed-forward neural network; Goal attain; Neural networks; Radiation pattern; Sensor arrays; Sensor failure; Signal-to-Noise Ratio (SNR)


Sign in / Sign up

Export Citation Format

Share Document