scholarly journals Research on Maize Disease Recognition Method Based on Improved ResNet50

2021 ◽  
Vol 2021 ◽  
pp. 1-6
Author(s):  
Guowei Wang ◽  
Haiye Yu ◽  
Yuanyuan Sui

In order to solve the problem of accuracy and speed of disease identification in real-time spraying operation in maize field, an improved ResNet50 maize disease identification model was proposed. Firstly, this paper uses the Adam algorithm to optimize the model, adjusts the learning strategy through the inclined triangle learning rate, increases L2 regularization to reduce over fitting, and adopts exit strategy and ReLU incentive function. Then, the first convolution kernel of the ResNet50 model is modified into three 3 x 3 small convolution kernels. Finally, the ratio of training set to verification set is 3 : 1. Through experimental comparison, the recognition accuracy of the maize disease recognition model proposed in this paper is higher than that of other models. The image recognition accuracy in the data set is 98.52%, the image recognition accuracy in the farmland is 97.826%, and the average recognition speed is 204 ms, which meets the accuracy and speed requirements of maize field spraying operation and provides technical support for the research of maize field spraying equipment.

2011 ◽  
Vol 121-126 ◽  
pp. 2141-2145 ◽  
Author(s):  
Wei Gang Yan ◽  
Chang Jian Wang ◽  
Jin Guo

This paper proposes a new image segmentation algorithm to detect the flame image from video in enclosed compartment. In order to avoid the contamination of soot and water vapor, this method first employs the cubic root of four color channels to transform a RGB image to a pseudo-gray one. Then the latter is divided into many small stripes (child images) and OTSU is employed to perform child image segmentation. Lastly, these processed child images are reconstructed into a whole image. A computer program using OpenCV library is developed and the new method is compared with other commonly used methods such as edge detection and normal Otsu’s method. It is found that the new method has better performance in flame image recognition accuracy and can be used to obtain flame shape from experiment video with much noise.


2021 ◽  
Vol 39 (1B) ◽  
pp. 1-10
Author(s):  
Iman H. Hadi ◽  
Alia K. Abdul-Hassan

Speaker recognition depends on specific predefined steps. The most important steps are feature extraction and features matching. In addition, the category of the speaker voice features has an impact on the recognition process. The proposed speaker recognition makes use of biometric (voice) attributes to recognize the identity of the speaker. The long-term features were used such that maximum frequency, pitch and zero crossing rate (ZCR).  In features matching step, the fuzzy inner product was used between feature vectors to compute the matching value between a claimed speaker voice utterance and test voice utterances. The experiments implemented using (ELSDSR) data set. These experiments showed that the recognition accuracy is 100% when using text dependent speaker recognition.


Geofluids ◽  
2020 ◽  
Vol 2020 ◽  
pp. 1-11
Author(s):  
Dongsheng Wang ◽  
Jun Feng ◽  
Xinpeng Zhao ◽  
Yeping Bai ◽  
Yujie Wang ◽  
...  

It is difficult to form a method for recognizing the degree of infiltration of a tunnel lining. To solve this problem, we propose a recognition method by using a deep convolutional neural network. We carry out laboratory tests, prepare cement mortar specimens with different saturation levels, simulate different degrees of infiltration of tunnel concrete linings, and establish an infrared thermal image data set with different degrees of infiltration. Then, based on a deep learning method, the data set is trained using the Faster R-CNN+ResNet101 network, and a recognition model is established. The experiments show that the recognition model established by the deep learning method can be used to select cement mortar specimens with different degrees of infiltration by using an accurately minimized rectangular outer frame. This model shows that the classification recognition model for tunnel concrete lining infiltration established by the indoor experimental method has high recognition accuracy.


Author(s):  
F. ROLI

Recently, a kind of structured neural networks (SNNs) explicitly devoted to multisensor image recognition and aimed at allowing the interpretation of the "network behavior" was presented in Ref. 1. Experiments reported in Ref. 1 pointed out that SNNs provide a trade-off between recognition accuracy and interpretation of the network behavior. In this paper, the combination of multiple SNNs, each of which has been trained on the same data set, is proposed as a means to improve recognition results, while keeping the possibility of interpreting the network behavior. A simple method for interpreting the "collective behaviors" of such SNN ensembles is described. Such an interpretation method can be used to understand the different kinds of "solutions" learned by the SNNs belonging to an ensemble. In addition, as compared with the interpretation method presented in Ref. 1, it is shown that the knowledge embodied in an SNN can be translated into a set of understandable "recognition rules". Experimental results on the recognition of multisensor remote-sensing images (optical and radar images) are reported in terms of both recognition accuracy and network-behavior interpretation. An additional experiment on a multisource remote-sensing data set is described to show that SNNs can also be effectively used for multisource recognition tasks.


2020 ◽  
Author(s):  
dongshen ji ◽  
yanzhong zhao ◽  
zhujun zhang ◽  
qianchuan zhao

In view of the large demand for new coronary pneumonia covid19 image recognition samples,the recognition accuracy is not ideal.In this paper,a new coronary pneumonia positive image recognition method proposed based on small sample recognition. First, the CT image pictures are preprocessed, and the pictures are converted into the picture formats which are required for transfer learning. Secondly, perform small-sample image enhancement and expansion on the converted picture, such as miscut transformation, random rotation and translation, etc.. Then, multiple migration models are used to extract features and then perform feature fusion. Finally,the model is adjusted by fine-tuning.Then train the model to obtain experimental results. The experimental results show that our method has excellent recognition performance in the recognition of new coronary pneumonia images,even with only a small number of CT image samples.


2021 ◽  
Vol 271 ◽  
pp. 01039
Author(s):  
Dongsheng Ji ◽  
Yanzhong Zhao ◽  
Zhujun Zhang ◽  
Qianchuan Zhao

In view of the large demand for new coronary pneumonia covid19 image recognition samples, the recognition accuracy is not ideal. In this paper, a new coronary pneumonia positive image recognition method proposed based on small sample recognition. First, the CT image pictures are preprocessed, and the pictures are converted into the picture formats which are required for transfer learning. Secondly, small-sample image enhancement and extension are performed on the transformed image, such as staggered transformation, random rotation and translation, etc.. Then, multiple migration models are used to extract features and then perform feature fusion. Finally,the model is adjusted by fine-tuning. Then train the model to obtain experimental results. The experimental results show that our method has excellent recognition performance in the recognition of new coronary pneumonia images, even with only a small number of CT image samples.


2018 ◽  
Vol 176 ◽  
pp. 01035
Author(s):  
Jin Dai ◽  
Shuai Shao ◽  
Zu Wang ◽  
Xianjing Zhao

The traditional recognition method takes the low-level information of the image as the foundation. The image recognition center of gravity is biased towards the typical features, and achieves the effect of recognition by region-dependent segmentation. Because the general image segmentation is a regular rectangle, easily lead to the same target is divided into different sub-blocks, ignoring the image of the fuzzy part, so the image recognition is not complete. An image recognition algorithm based on threeway decision is proposed. It takes full advantage of effective information in the image, improving the image recognition accuracy. First, this method divided the image into three regions: positive region, negative region and delay decision region. Second, an iterative process is performed on the region of the delay decision. Final, image recognition is performed on the positive sample region. Based on the basic theory of the three-way decision, the more obvious the decision result is, the more iterations are, and the information is added to the classifier until the blurred part of image cannot be divided. Finally, to achieve the realize effective image recognition. This method simulates the process of human cognition effectively, and makes the utilization of the effective information reach the maximum in the recognition process. The results of the experimental analysis showed that the method is more concise and efficient, and the recognition accuracy is more accurate.


Sensors ◽  
2020 ◽  
Vol 20 (17) ◽  
pp. 4850
Author(s):  
Mingyu Gao ◽  
Chao Chen ◽  
Jie Shi ◽  
Chun Sing Lai ◽  
Yuxiang Yang ◽  
...  

Effective traffic sign recognition algorithms can assist drivers or automatic driving systems in detecting and recognizing traffic signs in real-time. This paper proposes a multiscale recognition method for traffic signs based on the Gaussian Mixture Model (GMM) and Category Quality Focal Loss (CQFL) to enhance recognition speed and recognition accuracy. Specifically, GMM is utilized to cluster the prior anchors, which are in favor of reducing the clustering error. Meanwhile, considering the most common issue in supervised learning (i.e., the imbalance of data set categories), the category proportion factor is introduced into Quality Focal Loss, which is referred to as CQFL. Furthermore, a five-scale recognition network with a prior anchor allocation strategy is designed for small target objects i.e., traffic sign recognition. Combining five existing tricks, the best speed and accuracy tradeoff on our data set (40.1% mAP and 15 FPS on a single 1080Ti GPU), can be achieved. The experimental results demonstrate that the proposed method is superior to the existing mainstream algorithms, in terms of recognition accuracy and recognition speed.


Circulation ◽  
2020 ◽  
Vol 142 (Suppl_3) ◽  
Author(s):  
Subrat Das ◽  
Matthew Epland ◽  
Jiang Yu ◽  
Ranjit Suri

Introduction: EKGs are the cornerstone of management in cardiovascular diseases. There have been multiple efforts to computerize the EKG interpretation with algorithms, which unfortunately are machine specific and proprietary. We propose the development of an image recognition model which can be used to read EKG strips (which use standard notations) and hence be used universally. Method: A convolutional neural network (CNN) was trained to classify 12-lead EKGs between seven clinically important diagnostic classes (Figure 1a). Pre-labeled EKG recordings (6-60s) from a publicly available data set on PhysioNet were used to construct the images. The EKG images displayed the 12 channel traces, of 2.5s each, on a consistent 4x3 grid at a resolution of 800x800 pixels (Figure 1a). The data set (23,336 images) was divided into training, tuning, and validation sets; containing 70%, 15%, and 15% of the images, respectively. An austere variation of the MobileNetV3 model was trained from the ground up on the labeled training set. Stochastic gradient descent (SGD) was used to minimize the cross-entropy loss. Training was halted when the tuning loss had not improved from its previous minimum by 0.05% over the past 10 epochs. Results: The model trained over 52 epochs of batches of 32 images. The model’s accuracy was tested using the validation set (which was not used for development of model) and reported as a confusion matrix (Figure 1b). The accuracy per class varies from 69-91%. Conclusion: We used a labeled dataset of EKG images to develop a CNN model to predict seven different diagnostic classes with good accuracy. This is a novel approach to EKG interpretation as an image recognition problem and thus generates the ability to create diagnostic algorithms that are not dependent on proprietary voltage signals generated by commercial EKG machines. With the addition of more images to the data set and higher computing power we are confident that we can achieve enhanced accuracy.


2021 ◽  
Vol 2021 ◽  
pp. 1-10
Author(s):  
Xiaoying Shen ◽  
Chao Yuan

With the development of the live broadcast industry, security issues in the live broadcast process have become increasingly apparent. At present, the supervision of various live broadcast platforms is basically in a state of human supervision. Manpower supervision is mainly through user reporting and platform supervision measures. However, there are a large number of live broadcast rooms at the same time, and only relying on human supervision can no longer meet the monitoring needs of live broadcasts. Based on this situation, this study proposes a violation information recognition method of a live-broadcasting platform based on machine learning technology. By analyzing the similarities and differences between normal live broadcasts and violation live broadcasts, combined with the characteristics of violation image data, this study mainly detects human skin color and sensitive parts. A prominent feature of violation images is that they contain a large area of naked skin, and the ratio of the area of naked skin to the overall image area of the violation image will exceed the threshold. Skin color recognition plays a role in initial target positioning. The accuracy of skin color recognition is directly related to the recognition accuracy of the entire system, so skin color recognition is the most important part of violation information recognition. Although there are many effective skin color recognition technologies, the accuracy and stability of skin color recognition still need to be improved due to the influence of various external factors, such as light intensity, light source color, and physical equipment. When it is detected that the area of the skin color in the live screen exceeds the threshold, it is preliminarily determined to be a suspected violation video. In order to improve the recognition accuracy, it is necessary to detect sensitive parts of the suspected video. Naked female breasts are a very obvious feature in violation images. This study uses a chest feature extraction method to detect the chest in the image. When the recognition result is a violation image, it is determined that the live broadcast involves violation content. The machine learning algorithm is simple to implement, and the parameters are easy to adjust. The classifier training requires a short time and is suitable for live violation information recognition scenarios. The experimental results on the adopted data set show that the method used in this article can effectively detect videos with violation content. The recognition rate is as high as 85.98%, which is suitable for a real-life environment and has good practical significance.


Sign in / Sign up

Export Citation Format

Share Document