scholarly journals Pneumonia Classification of Thorax Images using Convolutional Neural Networks

Jurnal INFORM ◽  
2020 ◽  
Vol 5 (2) ◽  
pp. 62
Author(s):  
Mahmud Suyuti ◽  
Endang Setyati

The digital image processing technique is a product of computing technology development. Medical image data processing based on a computer is a product of computing technology development that can help a doctor to diagnose and observe a patient. This study aimed to perform classification on the image of the thorax by using Convolutional Neural Network (CNN).  The data used in this study is lung thorax images that have previously been diagnosed by a doctor with two classes, namely normal and pneumonia. The amount of data is 2.200, 1.760 for training, and 440 for testing. Three stages are used in image processing, namely scaling, gray scaling, and scratching. This study used Convolutional Neural Network (CNN) method with architecture ResNet-50. In the field of object recognition, CNN is the best method because it has the advantage of being able to find its features of the object image by conducting the convolution process during training. CNN has several models or architectures; one of them is ResNet-50 or Residual Network. The selection of ResNet-50 architecture in this study aimed to reduce the loss of gradients at certain network-level depths during training because the object is a chest image of X-Ray that has a high level of visual similarity between some pathology. Moreover, several visual factors also affect the image so that to produce good accuracy requires a certain level of depth on the CNN network. Optimization during training used Adaptive Momentum (Adam) because it had a bias correction technique that provided better approximations to improve accuracy. The results of this study indicated the thorax image classification with an accuracy of 97.73%.

Jurnal INFORM ◽  
2020 ◽  
Vol 5 (2) ◽  
pp. 62-68
Author(s):  
Mahmud Suyuti ◽  
Endang Setyati

The digital image processing technique is a product of computing technology development. Medical image data processing based on a computer is a product of computing technology development that can help a doctor to diagnose and observe a patient. This study aimed to perform classification on the image of the thorax by using Convolutional Neural Network (CNN).  The data used in this study is lung thorax images that have previously been diagnosed by a doctor with two classes, namely normal and pneumonia. The amount of data is 2.200, 1.760 for training, and 440 for testing. Three stages are used in image processing, namely scaling, gray scaling, and scratching. This study used Convolutional Neural Network (CNN) method with architecture ResNet-50. In the field of object recognition, CNN is the best method because it has the advantage of being able to find its features of the object image by conducting the convolution process during training. CNN has several models or architectures; one of them is ResNet-50 or Residual Network. The selection of ResNet-50 architecture in this study aimed to reduce the loss of gradients at certain network-level depths during training because the object is a chest image of X-Ray that has a high level of visual similarity between some pathology. Moreover, several visual factors also affect the image so that to produce good accuracy requires a certain level of depth on the CNN network. Optimization during training used Adaptive Momentum (Adam) because it had a bias correction technique that provided better approximations to improve accuracy. The results of this study indicated the thorax image classification with an accuracy of 97.73%.


Author(s):  
Naureen Fathima

Abstract: Glaucoma is a disease that relates to the vision of human eye,Glaucoma is a disease that affects the human eye's vision. This sickness is regarded as an irreversible condition that causes eyesight degeneration. One of the most common causes of lifelong blindness is glaucoma in persons over the age of 40. Because of its trade-off between portability, size, and cost, fundus imaging is the most often utilised screening tool for glaucoma detection. Fundus imaging is a two-dimensional (2D) depiction of the three-dimensional (3D), semitransparent retinal tissues projected on to the imaging plane using reflected light. The idea plane that depicts the physical display screen through which a user perceives a virtual 3D scene is referred to as the "image plane”. The bulk of current algorithms for autonomous glaucoma assessment using fundus images rely on handcrafted segmentation-based features, which are influenced by the segmentation method used and the retrieved features. Convolutional neural networks (CNNs) are known for, among other things, their ability to learn highly discriminative features from raw pixel intensities. This work describes a computational technique for detecting glaucoma automatically. The major goal is to use a "image processing technique" to diagnose glaucoma using a fundus image as input. It trains datasets using a convolutional neural network (CNN). The Watershed algorithm is used for segmentation and is the most widely used technique in image processing. The following image processing processes are performed: region of interest, morphological procedures, and segmentation. This technique can be used to determine whether or not a person has Glaucoma. Keywords: Recommender system, item-based collaborative filtering, Natural Language Processing, Deep learning.


Lepidopterology is a branch of entomology concerning the scientific study of moths and the three superfamilies of butterflies. The project aims to help biology students in identifying butterfly without harming the insect. In the studies of lepidopterology, the students normally need to capture the butterflies with nets and dissect the insect to identify its family types. Computer vision is a study on how computers can be used to make high-level comprehension from the input of digital image and videos. By utilizing the latest Image Processing technique, it can identify the correct species of butterfly with high accuracy by using layers of node in a Convolutional Neural Network (CNN). The work process starts with data acquisition (mining the butterfly image automatically from google image search), pre-processing (converting image format and rotation), analyzing and understanding digital images (group images into folders), and to make assumptions of the high complication data from the real world in the process of producing numerical information that can be comprehend by machines in order to form conclusions. Benefits of using CNN is to reduce the need for human and physical intervention in identifying each of the butterfly characters. This makes it easier to expand the database in the future. The image is acquired using Fatkun Batch Downloader to download large number of images. The project is develop using Tensorflow in Ubuntu operating system and interface is in HTML connected to the Python script via Flask. The results of the experiment show that CNN can identify with 92.7 percent of final accuracy with learning saturation (overfitting) of 500 cycle. While testing results shows 62.5 percent of accuracy in predicting new datasets.


Entropy ◽  
2021 ◽  
Vol 23 (7) ◽  
pp. 816
Author(s):  
Pingping Liu ◽  
Xiaokang Yang ◽  
Baixin Jin ◽  
Qiuzhan Zhou

Diabetic retinopathy (DR) is a common complication of diabetes mellitus (DM), and it is necessary to diagnose DR in the early stages of treatment. With the rapid development of convolutional neural networks in the field of image processing, deep learning methods have achieved great success in the field of medical image processing. Various medical lesion detection systems have been proposed to detect fundus lesions. At present, in the image classification process of diabetic retinopathy, the fine-grained properties of the diseased image are ignored and most of the retinopathy image data sets have serious uneven distribution problems, which limits the ability of the network to predict the classification of lesions to a large extent. We propose a new non-homologous bilinear pooling convolutional neural network model and combine it with the attention mechanism to further improve the network’s ability to extract specific features of the image. The experimental results show that, compared with the most popular fundus image classification models, the network model we proposed can greatly improve the prediction accuracy of the network while maintaining computational efficiency.


2017 ◽  
Author(s):  
Febus Reidj G. Cruz ◽  
Dionis A. Padilla ◽  
Carlos C. Hortinela ◽  
Krissel C. Bucog ◽  
Mildred C. Sarto ◽  
...  

2012 ◽  
Vol 433-440 ◽  
pp. 727-732
Author(s):  
Anton Satria Prabuwono ◽  
Siti Rahayu Zulkipli ◽  
Doli Anggia Harahap ◽  
Wendi Usino ◽  
A. Hasniaty

Image processing is widely used in various fields of study including manufacturing as product inspection. In compact disc manufacturing, image processing has been implemented to recognize defect products. In this research, we implemented image processing technique as pre-processing processes. The aim is to acquire simple image to be processed and analyzed. In order to express the object from the image, the features were extracted using Invariant Moment (IM). Afterward, neural network was used to train the input from IM’s results. Thus, decision can be made whether the compact disc is accepted or rejected based on the training. Two experiments have been done in this research to evaluate 40 datasets of good and defective images of compact discs. The result shows that accuracy rate increased and can identify the quality of compact discs based on neural network training.


Author(s):  
Eko Yudhi Prastowo

Until now, wood has an irreplaceable function. Building materials, shipping, furniture, sports equipment, carvings and handicrafts using wood. Indonesia has more than 4,000 types of wood, so choosing the right wood is a challenge because choosing the wrong type of wood can make the quality of processed products decline and not as expected. In addition, proper identification of timber can also prevent illegal logging, especially on certain types of wood which are now increasingly scarce. Recognition to wood by looking directly is a difficult thing for ordinary people to do and can only be done by a wood expert, so it is necessary to find a method of recognizing wood that can be used by people independently. One method that can be used to identify type of wood is image processing based on characteristics of wood which include color, fiber direction and texture. This paper will describe recognition of wood-based image processing using Convolutional Neural Network (CNN) method. This method is derived from Neural Networks with addition of an extraction layer feature, which can reduce free parameters that are not needed by the system. Wood image data used in this study are four types of wood that are often used as raw materials for making houses and furniture, namely Glugu, Teak, Sengon and Waru. Results of this study were able to recognize four types of wood with an accuracy of 95% in 600 epochs/iteration, so that it can be used as a simple, easy and inexpensive wood recognition system.


Author(s):  
Kedsara Rakpongsiri

The objective of this research is to develop an instrument to measure eye fatigue and reaction time for decision making. The data were automatically analyzed by the iRIS-RT program designed to measure the contraction or expansion of pupils by means of image processing in order to analyze changes of pupil diameters. The image processing technique is a set of images with continuous and different temporal intervals. The images are then transferred into a computer, stored and analyzed to obtain changes of pupil diameters. Pupil diameter of each image in each time interval is measured and frequency of blinking is detected and displayed on the computer screen in an attempt to analyze eye fatigue and to design the assessment of reaction time. The test program is designed with red, green and blue windows that randomly changes according to the set time in order to determine reaction to the colors and accuracy of decision to select the colors that correspond to those on the test program. The test results with the iRIS-RT program among 40 male and female volunteer participants aged between 18 and 35years reveal that the program is able to preliminarily measure and process eye fatigue. Furthermore, it is able to store and record personal results in the computer and Cloud media that are accessible via global online computer and internet systems. The results can be displayed on personal computers and other mobile devices. With regard to satisfaction of the participants with operations of the device, it is found that the satisfaction was at a high level, or 66.7%, and at the highest level, or 33.3%, on the overall efficiency of the device. From interviewing the experts after the construction of the device on its performance, attributes, size, safety, installation and result display, their satisfaction was at a high level.


Author(s):  
Poonam Yerpude

Abstract: Communication is very imperative for daily life. Normal people use verbal language for communication while people with disabilities use sign language for communication. Sign language is a way of communicating by using the hand gestures and parts of the body instead of speaking and listening. As not all people are familiar with sign language, there lies a language barrier. There has been much research in this field to remove this barrier. There are mainly 2 ways in which we can convert the sign language into speech or text to close the gap, i.e. , Sensor based technique,and Image processing. In this paper we will have a look at the Image processing technique, for which we will be using the Convolutional Neural Network (CNN). So, we have built a sign detector, which will recognise the sign numbers from 1 to 10. It can be easily extended to recognise other hand gestures including alphabets (A- Z) and expressions. We are creating this model based on Indian Sign Language(ISL). Keywords: Multi Level Perceptron (MLP), Convolutional Neural Network (CNN), Indian Sign Language(ISL), Region of interest(ROI), Artificial Neural Network(ANN), VGG 16(CNN vision architecture model), SGD(Stochastic Gradient Descent).


Mekatronika ◽  
2021 ◽  
Vol 3 (1) ◽  
pp. 28-34
Author(s):  
Azmin Raziq Rizaman ◽  
Hazlina Selamat ◽  
Nurulaqilla Khamis

Analogue meter is a device that has been widely used in a various industry to monitor and obtain the reading of the measurement. Based on the conventional approach, the meter reading will be done continuously by the meter reader that might cause high tendency of human error during the observation. To minimize this fallacy, this approach taken in this paper enables the automation of this the process by obtaining the reading from an analogue meter using an image processing technique and send the output to the central database for further processing. By implementing this approach, observation efficacy can be improved. This paper describes the process on how to obtain the digitized reading of an analogue meter using images captured by a camera. The images are then processed using an image processing method and the Convolutional Neural Network (CNN) is used to determine the reading of the meter. Data is then sent to the MySQL database, as this approach was easily implemented and managed either on-premises or via the cloud. The use case in this study was based on the analogue meter for domestic electricity supply in Malaysia and results show that the meter reading can accurately be recognized using the proposed approach.


Sign in / Sign up

Export Citation Format

Share Document