Combining citizen science and deep learning to amplify expertise in neuroimaging

Abstract: Research in many fields has become increasingly reliant on large and complex datasets. “Big Data” holds untold promise to rapidly advance science by tackling new questions that cannot be answered with smaller datasets. While powerful, research with Big Data poses unique challenges, as many standard lab protocols rely on experts examining each one of the samples. This is not feasible for large-scale datasets because manual approaches are time-consuming and hence difficult to scale. Meanwhile, automated approaches lack the accuracy of examination by highly trained scientists, and this may introduce major errors, sources of noise, and unforeseen biases into these large and complex datasets. Our proposed solution is to 1) start with a small, expertly labelled dataset, 2) amplify labels through web-based tools that engage citizen scientists, and 3) train machine learning on amplified labels to emulate expert decision making. As a proof of concept, we developed a system to quality control a large dataset of three-dimensional magnetic resonance images (MRI) of human brains. An initial dataset of 200 brain images labeled by experts was amplified by citizen scientists to label 722 brains, with over 80,000 ratings done through a simple web interface. A deep learning algorithm was then trained to predict data quality, based on a combination of the citizen scientist labels that accounts for differences in the quality of classification by different citizen scientists. In an ROC analysis (on left-out test data), the deep learning network performed as well as a state-of-the-art, specialized algorithm (MRIQC) for quality control of T1-weighted images, each with an area under the curve of 0.99. Finally, as a specific practical application of the method, we explore how brain image quality relates to the replicability of a well-established relationship between brain volume and age over development.
Combining citizen science and deep learning can generalize and scale expert decision making; this is particularly important in emerging disciplines where specialized, automated tools do not already exist.
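
The label amplification and ROC evaluation described in the abstract can be illustrated with a minimal sketch. The abstract does not say how the citizen scientist ratings are combined, so the accuracy-based weighting below is an assumption chosen for simplicity, not the authors' method; the function names `rater_weights`, `aggregate`, and `roc_auc` are hypothetical.

```python
def rater_weights(ratings, gold):
    """Weight each rater by accuracy on expert-labelled (gold) images.

    ratings: {rater: {image_id: 0 or 1}}; gold: {image_id: 0 or 1}.
    Accuracy weighting is an illustrative assumption, not the paper's method.
    """
    weights = {}
    for rater, votes in ratings.items():
        scored = [img for img in votes if img in gold]
        if not scored:
            weights[rater] = 0.5  # no gold overlap: assume chance-level weight
            continue
        correct = sum(votes[img] == gold[img] for img in scored)
        weights[rater] = correct / len(scored)
    return weights


def aggregate(ratings, weights):
    """Weighted mean vote per image, a value in [0, 1]."""
    num, den = {}, {}
    for rater, votes in ratings.items():
        w = weights[rater]
        for img, vote in votes.items():
            num[img] = num.get(img, 0.0) + w * vote
            den[img] = den.get(img, 0.0) + w
    return {img: num[img] / den[img] for img in num if den[img] > 0}


def roc_auc(scores, labels):
    """AUC as the probability that a positive outscores a negative.

    Ties count as half a win; assumes both classes are present.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))
```

In this sketch the aggregated scores would serve as training targets for the downstream network, and `roc_auc` would be applied to the network's predictions on held-out expert labels.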


Active Learning Plus Deep Learning Can Establish Cost-Effective and Robust Model for Multichannel Image: A Case on Hyperspectral Image Classification

Sensors ◽

10.3390/s20174975 ◽

2020 ◽

Vol 20 (17) ◽

pp. 4975

Author(s):

Fangyu Shi ◽

Zhaodi Wang ◽

Menghan Hu ◽

Guangtao Zhai

Keyword(s):

Deep Learning ◽

Active Learning ◽

Image Classification ◽

Large Scale ◽

Hyperspectral Image ◽

Image Annotation ◽

Learning Algorithm ◽

Magnetic Resonance Images ◽

Biological Engineering ◽

Hyperspectral Image Classification

Relying on large-scale labeled datasets, deep learning has achieved good performance in image classification tasks. In agricultural and biological engineering, image annotation is time-consuming and expensive. It also requires annotators to have technical skills in specific areas. Obtaining the ground truth is difficult because natural images are expensive. In addition, images in these areas are usually stored as multichannel images, such as computed tomography (CT) images, magnetic resonance images (MRI), and hyperspectral images (HSI). In this paper, we present a framework using active learning and deep learning for multichannel image classification. We use three active learning algorithms, namely least confidence, margin sampling, and entropy, as the selection criteria. Based on this framework, we further introduce an “image pool” to take full advantage of images generated by data augmentation. To demonstrate the applicability of the proposed framework, we present a case study on agricultural hyperspectral image classification. The results show that the proposed framework achieves better performance compared with the deep learning model. Manual annotation of the whole training set achieves an encouraging accuracy. In comparison, using the entropy active learning algorithm with the image pool achieves a similar accuracy with only part of the training set manually annotated. In practical applications, the proposed framework can remarkably reduce labeling effort during model development and updating, and can be applied to multichannel image classification in agricultural and biological engineering.
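
The three selection criteria named in the abstract have standard definitions over a model's predicted class probabilities. The following is a minimal sketch of those definitions, not the authors' implementation; the function `select_for_labeling` and the example pool are hypothetical.

```python
import math


def least_confidence(probs):
    """1 minus the top class probability; higher means more uncertain."""
    return 1.0 - max(probs)


def margin(probs):
    """Gap between the two most likely classes; smaller means more uncertain."""
    top, second = sorted(probs, reverse=True)[:2]
    return top - second


def entropy(probs):
    """Shannon entropy of the prediction; higher means more uncertain."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def select_for_labeling(pool_probs, k, criterion="entropy"):
    """Return indices of the k most uncertain unlabeled samples."""
    if criterion == "least_confidence":
        score = least_confidence
    elif criterion == "margin":
        score = lambda p: -margin(p)  # negate so larger = more uncertain
    elif criterion == "entropy":
        score = entropy
    else:
        raise ValueError(f"unknown criterion: {criterion}")
    ranked = sorted(range(len(pool_probs)),
                    key=lambda i: score(pool_probs[i]), reverse=True)
    return ranked[:k]


# Hypothetical softmax outputs for three unlabeled images.
pool = [[0.98, 0.01, 0.01], [0.40, 0.35, 0.25], [0.55, 0.44, 0.01]]
print(select_for_labeling(pool, 2, "entropy"))
```

In an active learning loop, the selected samples would be sent for manual annotation, added to the training set, and the model retrained before the next round of selection.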


Automatic detection of anteriorly displaced temporomandibular joint discs on magnetic resonance images using a deep learning algorithm

Dentomaxillofacial Radiology ◽

10.1259/dmfr.20210341 ◽

2021 ◽

Author(s):

Bolun Lin ◽

Mosha Cheng ◽

Shuze Wang ◽

Fulong Li ◽

Qing Zhou

Keyword(s):

Deep Learning ◽

Magnetic Resonance ◽

Temporomandibular Joint ◽

Diagnostic Criteria ◽

Orthodontic Treatment ◽

Data Augmentation ◽

Learning Algorithm ◽

Area Under The Curve ◽

Magnetic Resonance Images ◽

Learning Models

Objectives: This study aimed to develop models that can automatically detect anterior disc displacement (ADD) of the temporomandibular joint (TMJ) on magnetic resonance images (MRI) before orthodontic treatment to reduce the risk of developing serious complications after treatment. Methods: We used 9009 sagittal MRI of the TMJ as input and constructed three sets of deep learning models to detect ADD automatically. Deep learning models were developed using a convolutional neural network (CNN) based on the ResNet architecture and the “ImageNet” database. Five-fold cross-validation, oversampling, and data augmentation techniques were applied to reduce the risk of overfitting the model. The accuracy and area under the curve (AUC) of the three models were compared. Results: The performance of the maximum open mouth position model was excellent, with accuracy and AUC of 0.970 (±0.007) and 0.990 (±0.005), respectively. For the closed mouth position models, the accuracy and AUC of diagnostic criterion one were 0.863 (±0.008) and 0.922 (±0.009), respectively, significantly higher than those of diagnostic criterion two, with an accuracy of 0.839 (±0.013) (p = 0.009) and an AUC of 0.885 (±0.018) (p = 0.003). The classification activation heat map also improved our understanding of the models and visually displayed the areas that play a key role in the model recognition process. Conclusion: Our CNN model resulted in high accuracy and AUC in detecting ADD and can therefore potentially be used by clinicians to assess ADD before orthodontic treatment and hence improve treatment outcomes.


Multi Disease-Prediction Framework Using Hybrid Deep Learning: An Optimal Prediction Model (Preprint)

10.2196/preprints.22865 ◽

2020 ◽

Author(s):

Anusha Ampavathi ◽

Vijaya Saradhi T

Keyword(s):

Feature Extraction ◽

Big Data ◽

Deep Learning ◽

Weight Function ◽

Optimization Algorithm ◽

Large Scale ◽

Heuristic Algorithms ◽

Disease Prediction ◽

Health Care Decisions ◽

Proposed Model

Big data and its approaches are generally helpful in the healthcare and biomedical sectors for predicting disease. For trivial symptoms, it is difficult to meet doctors in the hospital at any time. Thus, big data provides essential data regarding diseases on the basis of the patient’s symptoms. For several medical organizations, disease prediction is important for making the best feasible health care decisions. Conversely, the conventional medical care model offers structured input that requires more accurate and consistent prediction. This paper aims to develop multi-disease prediction using the improvised deep learning concept. Here, different datasets pertaining to “Diabetes, Hepatitis, lung cancer, liver tumor, heart disease, Parkinson’s disease, and Alzheimer’s disease” are gathered from the benchmark UCI repository for conducting the experiment. The proposed model involves three phases: (a) data normalization, (b) weighted normalized feature extraction, and (c) prediction. Initially, the dataset is normalized in order to bring the attributes' range to a certain level. Further, weighted feature extraction is performed, in which a weight function is multiplied with each attribute value to produce large-scale deviation. Here, the weight function is optimized using the combination of two meta-heuristic algorithms, termed the Jaya Algorithm-based Multi-Verse Optimization algorithm (JA-MVO). The optimally extracted features are subjected to hybrid deep learning algorithms, namely “Deep Belief Network (DBN) and Recurrent Neural Network (RNN)”. As a modification to the hybrid deep learning architecture, the weights of both DBN and RNN are optimized using the same hybrid optimization algorithm. Further, the comparative evaluation of the proposed prediction against the existing models certifies its effectiveness through various performance measures.
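
The first two phases, normalization followed by weighted feature extraction, can be sketched as follows. The abstract does not state which normalization is used, so min-max scaling is an assumption here, and the fixed weight vector is a placeholder for the weights that the paper obtains from JA-MVO optimization (not reproduced). Both function names are hypothetical.

```python
def min_max_normalize(rows):
    """Scale each attribute (column) to [0, 1].

    Min-max scaling is an assumption; the abstract only says the
    attribute ranges are brought to "a certain level".
    """
    cols = list(zip(*rows))
    lows = [min(c) for c in cols]
    highs = [max(c) for c in cols]
    return [
        [(v - lo) / (hi - lo) if hi > lo else 0.0  # constant column -> 0
         for v, lo, hi in zip(row, lows, highs)]
        for row in rows
    ]


def apply_weights(rows, weights):
    """Multiply each attribute value by its per-attribute weight."""
    return [[w * v for w, v in zip(weights, row)] for row in rows]


# Toy data: three records with two attributes each.
records = [[1.0, 10.0], [3.0, 10.0], [5.0, 10.0]]
placeholder_weights = [2.0, 3.0]  # stand-in for optimized weights
features = apply_weights(min_max_normalize(records), placeholder_weights)
print(features)
```

The weighted features would then be passed to the prediction phase; in the paper that phase is the hybrid DBN and RNN model.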


Circular supply chain management with large scale group decision making in the big data era: The macro-micro model

Technological Forecasting and Social Change ◽

10.1016/j.techfore.2021.120791 ◽

2021 ◽

Vol 169 ◽

pp. 120791

Author(s):

Tsan-Ming Choi ◽

Yue Chen

Keyword(s):

Decision Making ◽

Big Data ◽

Supply Chain ◽

Supply Chain Management ◽

Group Decision Making ◽

Large Scale ◽

Group Decision ◽

Micro Model ◽

Chain Management


Research on the Design of Government Affairs Platform in the Context of Big Data

Scientific Programming ◽

10.1155/2021/9936217 ◽

2021 ◽

Vol 2021 ◽

pp. 1-13

Author(s):

Qian Huang ◽

Xue Wen Li

Keyword(s):

Big Data ◽

Deep Learning ◽

Learning Algorithm ◽

The Internet ◽

Chinese Government ◽

Development Status ◽

Deep Learning Algorithm ◽

Government Management ◽

The Government ◽

The Impact

Big data is a massive and diverse form of unstructured data, which needs proper analysis and management. It is another great technological revolution after the Internet, the Internet of Things, and cloud computing. This paper firstly studies the related concepts and basic theories as the origin of the research. Secondly, it analyzes in depth the problems and challenges faced by Chinese government management under the impact of big data. Thirdly, we explore the opportunities that big data brings to government management in terms of management efficiency, administrative capacity, and public services, and believe that governments should seize these opportunities to make changes. Brain-like computing attempts to simulate the structure and information processing of biological neural networks. This paper analyzes the development status of e-government at home and abroad, studies service-oriented architecture (SOA) and web services technology, studies e-government and SOA theory in depth, and discusses them based on the development status of e-government in a certain region. Then, a deep learning algorithm is used to construct a monitoring platform that monitors government behavior in real time, and to conduct in-depth mining to analyze the government's intended behavior.


Research on Efficient Deep Learning Algorithm Based on ShuffleGhost in the Field of Virtual Reality

Wireless Communications and Mobile Computing ◽

10.1155/2021/1382781 ◽

2021 ◽

Vol 2021 ◽

pp. 1-11

Author(s):

Bangtong Huang ◽

Hongquan Zhang ◽

Zihong Chen ◽

Lingling Li ◽

Lihua Shi

Keyword(s):

Virtual Reality ◽

Deep Learning ◽

Large Scale ◽

Learning Algorithm ◽

Feature Maps ◽

Embedded Devices ◽

Feature Map ◽

Deep Learning Algorithm ◽

Proper Design ◽

The Cost

Deep learning algorithms face limitations in virtual reality applications due to the cost of memory and computation and the need for real-time computation. High-performing models might suffer from enormous parameter counts and large-scale structure, and it would be hard to port them onto embedded devices. In this paper, inspired by GhostNet, we propose an efficient structure, ShuffleGhost, that makes use of the redundancy in feature maps to reduce the cost of computation and tackles some drawbacks of GhostNet. GhostNet suffers from the high computation cost of convolution in the Ghost module and shortcut, and the restriction of downsampling makes it more difficult to apply the Ghost module and Ghost bottleneck to other backbones. This paper proposes three new kinds of ShuffleGhost structure to tackle these drawbacks. The ShuffleGhost module and ShuffleGhost bottlenecks utilize the shuffle layer and group convolution from ShuffleNet, and they are designed to redistribute the feature maps concatenated from the Ghost Feature Map and Primary Feature Map. Besides, they eliminate the gap between them and extract the features. Then, an SENet layer is adopted to reduce the computation cost of group convolution, as well as to evaluate the importance of the feature maps that are concatenated from the Ghost Feature Maps and Primary Feature Maps and to give proper weights to the feature maps. Our experiments show that ShuffleGhostV3 has fewer trainable parameters and FLOPs while preserving accuracy. With proper design, it could also be more efficient on both GPU and CPU.
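
The shuffle layer borrowed from ShuffleNet is a fixed permutation of channels: view the channel axis as (groups, channels per group), transpose, and flatten. The sketch below shows that permutation on a plain list of channel labels. How ShuffleGhost wires it around the Ghost module is not specified in the abstract, so the primary/ghost example is only illustrative, and `channel_shuffle` is a hypothetical name.

```python
def channel_shuffle(channels, groups):
    """ShuffleNet-style channel shuffle on a list of channels.

    Equivalent to reshaping the channel axis to (groups, per_group),
    transposing to (per_group, groups), and flattening.
    """
    n = len(channels)
    if n % groups != 0:
        raise ValueError("channel count must be divisible by groups")
    per_group = n // groups
    return [channels[g * per_group + i]
            for i in range(per_group)
            for g in range(groups)]


# Illustrative only: primary maps concatenated with ghost maps, 2 groups.
primary = ["p0", "p1", "p2"]
ghost = ["g0", "g1", "g2"]
print(channel_shuffle(primary + ghost, groups=2))
# ['p0', 'g0', 'p1', 'g1', 'p2', 'g2']
```

With two groups, the shuffle interleaves the two halves of the concatenation, so a following group convolution sees both primary and ghost channels in every group.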


A Systematic Analysis of Big Image Data Methodologies in Various Applications

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.e2307.039520 ◽

2020 ◽

Vol 9 (5) ◽

pp. 483-487

Keyword(s):

Big Data ◽

Deep Learning ◽

Large Scale ◽

Image Data ◽

Computational Time ◽

Process Data ◽

Systematic Analysis ◽

Large Scale Data ◽

Learning Techniques ◽

Effective Performance

Big data is large-scale data collected for knowledge discovery, and it has been widely used in various applications. Big data often includes image data from these applications and requires effective techniques to process it. In this paper, a survey of big image data research is conducted to analyze the performance of the methods. Deep learning techniques provide effective performance compared to other methods, including wavelet-based methods. Deep learning techniques have the problem of requiring more computational time, and this can be overcome by lightweight methods.


Software Abstractions for Large-Scale Deep Learning Models in Big Data Analytics

International Journal of Advanced Computer Science and Applications ◽

10.14569/ijacsa.2019.0100469 ◽

2019 ◽

Vol 10 (4) ◽

Author(s):

Ayaz H Khan ◽

Ali Mustafa ◽

Aneeq Yusuf ◽

Rehanullah Khan

Keyword(s):

Big Data ◽

Deep Learning ◽

Data Analytics ◽

Large Scale ◽

Big Data Analytics ◽

Learning Models


Deep Learning Based Active Monitoring for Anti-collision between Vessels and Bridges

IABSE Symposium, Guimarães 2019: Towards a Resilient Built Environment Risk and Asset Management ◽

10.2749/guimaraes.2019.0487 ◽

2019 ◽

Author(s):

Limu Chen ◽

Ye Xia ◽

Dexiong Pan ◽

Chengbin Wang

Keyword(s):

Decision Making ◽

Deep Learning ◽

Object Detection ◽

Large Scale ◽

Data Augmentation ◽

Information Support ◽

Single Shot ◽

Active Monitoring ◽

Detection Model ◽

Comparison Results

Deep-learning-based navigational object detection is discussed with respect to an active monitoring system for anti-collision between vessels and bridges. The motion-based object detection method widely used in existing anti-collision monitoring systems struggles with complicated and changeable waterways because of its limitations in accuracy, robustness and efficiency. The proposed video surveillance system contains six modules: image acquisition, detection, tracking, prediction, risk evaluation and decision-making; the detection module is discussed in detail. A vessel-exclusive dataset with a large number of image samples is established for neural network training, and an SSD (Single Shot MultiBox Detector) based object detection model with both universality and pertinence is generated through sample filtering, data augmentation and large-scale optimization, which make it capable of stable and intelligent vessel detection. Comparison with conventional methods indicates that the proposed deep-learning method shows remarkable advantages in robustness, accuracy, efficiency and intelligence. An in-situ test was carried out at Songpu Bridge in Shanghai, and the results illustrate that the method is qualified for long-term monitoring and for providing information support for further analysis and decision making.


Effective Statistical Methods for Big Data Analytics

Handbook of Research on Applied Cybernetics and Systems Science - Advances in Computational Intelligence and Robotics ◽

10.4018/978-1-5225-2498-4.ch014 ◽

2017 ◽

pp. 280-299 ◽

Cited By ~ 3

Author(s):

Cheng Meng ◽

Ye Wang ◽

Xinlian Zhang ◽

Abhyuday Mandal ◽

Wenxuan Zhong ◽

...

Keyword(s):

Decision Making ◽

Big Data ◽

Knowledge Discovery ◽

Statistical Methods ◽

Large Scale ◽

Big Data Analytics ◽

Divide And Conquer ◽

Data Driven ◽

The Past ◽

Large Scale Dataset

With advances in technologies in the past decade, the amount of data generated and recorded has grown enormously in virtually all fields of industry and science. This extraordinary amount of data provides unprecedented opportunities for data-driven decision-making and knowledge discovery. However, the task of analyzing such large-scale datasets poses significant challenges and calls for innovative statistical methods specifically designed for faster speed and higher efficiency. In this chapter, we review currently available methods for big data, with a focus on subsampling methods using statistical leveraging and on divide and conquer methods.
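
As a rough illustration of the divide and conquer idea, the sketch below fits a simple linear regression on each chunk of the data and combines the chunk estimates by a size-weighted average. This is one common form of the approach and is not taken from the chapter; the function names are hypothetical, and the sketch assumes every chunk contains at least two distinct x values.

```python
def ols_fit(xs, ys):
    """Least-squares slope and intercept for one predictor."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx  # assumes at least two distinct x values
    return slope, my - slope * mx


def divide_and_conquer_ols(xs, ys, n_chunks):
    """Fit each chunk separately, then average estimates by chunk size."""
    size = -(-len(xs) // n_chunks)  # ceiling division
    total, slope_sum, intercept_sum = 0, 0.0, 0.0
    for start in range(0, len(xs), size):
        cx, cy = xs[start:start + size], ys[start:start + size]
        slope, intercept = ols_fit(cx, cy)
        slope_sum += len(cx) * slope
        intercept_sum += len(cx) * intercept
        total += len(cx)
    return slope_sum / total, intercept_sum / total


# Noise-free toy data on the line y = 2x + 1, split into 3 chunks.
xs = [float(i) for i in range(12)]
ys = [2.0 * x + 1.0 for x in xs]
print(divide_and_conquer_ols(xs, ys, 3))
```

Each chunk can be processed on a separate machine or in a separate pass, so no step needs the full dataset in memory; the price is that the averaged estimate generally differs slightly from the full-data fit when the data are noisy.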
