scholarly journals Whole-cell segmentation of tissue images with human-level performance using large-scale data annotation and deep learning

Author(s):  
Noah F. Greenwald ◽  
Geneva Miller ◽  
Erick Moen ◽  
Alex Kong ◽  
Adam Kagel ◽  
...  
2021 ◽  
Author(s):  
Noah F. Greenwald ◽  
Geneva Miller ◽  
Erick Moen ◽  
Alex Kong ◽  
Adam Kagel ◽  
...  

AbstractUnderstanding the spatial organization of tissues is of critical importance for both basic and translational research. While recent advances in tissue imaging are opening an exciting new window into the biology of human tissues, interpreting the data that they create is a significant computational challenge. Cell segmentation, the task of uniquely identifying each cell in an image, remains a substantial barrier for tissue imaging, as existing approaches are inaccurate or require a substantial amount of manual curation to yield useful results. Here, we addressed the problem of cell segmentation in tissue imaging data through large-scale data annotation and deep learning. We constructed TissueNet, an image dataset containing >1 million paired whole-cell and nuclear annotations for tissue images from nine organs and six imaging platforms. We created Mesmer, a deep learning-enabled segmentation algorithm trained on TissueNet that performs nuclear and whole-cell segmentation in tissue imaging data. We demonstrated that Mesmer has better speed and accuracy than previous methods, generalizes to the full diversity of tissue types and imaging platforms in TissueNet, and achieves human-level performance for whole-cell segmentation. Mesmer enabled the automated extraction of key cellular features, such as subcellular localization of protein signal, which was challenging with previous approaches. We further showed that Mesmer could be adapted to harness cell lineage information present in highly multiplexed datasets. We used this enhanced version to quantify cell morphology changes during human gestation. All underlying code and models are released with permissive licenses as a community resource.


2017 ◽  
Vol 68 ◽  
pp. 32-42 ◽  
Author(s):  
Rodrigo F. Berriel ◽  
Franco Schmidt Rossi ◽  
Alberto F. de Souza ◽  
Thiago Oliveira-Santos

2021 ◽  
Vol 2021 ◽  
pp. 1-9
Author(s):  
Bowen Shen ◽  
Hao Zhang ◽  
Cong Li ◽  
Tianheng Zhao ◽  
Yuanning Liu

Traditional machine learning methods are widely used in the field of RNA secondary structure prediction and have achieved good results. However, with the emergence of large-scale data, deep learning methods have more advantages than traditional machine learning methods. As the number of network layers increases in deep learning, there will often be problems such as increased parameters and overfitting. We used two deep learning models, GoogLeNet and TCN, to predict RNA secondary results. And from the perspective of the depth and width of the network, improvements are made based on the neural network model, which can effectively improve the computational efficiency while extracting more feature information. We process the existing real RNA data through experiments, use deep learning models to extract useful features from a large amount of RNA sequence data and structure data, and then predict the extracted features to obtain each base’s pairing probability. The characteristics of RNA secondary structure and dynamic programming methods are used to process the base prediction results, and the structure with the largest sum of the probability of each base pairing is obtained, and this structure will be used as the optimal RNA secondary structure. We, respectively, evaluated GoogLeNet and TCN models based on 5sRNA, tRNA data, and tmRNA data, and compared them with other standard prediction algorithms. The sensitivity and specificity of the GoogLeNet model on the 5sRNA and tRNA data sets are about 16% higher than the best prediction results in other algorithms. The sensitivity and specificity of the GoogLeNet model on the tmRNA dataset are about 9% higher than the best prediction results in other algorithms. As deep learning algorithms’ performance is related to the size of the data set, as the scale of RNA data continues to expand, the prediction accuracy of deep learning methods for RNA secondary structure will continue to improve.


2019 ◽  
Vol 52 (1) ◽  
pp. 77-124 ◽  
Author(s):  
Giang Nguyen ◽  
Stefan Dlugolinsky ◽  
Martin Bobák ◽  
Viet Tran ◽  
Álvaro López García ◽  
...  

2021 ◽  
Vol 3 (6) ◽  
Author(s):  
Aditya Shrivastava ◽  
Jai Prakash V Verma ◽  
Swati Jain ◽  
Sanjay Garg

AbstractThis study presents a novel approach to predict a complete source to destination trajectory of a vehicle using a partial trajectory query. The proposed architecture is scalable to extremely large-scale data with respect to the dense road network. A deep learning model Long Short Term Memory (LSTM) has been used for analyzing the temporal data and predicting the complete trajectory. To handle a large amount of data, clustering of similar trajectory data is used that helps in reducing the search space. The clusters based on geographical locations and temporal values are used for training different LSTM models. The proposed approach is compared with the other published work on the parameters as Average distance error and one step prediction accuracy The one-step prediction accuracy is as good as 81% and Distance error are .33 Km. Our proposed approach termed Clustered LSTM is outperforming in both the parameters when compared with other reported results. The proposed solution is a clustering-based predictive model that effectively contributes to accurately handle the large scale data. The outcome of this study leads to improvise the navigation systems, route prediction, traffic management, and location-based recommendation systems.


Author(s):  
Rodrigo F. Berriel ◽  
Lucas Tabelini Torres ◽  
Vinicius B. Cardoso ◽  
Ranik Guidolini ◽  
Claudine Badue ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document