Methods for Automatic Image-Based Classification of Winged Insects Using Computational Techniques

Author(s):  
Allan Rodrigues Rebelo ◽  
Joao Marcos Garcia Fagundes ◽  
Luciano Antonio Digiampietri ◽  
Helton Hideraldo Bíscaro
Author(s):  
Huma Lodhi

Predicting mutagenicity is a complex and challenging problem in chemoinformatics. Ames test is a biological method to assess mutagenicity of molecules. The dynamic growth in the repositories of molecules establishes a need to develop and apply effective and efficient computational techniques to solving chemoinformatics problems such as identification and classification of mutagens. Machine learning methods provide effective solutions to chemoinformatics problems. This chapter presents an overview of the learning techniques that have been developed and applied to the problem of identification and classification of mutagens.


2020 ◽  
Vol 31 (06) ◽  
pp. 2050043
Author(s):  
Michele Rossi ◽  
Lea Terracini

In this paper, we show that a smooth toric variety [Formula: see text] of Picard number [Formula: see text] always admits a nef primitive collection supported on a hyperplane admitting non-trivial intersection with the cone [Formula: see text] of numerically effective divisors and cutting a facet of the pseudo-effective cone [Formula: see text], that is [Formula: see text]. In particular, this means that [Formula: see text] admits non-trivial and non-big numerically effective divisors. Geometrically, this guarantees the existence of a fiber type contraction morphism over a smooth toric variety of dimension and Picard number lower than those of [Formula: see text], so giving rise to a classification of smooth and complete toric varieties with [Formula: see text]. Moreover, we revise and improve results of Oda–Miyake by exhibiting an extension of the above result to projective, toric, varieties of dimension [Formula: see text] and Picard number [Formula: see text], allowing us to classifying all these threefolds. We then improve results of Fujino–Sato, by presenting sharp (counter)examples of smooth, projective, toric varieties of any dimension [Formula: see text] and Picard number [Formula: see text] whose non-trivial nef divisors are big, that is [Formula: see text]. Producing those examples represents an important goal of computational techniques in definitely setting an open geometric problem. In particular, for [Formula: see text], the given example turns out to be a weak Fano toric fourfold of Picard number 4.


1989 ◽  
Vol 1 (1) ◽  
pp. 1-18 ◽  
Author(s):  
David Sankoff ◽  
Pascale Rousseau

ABSTRACTA set of ordered rules for generating variants of a variable determines (a) underlying/surface distinctions among some of the variants and (b) a hierarchical classification of the variants. In the analytical framework of variable rules, frequency data on variant occurrences in context bear only on (b) and not on (a). We provide a combinatorial characterization and enumeration of the set of classifications on n variants, the set of underlying/surface configurations, and the set of rule orders. We describe the statistical and computational techniques for generalizing variable rule analysis to the inference of rule order. These procedures are applied to the problems of the reduction of syllable-final consonants <s>, <n>, and <r> in Caribbean Spanish (n = 3, 4, 6 variants, respectively). Previous analyses have tended to assume that successive weakenings occur in an intrinsic order determined by phonological strength. Our results show that aspiration and deletion can indeed be seen to be intrinsically ordered in both <s> and <r> reduction, though an unordered analysis is equally likely in the case of <s>. On the other hand, velarization and deletion of <n> are unordered, while vocalization is a subsequent process, independent of the other two. Similarly, spirantization, aspiration, and lateralization of <r> are unordered, as confirmed by data sets from both Puerto Rican and Panamanian speakers. Furthermore, with both <n> and <r>, intrinsically ordered rule schemata proved to be extremely unlikely by statistical criteria. Syllable-final consonant reduction then consists of largely independent processes, most of which occur simultaneously.


2020 ◽  
Vol 20 (S14) ◽  
Author(s):  
Sadiq Alinsaif ◽  
Jochen Lang

Abstract Background A various number of imaging modalities are available (e.g., magnetic resonance, x-ray, ultrasound, and biopsy) where each modality can reveal different structural aspects of tissues. However, the analysis of histological slide images that are captured using a biopsy is considered the gold standard to determine whether cancer exists. Furthermore, it can reveal the stage of cancer. Therefore, supervised machine learning can be used to classify histopathological tissues. Several computational techniques have been proposed to study histopathological images with varying levels of success. Often handcrafted techniques based on texture analysis are proposed to classify histopathological tissues which can be used with supervised machine learning. Methods In this paper, we construct a novel feature space to automate the classification of tissues in histology images. Our feature representation is to integrate various features sets into a new texture feature representation. All of our descriptors are computed in the complex Shearlet domain. With complex coefficients, we investigate not only the use of magnitude coefficients, but also study the effectiveness of incorporating the relative phase (RP) coefficients to create the input feature vector. In our study, four texture-based descriptors are extracted from the Shearlet coefficients: co-occurrence texture features, Local Binary Patterns, Local Oriented Statistic Information Booster, and segmentation-based Fractal Texture Analysis. Each set of these attributes captures significant local and global statistics. Therefore, we study them individually, but additionally integrate them to boost the accuracy of classifying the histopathology tissues while being fed to classical classifiers. To tackle the problem of high-dimensionality, our proposed feature space is reduced using principal component analysis. In our study, we use two classifiers to indicate the success of our proposed feature representation: Support Vector Machine (SVM) and Decision Tree Bagger (DTB). Results Our feature representation delivered high performance when used on four public datasets. As such, the best achieved accuracy: multi-class Kather (i.e., 92.56%), BreakHis (i.e., 91.73%), Epistroma (i.e., 98.04%), Warwick-QU (i.e., 96.29%). Conclusions Our proposed method in the Shearlet domain for the classification of histopathological images proved to be effective when it was investigated on four different datasets that exhibit different levels of complexity.


2014 ◽  
Vol 2014 ◽  
pp. 1-12 ◽  
Author(s):  
Muhammad Javed Iqbal ◽  
Ibrahima Faye ◽  
Brahim Belhaouari Samir ◽  
Abas Md Said

Bioinformatics has been an emerging area of research for the last three decades. The ultimate aims of bioinformatics were to store and manage the biological data, and develop and analyze computational tools to enhance their understanding. The size of data accumulated under various sequencing projects is increasing exponentially, which presents difficulties for the experimental methods. To reduce the gap between newly sequenced protein and proteins with known functions, many computational techniques involving classification and clustering algorithms were proposed in the past. The classification of protein sequences into existing superfamilies is helpful in predicting the structure and function of large amount of newly discovered proteins. The existing classification results are unsatisfactory due to a huge size of features obtained through various feature encoding methods. In this work, a statistical metric-based feature selection technique has been proposed in order to reduce the size of the extracted feature vector. The proposed method of protein classification shows significant improvement in terms of performance measure metrics: accuracy, sensitivity, specificity, recall, F-measure, and so forth.


2021 ◽  
Vol 12 (3) ◽  
pp. 32-47
Author(s):  
Chaitanya Pandey

A natural language processing (NLP) method was used to uncover various issues and sentiments surrounding COVID-19 from social media and get a deeper understanding of fluctuating public opinion in situations of wide-scale panic to guide improved decision making with the help of a sentiment analyser created for the automated extraction of COVID-19-related discussions based on topic modelling. Moreover, the BERT model was used for the sentiment classification of COVID-19 Reddit comments. These findings shed light on the importance of studying trends and using computational techniques to assess the human psyche in times of distress.


2021 ◽  
Author(s):  
Chaitanya Pandey

A Natural Language Processing (NLP) method was used to uncover various issues and sentiments surrounding COVID-19 from social media and get a deeper understanding of fluctuating public opinion in situations of wide-scale panic to guide improved decision making with the help of a sentiment analyser created for the automated extraction of COVID-19 related discussions based on topic modelling. Moreover, the BERT model was used for the sentiment classification of COVID-19 Reddit comments. These findings shed light on the importance of studying trends and using computational techniques to assess human psyche in times of distress.


2018 ◽  
Author(s):  
Anu Vazhayil ◽  
R Vinayakumar ◽  
KP Soman

ABSTRACTThe knowledge regarding the function of proteins is necessary as it gives a clear picture of biological processes. Nevertheless, there are many protein sequences found and added to the databases but lacks functional annotation. The laboratory experiments take a considerable amount of time for annotation of the sequences. This arises the need to use computational techniques to classify proteins based on their functions. In our work, we have collected the data from Swiss-Prot containing 40433 proteins which is grouped into 30 families. We pass it to recurrent neural network(RNN), long short term memory(LSTM) and gated recurrent unit(GRU) model and compare it by applying trigram with deep neural network and shallow neural network on the same dataset. Through this approach, we could achieve maximum of around 78% accuracy for the classification of protein families.


2018 ◽  
Vol 63 (2) ◽  
pp. 131-137 ◽  
Author(s):  
Karan Veer ◽  
Tanu Sharma

AbstractDual-channel evaluation of surface electromyogram (SEMG) signals acquired from amputee subjects using computational techniques for classification of arm motions is presented in this study. SEMG signals were classified by the neural network (NN) and interpretation was done using statistical techniques to extract the effectiveness of the recorded signals. From the results, it was observed that there exists a calculative difference in amplitude gain across different motions and that SEMG signals have great potential to classify arm motions. The outcomes indicated that the NN algorithm performs significantly better than other algorithms, with a classification rate (CR) of 96.40%. Analysis of variance (ANOVA) presents the results to validate the effectiveness of the recorded data to discriminate SEMG signals. The results are of significant thrust in identifying the operations that can be implemented for classifying upper-limb movements suitable for prostheses’ design.


Author(s):  
Heru Ismanto ◽  
Azhari Azhari ◽  
Suharto Suharto ◽  
Lincolin Arsyad

The development of the region cannot be separated from the concept of economic growth and the determination of the mainstay region as a regional center that is expected to have a positive impact on economic growth to the surrounding regions. In fact, the determination of the mainstay region is a difficult thing to do. Some cases of the determination of the mainstay region are mostly on the basis of the prerogative rights of the policy makers without carefully seeing the achievements of the development of a region. The objective of this study is to develop a classification model of the mainstay economic region using computational techniques. The decision tree methods of NBTree and J48 are used in this study and combined with Klassen typology. The results of this study show that J48 algorithm has better accuracy than NBTree in the formation process of decision tree. The accuracy of J48 is higher than NBTree i.e. 68.96%. The comparative result of the classification of the mainstay economic region between Klassen and J48 shows that there is a shift in the class position of the development quadrant. In Klassen classification, there are three regions that are categorized into the mainstay regions with advanced development and rapid growth (K1). Meanwhile, J48 results show that there is no region categorized into K1. However, the mainstay economic region on J48 is based on the level of development with the level below K1, i.e. K2. J48 classification results show that there are ten regencies that are categorized into the mainstay economic regions, namely Biak, Regency of Jayapura, Jayawijaya, Kerom, Merauke, Mimika, Nabire, Ndunga, Yapen, and the Municipality of Jayapura.


Sign in / Sign up

Export Citation Format

Share Document