SMVNet: Deep Learning Architectures for Accurate and Robust Multi-View Stereopsis

Deep learning techniques are increasingly used in the scientific community, driven by the high computational capacity of current systems and by the growing amount of data available as a result of the digitalisation of society in general and the industrial world in particular. In addition, the emergence of edge computing, which focuses on integrating artificial intelligence as close as possible to the client, makes it possible to implement systems that act in real time without transferring all of the data to centralised servers. Combining these two concepts can lead to systems that make correct decisions and act on them immediately and in situ. However, the low capacity of embedded systems greatly hinders this integration, so being able to deploy deep learning models on a wide range of micro-controllers can be a great advantage. This paper contributes an environment based on Mbed OS and TensorFlow Lite that can be embedded in any general-purpose embedded system, allowing the introduction of deep learning architectures. The experiments show that the proposed system is competitive with other commercial systems.

Chest x-ray automated triage: a semiologic approach designed for clinical implementation, exploiting different types of labels through a combination of four Deep Learning architectures.

Computer Methods and Programs in Biomedicine ◽

10.1016/j.cmpb.2021.106130 ◽

2021 ◽

pp. 106130

Author(s):

Candelaria Mosquera ◽

Facundo Nahuel Diaz ◽

Fernando Binder ◽

José Martín Rabellino ◽

Sonia Elizabeth Benitez ◽

...

Keyword(s):

Deep Learning ◽

Clinical Implementation ◽

X Ray ◽

Different Types ◽

Chest X Ray ◽

Learning Architectures

Multimodal Deep Learning and Visible-Light and Hyperspectral Imaging for Fruit Maturity Estimation

Sensors ◽

10.3390/s21041288 ◽

2021 ◽

Vol 21 (4) ◽

pp. 1288

Author(s):

Cinmayii A. Garillos-Manliguez ◽

John Y. Chiang

Keyword(s):

Deep Learning ◽

Visible Light ◽

Hyperspectral Imaging ◽

Morphological Changes ◽

Consumer Preference ◽

Hyperspectral Data ◽

Sensitivity Analyses ◽

Deep Convolutional Neural Networks ◽

Fruit Maturity ◽

Learning Architectures

Fruit maturity is a critical factor in the supply chain, consumer preference, and the agriculture industry. Most fruit maturity classification methods identify only two classes, ripe and unripe, whereas this paper estimates six maturity stages of papaya fruit. Deep learning architectures have gained respect and brought breakthroughs in unimodal processing. This paper suggests a novel non-destructive, multimodal classification method using deep convolutional neural networks that estimate fruit maturity by feature concatenation of data acquired from two imaging modes: visible-light and hyperspectral imaging systems. Morphological changes in the sample fruits can be easily measured with RGB images, while spectral signatures that provide high sensitivity and high correlation with the internal properties of fruits can be extracted from hyperspectral images covering wavelengths between 400 nm and 900 nm; both factors must be considered when building a model. This study further modified the AlexNet, VGG16, VGG19, ResNet50, ResNeXt50, MobileNet, and MobileNetV2 architectures to utilize multimodal data cubes composed of RGB and hyperspectral data for sensitivity analyses. These multimodal variants achieve F1 scores of up to 0.90 and a top-2 error rate as low as 1.45% for the classification of six stages. Overall, multimodal input coupled with powerful deep convolutional neural network models can classify fruit maturity even at the refined level of six stages. This indicates that multimodal deep learning architectures and multimodal imaging have great potential for real-time in-field fruit maturity estimation, which can help estimate optimal harvest time, and for other in-field industrial applications.
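To make the two ideas in this abstract concrete, the following is a minimal sketch of feature concatenation across two modalities and of a top-k error rate over six classes. The abstract does not state where in the network the fusion happens or how the metric was computed, so the function names (`concat_features`, `top_k_error`), the plain-list representation, and the toy numbers are all assumptions made for illustration.

```python
def concat_features(rgb_features, hs_features):
    """Join per-sample feature vectors from two modalities end to end.

    Hypothetical helper: each argument is a list of per-sample vectors,
    one list for RGB-derived features and one for hyperspectral ones.
    """
    if len(rgb_features) != len(hs_features):
        raise ValueError("both modalities must cover the same samples")
    return [list(r) + list(h) for r, h in zip(rgb_features, hs_features)]


def top_k_error(scores, labels, k=2):
    """Fraction of samples whose true label is not among the k highest scores."""
    misses = 0
    for row, label in zip(scores, labels):
        # Class indices ordered from highest to lowest score.
        ranked = sorted(range(len(row)), key=lambda i: row[i], reverse=True)
        if label not in ranked[:k]:
            misses += 1
    return misses / len(labels)


# Toy data (invented): two samples, 2 RGB features and 3 hyperspectral features each.
rgb = [[0.1, 0.2], [0.3, 0.4]]
hs = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]]
fused = concat_features(rgb, hs)

# Toy class scores for six maturity stages (invented values).
scores = [
    [0.05, 0.60, 0.20, 0.05, 0.05, 0.05],
    [0.70, 0.10, 0.05, 0.05, 0.05, 0.05],
    [0.10, 0.10, 0.50, 0.20, 0.05, 0.05],
    [0.02, 0.03, 0.05, 0.40, 0.45, 0.05],
]
labels = [2, 0, 5, 3]
print(len(fused[0]), top_k_error(scores, labels, k=2))
```

A top-2 error counts a prediction as correct when the true stage is either the first or the second guess, which is why it is typically much lower than the top-1 error on adjacent, visually similar maturity stages.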

Application of Deep Learning Architectures for Cyber Security

Advanced Sciences and Technologies for Security Applications - Cybersecurity and Secure Information Systems ◽

10.1007/978-3-030-16837-7_7 ◽

2019 ◽

pp. 125-160 ◽

Cited By ~ 3

Author(s):

R. Vinayakumar ◽

K. P. Soman ◽

Prabaharan Poornachandran ◽

S. Akarsh

Keyword(s):

Deep Learning ◽

Cyber Security ◽

Learning Architectures

Ensemble Deep Learning for Cervix Image Selection toward Improving Reliability in Automated Cervical Precancer Screening

Diagnostics ◽

10.3390/diagnostics10070451 ◽

2020 ◽

Vol 10 (7) ◽

pp. 451 ◽

Cited By ~ 2

Author(s):

Peng Guo ◽

Zhiyun Xue ◽

Zac Mtema ◽

Karen Yeates ◽

Ophira Ginsburg ◽

...

Keyword(s):

Deep Learning ◽

Learning Algorithm ◽

Binary Classification ◽

Digital Camera ◽

Deep Learning Algorithm ◽

Average Accuracy ◽

Acceptable Quality ◽

Cervical Precancer ◽

One Class Classification ◽

Learning Architectures

Automated Visual Examination (AVE) is a deep learning algorithm that aims to improve the effectiveness of cervical precancer screening, particularly in low- and medium-resource regions. It was trained on data from a large longitudinal study conducted by the National Cancer Institute (NCI) and has been shown to accurately identify cervices with early stages of cervical neoplasia for clinical evaluation and treatment. The algorithm processes images of the uterine cervix taken with a digital camera and alerts the user if the woman is a candidate for further evaluation. This requires that the algorithm be presented with images of the cervix, the object of interest, that are of acceptable quality, i.e., in sharp focus, with good illumination, without shadows or other occlusions, and showing the entire squamo-columnar transformation zone. Our prior work has addressed some of these constraints to help discard images that do not meet these criteria. In this work, we present a novel algorithm that determines whether the image contains the cervix to a sufficient extent. Non-cervix or otherwise inadequate images could lead to suboptimal or wrong results. Manual removal of such images is labor-intensive and time-consuming, particularly when working with large retrospective collections acquired with inadequate quality control. Specifically, we present a novel ensemble deep learning method to identify cervix images and non-cervix images in a smartphone-acquired cervical image dataset. The ensemble method combines the assessments of three deep learning architectures, RetinaNet, Deep SVDD, and a customized CNN (Convolutional Neural Network), each using a different strategy to arrive at its decision, i.e., object detection, one-class classification, and binary classification, respectively. We examined the performance of each individual architecture and of an ensemble of all three architectures. An average accuracy and F-1 score of 91.6% and 0.890, respectively, were achieved on a separate test dataset consisting of more than 30,000 smartphone-captured images.
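As an illustration of how three per-image decisions might be combined and scored, here is a small sketch. The abstract does not say how the three assessments are merged, so simple majority voting is an assumption here, not the authors' stated rule; the function names and the toy votes are likewise hypothetical.

```python
def majority_vote(votes):
    """Return True ("cervix") when more than half of the model votes are True."""
    return sum(votes) * 2 > len(votes)


def accuracy_and_f1(predicted, actual):
    """Accuracy and F1 score for boolean predictions, with True as the positive class."""
    pairs = list(zip(predicted, actual))
    tp = sum(1 for p, a in pairs if p and a)
    fp = sum(1 for p, a in pairs if p and not a)
    fn = sum(1 for p, a in pairs if not p and a)
    correct = sum(1 for p, a in pairs if p == a)
    accuracy = correct / len(pairs)
    denom = 2 * tp + fp + fn
    f1 = 2 * tp / denom if denom else 0.0
    return accuracy, f1


# Toy votes (invented): one tuple per image, ordered as
# (object detector, one-class classifier, binary classifier).
votes_per_image = [
    (True, True, False),
    (False, False, True),
    (True, True, True),
    (False, True, True),
    (False, False, False),
]
ensemble = [majority_vote(v) for v in votes_per_image]
actual = [True, False, True, False, True]
accuracy, f1 = accuracy_and_f1(ensemble, actual)
print(ensemble, accuracy, round(f1, 3))
```

With an odd number of voters a strict majority always exists, which is one practical reason to combine three models rather than two.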

Acoustic Scene Classification using Deep Learning Architectures

2021 6th International Conference for Convergence in Technology (I2CT) ◽

10.1109/i2ct51068.2021.9418177 ◽

2021 ◽

Author(s):

Spoorthy. V ◽

Manjunath Mulimani ◽

Shashidhar G. Koolagudi

Keyword(s):

Deep Learning ◽

Scene Classification ◽

Learning Architectures

Notice of Removal: Performance analysis of deep learning architectures for ultrasonic NDE applications

2017 IEEE International Ultrasonics Symposium (IUS) ◽

10.1109/ultsym.2017.8091729 ◽

2017 ◽

Cited By ~ 1

Author(s):

Kushal Virupakshappa ◽

Erdal Oruklu

Keyword(s):

Deep Learning ◽

Performance Analysis ◽

Ultrasonic Nde ◽

Learning Architectures

Automated pneumonia detection on chest X-ray images: A deep learning approach with different optimizers and transfer learning architectures

Measurement ◽

10.1016/j.measurement.2021.109953 ◽

2021 ◽

pp. 109953

Author(s):

Adhiyaman Manickam ◽

Jianmin Jiang ◽

Yu Zhou ◽

Abhinav Sagar ◽

Rajkumar Soundrapandiyan ◽

...

Keyword(s):

Deep Learning ◽

Transfer Learning ◽

Learning Approach ◽

X Ray ◽

Chest X Ray ◽

Learning Architectures

Automated Source Code Generation and Auto-Completion Using Deep Learning: Comparing and Discussing Current Language Model-Related Approaches

AI ◽

10.3390/ai2010001 ◽

2021 ◽

Vol 2 (1) ◽

pp. 1-16

Author(s):

Juan Cruz-Benito ◽

Sanjay Vishwakarma ◽

Francisco Martin-Fernandez ◽

Ismael Faro

Keyword(s):

Deep Learning ◽

Learning Community ◽

Programming Languages ◽

Language Processing ◽

Code Generation ◽

Language Model ◽

Language Models ◽

Stochastic Gradient Descent ◽

Network Architectures ◽

Learning Architectures

In recent years, the use of deep learning in language models has gained much attention. Some research projects claim that they can generate text that can be interpreted as human writing, enabling new possibilities in many application areas. Among the different areas related to language processing, one of the most notable in applying this type of modeling is programming languages. For years, the machine learning community has been researching this software engineering area, pursuing goals such as auto-completing, generating, fixing, or evaluating code written by humans. Despite the increasing popularity of deep learning-enabled language models, we found a lack of empirical papers that compare different deep learning architectures for creating and using language models based on programming code. This paper compares different neural network architectures, namely Averaged Stochastic Gradient Descent (ASGD) Weight-Dropped LSTMs (AWD-LSTMs), AWD-Quasi-Recurrent Neural Networks (QRNNs), and Transformer, while using transfer learning and different forms of tokenization, to see how they behave in building language models from a Python dataset for code generation and mask-filling tasks. Considering the results, we discuss each approach’s strengths and weaknesses and the gaps we found in evaluating the language models or applying them in a real programming context.
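The effect of tokenization choice on Python source can be sketched with the standard library alone. The abstract does not name the tokenizers compared, so the two schemes below (character-level and lexer-level via the `tokenize` module) and the `<mask>` placeholder are assumptions chosen for illustration; the helper names are hypothetical.

```python
import io
import tokenize


def char_tokens(source):
    """Character-level tokenization: every character is its own token."""
    return list(source)


def lexer_tokens(source):
    """Lexer-level tokenization using Python's own tokenizer.

    Layout-only tokens (newlines, indentation, end marker) and comments
    are dropped so that only code-bearing tokens remain.
    """
    skip = {
        tokenize.NEWLINE, tokenize.NL, tokenize.INDENT,
        tokenize.DEDENT, tokenize.ENDMARKER, tokenize.COMMENT,
    }
    readline = io.StringIO(source).readline
    return [tok.string for tok in tokenize.generate_tokens(readline)
            if tok.type not in skip]


def mask_token(tokens, index, mask="<mask>"):
    """Return a copy of tokens with one position hidden, as in a mask-filling task."""
    masked = list(tokens)
    masked[index] = mask
    return masked


snippet = "total = price * 2\n"
tokens = lexer_tokens(snippet)
print(len(char_tokens(snippet)), tokens, mask_token(tokens, 2))
```

Character-level sequences are much longer but need a tiny vocabulary, while lexer-level tokens are shorter sequences with an open-ended vocabulary of identifiers; that trade-off is one reason tokenization is worth varying in such a comparison.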

Deep Learning Approach for the Morphological Synthesis in Malayalam and Tamil at the Character Level

ACM Transactions on Asian and Low-Resource Language Information Processing ◽

10.1145/3457976 ◽

2021 ◽

Vol 20 (6) ◽

pp. 1-17

Author(s):

B. Premjith ◽

K. P. Soman

Keyword(s):

Deep Learning ◽

Short Term Memory ◽

Conditional Random Field ◽

Short Term ◽

Long Short Term Memory ◽

Target Languages ◽

Main Components ◽

Learning Architectures ◽

Gated Recurrent Unit ◽

Character Sequence

Morphological synthesis is one of the main components of Machine Translation (MT) frameworks, especially when either or both of the source and target languages are morphologically rich. Morphological synthesis is the process of combining two words or two morphemes according to the Sandhi rules of the morphologically rich language. Malayalam and Tamil are two languages of India that are morphologically abundant as well as agglutinative. Morphological synthesis of a word in these two languages is challenging for the following reasons: (1) abundance in morphology; (2) complex Sandhi rules; (3) the possibility in Malayalam of forming words by combining words that belong to different syntactic categories (for example, noun and verb); and (4) the construction of a sentence by combining multiple words. We formulated the task of the morphological generation of nouns and verbs of Malayalam and Tamil as a character-to-character sequence tagging problem. In this article, we used deep learning architectures such as Recurrent Neural Network (RNN), Long Short-Term Memory Networks (LSTM), Gated Recurrent Unit (GRU), and their stacked and bidirectional versions for the implementation of morphological synthesis at the character level. In addition, we investigated the performance of combining these deep learning architectures with the Conditional Random Field (CRF) in the morphological synthesis of nouns and verbs in Malayalam and Tamil. We observed that adding CRF to the Bidirectional LSTM/GRU architecture achieved more than 99% accuracy in the morphological synthesis of Malayalam and Tamil nouns and verbs.
