Deep learning approaches to pattern extraction and recognition in paintings and drawings: an overview

Neural Computing and Applications ◽

10.1007/s00521-021-05893-z ◽

2021 ◽

Author(s):

Giovanna Castellano ◽

Gennaro Vessio

Keyword(s):

Computer Vision ◽

Deep Learning ◽

Computer Science ◽

Visual Arts ◽

Visual Art ◽

Learning Approaches ◽

Pattern Extraction ◽

Art Collections ◽

Recent Advances ◽

Art Community

Abstract. This paper provides an overview of some of the most relevant deep learning approaches to pattern extraction and recognition in visual arts, particularly painting and drawing. Recent advances in deep learning and computer vision, coupled with the growing availability of large digitized visual art collections, have opened new opportunities for computer science researchers to assist the art community with automatic tools to analyse and further understand visual arts. Among other benefits, a deeper understanding of visual arts has the potential to make them more accessible to a wider population, ultimately supporting the spread of culture.

Fashion Product Classification through Deep Learning and Computer Vision

Applied Sciences ◽

10.3390/app9071385 ◽

2019 ◽

Vol 9 (7) ◽

pp. 1385 ◽

Cited By ~ 4

Author(s):

Luca Donati ◽

Eleonora Iotti ◽

Giulio Mordonini ◽

Andrea Prati

Keyword(s):

Image Processing ◽

Computer Vision ◽

Feature Extraction ◽

Deep Learning ◽

Template Matching ◽

Learning Approaches ◽

Visual Classification ◽

Product Classification ◽

Processing Techniques

Visual classification of commercial products is a branch of the wider fields of object detection and feature extraction in computer vision, and, in particular, it is an important step in the creative workflow in fashion industries. Automatically classifying garment features makes both designers and data experts aware of their overall production, which is fundamental in order to organize marketing campaigns, avoid duplicates, categorize apparel products for e-commerce purposes, and so on. There are many different techniques for visual classification, ranging from standard image processing to machine learning approaches: this work, made by using and testing the aforementioned approaches in collaboration with Adidas AG™, describes a real-world study aimed at automatically recognizing and classifying logos, stripes, colors, and other features of clothing, solely from final rendering images of their products. Specifically, both deep learning and image processing techniques, such as template matching, were used. The result is a novel system for image recognition and feature extraction that has a high classification accuracy and which is reliable and robust enough to be used by a company like Adidas. This paper shows the main problems and proposed solutions in the development of this system, and the experimental results on the Adidas AG™ dataset.

A survey on Deep Learning Based Eye Gaze Estimation Methods

Journal of Innovative Image Processing ◽

10.36548/jiip.2021.3.003 ◽

2021 ◽

Vol 3 (3) ◽

pp. 190-207

Author(s):

S. K. B. Sangeetha

Keyword(s):

Computer Vision ◽

Deep Learning ◽

Near Infrared ◽

Eye Gaze ◽

Infrared Image ◽

Estimation Methods ◽

Learning Technology ◽

Learning Approaches ◽

Gaze Tracking ◽

Inference Models

In recent years, deep-learning systems have made great progress, particularly in the disciplines of computer vision and pattern recognition. Deep-learning technology can be used to enable inference models to do real-time object detection and recognition. Using deep-learning-based designs, eye tracking systems could determine the position of eyes or pupils, regardless of whether visible-light or near-infrared image sensors were utilized. For growing electronic vehicle systems, such as driver monitoring systems and new touch screens, accurate and successful eye gaze estimates are critical. In demanding, unregulated, low-power situations, such systems must operate efficiently and at a reasonable cost. A thorough examination of the different deep learning approaches is required to take into consideration all of the limitations and opportunities of eye gaze tracking. The goal of this research is to learn more about the history of eye gaze tracking, as well as how deep learning contributed to computer vision-based tracking. Finally, this research presents a generalized system model for deep learning-driven eye gaze direction diagnostics, as well as a comparison of several approaches.

BUILDING GENERALIZATION USING DEEP LEARNING

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xlii-4-565-2018 ◽

2018 ◽

Vol XLII-4 ◽

pp. 565-572 ◽

Cited By ~ 4

Author(s):

M. Sester ◽

Y. Feng ◽

F. Thiemann

Keyword(s):

Computer Vision ◽

Deep Learning ◽

Physical Reality ◽

Training Data ◽

Data Sets ◽

Learning Approaches ◽

Depth Analysis ◽

Map Series ◽

Training Examples ◽

Future Work

Abstract. Cartographic generalization is a problem that poses interesting challenges to automation. Whereas plenty of algorithms have been developed for the different sub-problems of generalization (e.g. simplification, displacement, aggregation), there are still cases that are not generalized adequately or in a satisfactory way. The main problem is the interplay between different operators. In those cases the benchmark is the human operator, who is able to design an aesthetic and correct representation of the physical reality. Deep Learning methods have shown tremendous success for interpretation problems for which algorithmic methods have deficits. A prominent example is the classification and interpretation of images, where deep learning approaches outperform the traditional computer vision methods. In both domains – computer vision and cartography – humans are able to produce a solution; a prerequisite for this is that there is the possibility to generate many training examples for the different cases. Thus, the idea in this paper is to employ Deep Learning for cartographic generalization tasks, especially for the task of building generalization. An advantage of this task is the fact that many training data sets are available from given map series. The approach is a first attempt using an existing network. In the paper, the details of the implementation will be reported, together with an in-depth analysis of the results. An outlook on future work will be given.

Towards robots with geologist eyes? Computer vision and Deep Learning approaches to field samples analysis

10.5194/egusphere-egu21-11144 ◽

2021 ◽

Author(s):

Antoine Bouziat ◽

Sylvain Desroziers ◽

Abdoulaye Koroko ◽

Antoine Lechevallier ◽

Mathieu Feraille ◽

...

Keyword(s):

Computer Vision ◽

Deep Learning ◽

Decision Tree ◽

Use Cases ◽

Mineral Species ◽

Learning Approaches ◽

Learning Methods ◽

Data Set ◽

Field Samples ◽

Do So

Automation and robotics are attracting growing interest in the mining industry. If not already a reality, it is no longer science fiction to imagine autonomous robots routinely participating in the exploration and extraction of mineral raw materials in the near future. Among the various scientific and technical issues to be addressed towards this objective, this study focuses on the automation of real-time characterisation of rock images captured in the field, either to discriminate rock types and mineral species or to detect small elements such as mineral grains or metallic nuggets. To do so, we investigate the potential of methods from the Computer Vision community, a subfield of Artificial Intelligence dedicated to image processing. In particular, we aim at assessing the potential of Deep Learning approaches and convolutional neural networks (CNN) for the analysis of field sample pictures, highlighting key challenges before an industrial use in operational contexts. In a first initiative, we appraise Deep Learning methods to classify photographs of macroscopic rock samples between 12 lithological families. Using the architecture of reference CNN and a collection of 2700 images, we achieve a prediction accuracy above 90% for new pictures of good photographic quality. Nonetheless, we then seek to improve the robustness of the method for on-the-fly field photographs. To do so, we train an additional CNN to automatically separate the rock sample from the background, with a detection algorithm. We also introduce a more sophisticated classification method combining a set of several CNN with a decision tree. The CNN are specifically trained to recognise petrological features such as textures, structures or mineral species, while the decision tree mimics the naturalist methodology for lithological identification. In a second initiative, we evaluate Deep Learning techniques to spot and delimit specific elements in finer-scale images. We use a data set of carbonate thin sections with various species of microfossils. The data comes from a sedimentology study but analogies can be drawn with igneous geology use cases. We train four state-of-the-art Deep Learning methods for object detection with a limited data set of 15 annotated images. The results on 130 other thin section images are then qualitatively assessed by expert geologists, and precisions and inference times quantitatively measured. The four models show good capabilities in detecting and categorising the microfossils. However, differences in accuracy and performance are underlined, leading to recommendations for comparable projects in a mining context. Altogether, this study illustrates the power of Computer Vision and Deep Learning approaches to automate rock image analysis. However, to make the most of these technologies in mining activities, stimulating research opportunities lie in adapting the algorithms to the geological use cases, embedding as much geological knowledge as possible in the statistical models, and reducing the amount of training data to be manually interpreted beforehand.

Real-Time Object Detection for Aiding Visually Impaired using Deep Learning

International Journal of Engineering and Advanced Technology - Regular Issue ◽

10.35940/ijeat.d8374.049420 ◽

2020 ◽

Vol 9 (4) ◽

pp. 1600-1605 ◽

Cited By ~ 1

Keyword(s):

Computer Vision ◽

Deep Learning ◽

Internet Of Things ◽

Computer Science ◽

Visually Impaired ◽

Vision Loss ◽

Assistive Device ◽

Blind People ◽

Normal People ◽

The People

This research aims to create an assistive device for people who suffer from vision loss or impairment. The device is designed to help blind people overcome the daily challenges they face, which may be perceived as trivial by normal people. The device is created using advanced computer science technologies such as deep learning, computer vision and the internet of things. The device would be able to detect and classify everyday objects and give voice feedback to the blind user.

Learning Cartographic Building Generalization with Deep Convolutional Neural Networks

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi8060258 ◽

2019 ◽

Vol 8 (6) ◽

pp. 258 ◽

Cited By ~ 13

Author(s):

Yu Feng ◽

Frank Thiemann ◽

Monika Sester

Keyword(s):

Neural Networks ◽

Computer Vision ◽

Deep Learning ◽

Convolutional Neural Networks ◽

Physical Reality ◽

Learning Approaches ◽

Deep Convolutional Neural Networks ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Cartographic Generalization

Cartographic generalization is a problem that poses interesting challenges to automation. Whereas plenty of algorithms have been developed for the different sub-problems of generalization (e.g., simplification, displacement, aggregation), there are still cases that are not generalized adequately or in a satisfactory way. The main problem is the interplay between different operators. In those cases the benchmark is the human operator, who is able to design an aesthetic and correct representation of the physical reality. Deep learning methods have shown tremendous success for interpretation problems for which algorithmic methods have deficits. A prominent example is the classification and interpretation of images, where deep learning approaches outperform traditional computer vision methods. In both domains – computer vision and cartography – humans are able to produce good solutions. A prerequisite for the application of deep learning is the availability of many representative training examples for the situation to be learned. As this is given in cartography (there are many existing map series), the idea in this paper is to employ deep convolutional neural networks (DCNNs) for cartographic generalization tasks, especially for the task of building generalization. Three network architectures, namely U-net, residual U-net and generative adversarial network (GAN), are evaluated both quantitatively and qualitatively in this paper. They are compared based on their performance on this task at target map scales 1:10,000, 1:15,000 and 1:25,000, respectively. The results indicate that deep learning models can successfully learn cartographic generalization operations in one single model in an implicit way. The residual U-net outperforms the others and achieves the best generalization performance.

Machine Learning and Deep Learning Approaches for Brain Disease Diagnosis: Principles and Recent Advances

IEEE Access ◽

10.1109/access.2021.3062484 ◽

2021 ◽

Vol 9 ◽

pp. 37622-37655

Author(s):

Protima Khan ◽

Md. Fazlul Kader ◽

S. M. Riazul Islam ◽

Aisha B. Rahman ◽

Md. Shahriar Kamal ◽

...

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Disease Diagnosis ◽

Brain Disease ◽

Learning Approaches ◽

Recent Advances

Deep Learning does not Replace Bayesian Modeling: Comparing research use via citation counting

10.22541/au.163253673.34591907/v1 ◽

2021 ◽

Author(s):

Breck Baldwin

Keyword(s):

Artificial Intelligence ◽

Popular Culture ◽

Deep Learning ◽

Computer Science ◽

Bayesian Modeling ◽

Citation Count ◽

Funding Agency ◽

Research Use ◽

Learning Approaches ◽

The Impact

One could be excused for assuming that deep learning had or will soon usurp all credible work in reasoning, artificial intelligence and statistics, but like most ‘meme’ class broad generalizations the concept does not hold up to scrutiny. Memes don’t generally matter since the experts will always know better but in the case of Bayesian software like Stan and PyMC3 even its developers and advocates bemoan the apparent dominance of deep learning as manifested in popular culture, breathtaking performance and most problematically from funding agency peer review that impacts our ability to further advance the field. The facts however do not support the assumed dominance of deep learning in science upon closer examination. This letter simply makes the argument by the crudest of possible metrics, citation count, that once Computer Science is subtracted, Bayesian software accounts for nearly a third of research citations. Stan and PyMC3 dominate some fields, PyTorch, Keras and TensorFlow dominate others with lots of variation in between. Bayesian and deep learning approaches are related but very different technologies in goals, implementation and applicability with little actual overlap so this is not a surprise. While deep learning is backed by industry behemoths (Google, Facebook) the Bayesian efforts are not and it would behoove funders to recognize the impact of Bayesian software given its centrality to science.
