Understanding the state of the Art in Animal detection and classification using computer vision technologies

Author(s):  
Gabriel S. Ferrante ◽  
Felipe M. Rodrigues ◽  
Fernando R. H. Andrade ◽  
Rudinei Goularte ◽  
Rodolfo I. Meneguette
Author(s):  
Sebastian Hoppe Nesgaard Jensen ◽  
Mads Emil Brix Doest ◽  
Henrik Aanæs ◽  
Alessio Del Bue

AbstractNon-rigid structure from motion (nrsfm), is a long standing and central problem in computer vision and its solution is necessary for obtaining 3D information from multiple images when the scene is dynamic. A main issue regarding the further development of this important computer vision topic, is the lack of high quality data sets. We here address this issue by presenting a data set created for this purpose, which is made publicly available, and considerably larger than the previous state of the art. To validate the applicability of this data set, and provide an investigation into the state of the art of nrsfm, including potential directions forward, we here present a benchmark and a scrupulous evaluation using this data set. This benchmark evaluates 18 different methods with available code that reasonably spans the state of the art in sparse nrsfm. This new public data set and evaluation protocol will provide benchmark tools for further development in this challenging field.


Plant Methods ◽  
2019 ◽  
Vol 15 (1) ◽  
Author(s):  
Haipeng Xiong ◽  
Zhiguo Cao ◽  
Hao Lu ◽  
Simon Madec ◽  
Liang Liu ◽  
...  

Abstract Background Grain yield of wheat is greatly associated with the population of wheat spikes, i.e., $$spike~number~\text {m}^{-2}$$spikenumberm-2. To obtain this index in a reliable and efficient way, it is necessary to count wheat spikes accurately and automatically. Currently computer vision technologies have shown great potential to automate this task effectively in a low-end manner. In particular, counting wheat spikes is a typical visual counting problem, which is substantially studied under the name of object counting in Computer Vision. TasselNet, which represents one of the state-of-the-art counting approaches, is a convolutional neural network-based local regression model, and currently benchmarks the best record on counting maize tassels. However, when applying TasselNet to wheat spikes, it cannot predict accurate counts when spikes partially present. Results In this paper, we make an important observation that the counting performance of local regression networks can be significantly improved via adding visual context to the local patches. Meanwhile, such context can be treated as part of the receptive field without increasing the model capacity. We thus propose a simple yet effective contextual extension of TasselNet—TasselNetv2. If implementing TasselNetv2 in a fully convolutional form, both training and inference can be greatly sped up by reducing redundant computations. In particular, we collected and labeled a large-scale wheat spikes counting (WSC) dataset, with 1764 high-resolution images and 675,322 manually-annotated instances. Extensive experiments show that, TasselNetv2 not only achieves state-of-the-art performance on the WSC dataset ($$91.01\%$$91.01% counting accuracy) but also is more than an order of magnitude faster than TasselNet (13.82 fps on $$912\times 1216$$912×1216 images). The generality of TasselNetv2 is further demonstrated by advancing the state of the art on both the Maize Tassels Counting and ShanghaiTech Crowd Counting datasets. Conclusions This paper describes TasselNetv2 for counting wheat spikes, which simultaneously addresses two important use cases in plant counting: improving the counting accuracy without increasing model capacity, and improving efficiency without sacrificing accuracy. It is promising to be deployed in a real-time system with high-throughput demand. In particular, TasselNetv2 can achieve sufficiently accurate results when training from scratch with small networks, and adopting larger pre-trained networks can further boost accuracy. In practice, one can trade off the performance and efficiency according to certain application scenarios. Code and models are made available at: https://tinyurl.com/TasselNetv2.


Author(s):  
André Victória Matias ◽  
João Gustavo Atkinson Amorim ◽  
Luiz Antonio Buschetto Macarini ◽  
Allan Cerentini ◽  
Alexandre Sherlley Casimiro Onofre ◽  
...  

Author(s):  
João M.F. Rodrigues ◽  
Pedro J.S. Cardoso ◽  
Jânio Monteiro ◽  
Célia M.Q. Ramos

Smart systems make decisions incorporating data available from different sensing in a way to control and make smart actions. In this context, smart actions consist in augmenting user's actions and/or decisions by using devices or additional information. Those actions could and should be different from user to user, depending on its characteristics and needs. To obtain smart actions adapted to the users, it is necessary to detect the user's individualities on-the-fly. This chapter focuses on how augmented intelligence can leverage smart systems, addressing: (a) the definitions and relations of artificial intelligence, augmented intelligence, and smart systems, namely the state of the art on how to extract human features that can be used to develop augmented intelligent systems (using only computer vision methods); (b) a brief explanation of a “describing people integrated framework”, a framework to extract user information automatically without any user intervention; and (c) a description of several implemented smart systems, including a future work perspective.


2021 ◽  
Vol 30 (1) ◽  
pp. 23-43
Author(s):  
Roopalakshmi R

In this pandemic-prone era, health is of utmost concern for everyone and hence eating good quality fruits is very much essential for sound health. Unfortunately, nowadays it is quite very difficult to obtain naturally ripened fruits, due to existence of chemically ripened fruits being ripened using hazardous chemicals such as calcium carbide. However, most of the state-of-the art techniques are primarily focusing on identification of chemically ripened fruits with the help of computer vision-based approaches, which are less effective towards quantification of chemical contaminations present in the sample fruits. To solve these issues, a new framework for chemical ripening and contamination detection is presented, which employs both visual and IR spectrometric signatures in two different stages. The experiments conducted on both the GUI tool as well as hardware-based setups, clearly demonstrate the efficiency of the proposed framework in terms of detection confidence levels followed by the percentage of presence of chemicals in the sample fruit.


Sensors ◽  
2018 ◽  
Vol 18 (8) ◽  
pp. 2408 ◽  
Author(s):  
Eugenio Ivorra ◽  
Mario Ortega ◽  
José Catalán ◽  
Santiago Ezquerro ◽  
Luis Lledó ◽  
...  

Assistive technologies help all persons with disabilities to improve their accessibility in all aspects of their life. The AIDE European project contributes to the improvement of current assistive technologies by developing and testing a modular and adaptive multimodal interface customizable to the individual needs of people with disabilities. This paper describes the computer vision algorithms part of the multimodal interface developed inside the AIDE European project. The main contribution of this computer vision part is the integration with the robotic system and with the other sensory systems (electrooculography (EOG) and electroencephalography (EEG)). The technical achievements solved herein are the algorithm for the selection of objects using the gaze, and especially the state-of-the-art algorithm for the efficient detection and pose estimation of textureless objects. These algorithms were tested in real conditions, and were thoroughly evaluated both qualitatively and quantitatively. The experimental results of the object selection algorithm were excellent (object selection over 90%) in less than 12 s. The detection and pose estimation algorithms evaluated using the LINEMOD database were similar to the state-of-the-art method, and were the most computationally efficient.


2017 ◽  
Author(s):  
Arnadi Murtiyoso ◽  
Deni Suwardhi

AbstrakDewasa ini teknik pencocokan citra banyak dimanfaatkan dalam fotogrametri. Metode yang berasal dari ranah penglihatan komputer ini dapat mempermudah sejumlah tugas dalam fotogrametri yang sebelumnya harus dikerjakan secara manual. Fotogrametri sendiri merupakan teknik yang sering digunakan dalam dokumentasi cagar budaya dan arkeologi untuk menghasilkan model trimatra (3D). Fotogrametri dapat menjadi alternatif ataupun komplemen dari teknik pemindaian laser dalam menghasilkan model 3D yang akurat. Makalah ini bermaksud merangkum secara umum teknik pencocokan citra otomatis yang banyak digunakan di perangkat lunak fotogrametri modern, dengan menyertakan sejumlah contoh kasus untuk penggunaannya dalam rangka dokumentasi cagar budaya.Kata kunci: Pencocokan Citra, Fotogrametri, Cagar Budaya, Dokumentasi, Model 3D AbstractToday the image matching technique is used extensively in photogrammetry. This method, which originated from the computer vision domain, facilitates several tasks within the photogrammetric workflow which previously had to be performed manually. Photogrammetry itself is a technique which is often employed in the documentation of heritage as well as archaeology in order to create 3D models. Photogrammetry can be an alternative or a complement to laser scanning in producing accurate and reality-based 3D models. This paper will summarize the state of the art of the image matching technique often used in modern photogrammetry software packages. Several case studies of its use in the area of heritage documentation will also be presented.Keywords: Image Matching, Photogrammetry, Heritage, Documentation, 3D Model


Author(s):  
T. A. Welton

Various authors have emphasized the spatial information resident in an electron micrograph taken with adequately coherent radiation. In view of the completion of at least one such instrument, this opportunity is taken to summarize the state of the art of processing such micrographs. We use the usual symbols for the aberration coefficients, and supplement these with £ and 6 for the transverse coherence length and the fractional energy spread respectively. He also assume a weak, biologically interesting sample, with principal interest lying in the molecular skeleton remaining after obvious hydrogen loss and other radiation damage has occurred.


2003 ◽  
Vol 48 (6) ◽  
pp. 826-829 ◽  
Author(s):  
Eric Amsel
Keyword(s):  

1968 ◽  
Vol 13 (9) ◽  
pp. 479-480
Author(s):  
LEWIS PETRINOVICH
Keyword(s):  

Sign in / Sign up

Export Citation Format

Share Document