ResMem-Net: memory based deep CNN for image memorability estimation

PeerJ Computer Science ◽

10.7717/peerj-cs.767 ◽

2021 ◽

Vol 7 ◽

pp. e767

Author(s):

Arockia Praveen ◽

Abdulfattah Noorwali ◽

Duraimurugan Samiayya ◽

Mohammad Zubair Khan ◽

Durai Raj Vincent P M ◽

...

Keyword(s):

Deep Learning ◽

Large Scale ◽

Mean Squared Error ◽

State Of The Art ◽

Rank Correlation ◽

Current State ◽

Intermediate Layers ◽

Better Than ◽

Made In ◽

Memory Efficient

Image memorability is a very hard problem in image processing due to its subjective nature. But due to the introduction of Deep Learning and the large availability of data and GPUs, great strides have been made in predicting the memorability of an image. In this paper, we propose a novel deep learning architecture called ResMem-Net that is a hybrid of LSTM and CNN that uses information from the hidden layers of the CNN to compute the memorability score of an image. The intermediate layers are important for predicting the output because they contain information about the intrinsic properties of the image. The proposed architecture automatically learns visual emotions and saliency, shown by the heatmaps generated using the GradRAM technique. We have also used the heatmaps and results to analyze and answer one of the most important questions in image memorability: “What makes an image memorable?”. The model is trained and evaluated using the publicly available Large-scale Image Memorability dataset (LaMem) from MIT. The results show that the model achieves a rank correlation of 0.679 and a mean squared error of 0.011, which is better than the current state-of-the-art models and is close to human consistency (p = 0.68). The proposed architecture also has a significantly low number of parameters compared to the state-of-the-art architecture, making it memory efficient and suitable for production.

Download Full-text

Crop Rotation Modeling for Deep Learning-Based Parcel Classification from Satellite Time Series

Remote Sensing ◽

10.3390/rs13224599 ◽

2021 ◽

Vol 13 (22) ◽

pp. 4599

Author(s):

Félix Quinton ◽

Loic Landrieu

Keyword(s):

Time Series ◽

Deep Learning ◽

Crop Rotation ◽

Large Scale ◽

State Of The Art ◽

Crop Rotations ◽

Learning Approach ◽

Type Mapping ◽

Current State ◽

Crop Type

While annual crop rotations play a crucial role for agricultural optimization, they have been largely ignored for automated crop type mapping. In this paper, we take advantage of the increasing quantity of annotated satellite data to propose to model simultaneously the inter- and intra-annual agricultural dynamics of yearly parcel classification with a deep learning approach. Along with simple training adjustments, our model provides an improvement of over 6.3% mIoU over the current state-of-the-art of crop classification, and a reduction of over 21% of the error rate. Furthermore, we release the first large-scale multi-year agricultural dataset with over 300,000 annotated parcels.

Download Full-text

Multimodel Deep Learning for Person Detection in Aerial Images

Electronics ◽

10.3390/electronics9091459 ◽

2020 ◽

Vol 9 (9) ◽

pp. 1459

Author(s):

Mirela Kundid Vasić ◽

Vladan Papić

Keyword(s):

Deep Learning ◽

State Of The Art ◽

Contextual Information ◽

Geographical Area ◽

Aerial Images ◽

Person Detection ◽

Current State ◽

Aerial Vehicle ◽

Novel Method ◽

Better Than

In this paper, we propose a novel method for person detection in aerial images of nonurban terrain gathered by an Unmanned Aerial Vehicle (UAV), which plays an important role in Search And Rescue (SAR) missions. The UAV in SAR operations contributes significantly due to the ability to survey a larger geographical area from an aerial viewpoint. Because of the high altitude of recording, the object of interest (person) covers a small part of an image (around 0.1%), which makes this task quite challenging. To address this problem, a multimodel deep learning approach is proposed. The solution consists of two different convolutional neural networks in region proposal, as well as in the classification stage. Additionally, contextual information is used in the classification stage in order to improve the detection results. Experimental results tested on the HERIDAL dataset achieved precision of 68.89% and a recall of 94.65%, which is better than current state-of-the-art methods used for person detection in similar scenarios. Consequently, it may be concluded that this approach is suitable for usage as an auxiliary method in real SAR operations.

Download Full-text

Learning spatiotemporal features with 3D DenseNet and attention for gesture recognition

International Journal of Electrical Engineering Education ◽

10.1177/0020720919894196 ◽

2019 ◽

pp. 002072091989419

Author(s):

Honegzhe Liu ◽

Zhifang Deng ◽

Cheng Xu

Keyword(s):

Gesture Recognition ◽

Transition Layer ◽

Large Scale ◽

State Of The Art ◽

Attention Mechanism ◽

Spatiotemporal Features ◽

Current State ◽

Feature Extractor ◽

Dynamic Gestures ◽

Better Than

Gesture recognition aims at understanding dynamic gestures of the human body and is one of the most important ways of human–computer interaction; to extract more effective spatiotemporal features in gesture videos for more accurate gesture classification, a novel feature extractor network, spatiotemporal attention 3D DenseNet is proposed in this study. We extend DenseNet with 3D kernels and Refined Temporal Transition Layer based on Temporal Transition Layer, and we also explore attention mechanism in 3D ConvNets. We embed the Refined Temporal Transition Layer and attention mechanism in DenseNet3D, named the proposed network “spatiotemporal attention 3D DenseNet.” Our experiments show that our Refined Temporal Transition Layer performs better than Temporal Transition Layer and the proposed spatiotemporal attention 3D DenseNet in each modality outperforms the current state-of-the-art methods on the ChaLearn LAP Large-Scale Isolated gesture dataset. The code and pretrained model are released in https://github.com/dzf19927/STA3D .

Download Full-text

Developments in Rechargeable Mno2Electrodes for Lithium Batteries

MRS Proceedings ◽

10.1557/proc-135-585 ◽

1988 ◽

Vol 135 ◽

Cited By ~ 3

Author(s):

Michael M Thackeray

Keyword(s):

Lithium Batteries ◽

Lithium Battery ◽

Electrode Materials ◽

State Of The Art ◽

Structural Features ◽

Rechargeable Batteries ◽

Current State ◽

Alternative Systems ◽

Battery Technology ◽

Made In

AbstractConsiderable efforts are in progress to develop rechargeable batteries as alternative systems to the nickel-cadmium battery. In this regard, several advances have been made in ambient-temperature lithium battery technology, and specifically in the engineering of rechargeable lithium/manganese dioxide cells. This paper reviews the current state of the art in rechargeable Li/MnO2battery technology; particular attention is paid to the structural features of various MnO2electrode materials which influence their electrochemical and cycling behaviour in lithium cells.

Download Full-text

SHEDR: An End-to-End Deep Neural Event Detection and Recommendation Framework for Hyperlocal News Using Social Media

INFORMS Journal on Computing ◽

10.1287/ijoc.2021.1112 ◽

2021 ◽

Author(s):

Yuheng Hu ◽

Yili Hong

Keyword(s):

Neural Network ◽

Social Media ◽

Deep Learning ◽

Event Detection ◽

Large Scale ◽

Short Term Memory ◽

State Of The Art ◽

Neural Network Models ◽

Neural Event ◽

End To End

Residents often rely on newspapers and television to gather hyperlocal news for community awareness and engagement. More recently, social media have emerged as an increasingly important source of hyperlocal news. Thus far, the literature on using social media to create desirable societal benefits, such as civic awareness and engagement, is still in its infancy. One key challenge in this research stream is to timely and accurately distill information from noisy social media data streams to community members. In this work, we develop SHEDR (social media–based hyperlocal event detection and recommendation), an end-to-end neural event detection and recommendation framework with a particular use case for Twitter to facilitate residents’ information seeking of hyperlocal events. The key model innovation in SHEDR lies in the design of the hyperlocal event detector and the event recommender. First, we harness the power of two popular deep neural network models, the convolutional neural network (CNN) and long short-term memory (LSTM), in a novel joint CNN-LSTM model to characterize spatiotemporal dependencies for capturing unusualness in a region of interest, which is classified as a hyperlocal event. Next, we develop a neural pairwise ranking algorithm for recommending detected hyperlocal events to residents based on their interests. To alleviate the sparsity issue and improve personalization, our algorithm incorporates several types of contextual information covering topic, social, and geographical proximities. We perform comprehensive evaluations based on two large-scale data sets comprising geotagged tweets covering Seattle and Chicago. We demonstrate the effectiveness of our framework in comparison with several state-of-the-art approaches. We show that our hyperlocal event detection and recommendation models consistently and significantly outperform other approaches in terms of precision, recall, and F-1 scores. Summary of Contribution: In this paper, we focus on a novel and important, yet largely underexplored application of computing—how to improve civic engagement in local neighborhoods via local news sharing and consumption based on social media feeds. To address this question, we propose two new computational and data-driven methods: (1) a deep learning–based hyperlocal event detection algorithm that scans spatially and temporally to detect hyperlocal events from geotagged Twitter feeds; and (2) A personalized deep learning–based hyperlocal event recommender system that systematically integrates several contextual cues such as topical, geographical, and social proximity to recommend the detected hyperlocal events to potential users. We conduct a series of experiments to examine our proposed models. The outcomes demonstrate that our algorithms are significantly better than the state-of-the-art models and can provide users with more relevant information about the local neighborhoods that they live in, which in turn may boost their community engagement.

Download Full-text

A Survey of Graphical Page Object Detection with Deep Neural Networks

10.20944/preprints202104.0739.v1 ◽

2021 ◽

Author(s):

Jwalin Bhatt ◽

Khurram Azeem Hashmi ◽

Muhammad Zeshan Afzal ◽

Didier Stricker

Keyword(s):

Deep Learning ◽

Object Detection ◽

Conceptual Understanding ◽

Deep Neural Networks ◽

State Of The Art ◽

Learning Approaches ◽

Document Images ◽

Essential Information ◽

Current State ◽

High Level

In any document, graphical elements like tables, figures, and formulas contain essential information. The processing and interpretation of such information require specialized algorithms. Off-the-shelf OCR components cannot process this information reliably. Therefore, an essential step in document analysis pipelines is to detect these graphical components. It leads to a high-level conceptual understanding of the documents that makes digitization of documents viable. Since the advent of deep learning, the performance of deep learning-based object detection has improved many folds. In this work, we outline and summarize the deep learning approaches for detecting graphical page objects in the document images. Therefore, we discuss the most relevant deep learning-based approaches and state-of-the-art graphical page object detection in document images. This work provides a comprehensive understanding of the current state-of-the-art and related challenges. Furthermore, we discuss leading datasets along with the quantitative evaluation. Moreover, it discusses briefly the promising directions that can be utilized for further improvements.

Download Full-text

Using plants to remediate or manage metal-polluted soils: an overview on the current state of phytotechnologies

Acta Scientiarum Agronomy ◽

10.4025/actasciagron.v43i1.58283 ◽

2021 ◽

Vol 43 ◽

pp. e58283

Author(s):

Clístenes Williams Araújo do Nascimento ◽

Caroline Miranda Biondi ◽

Fernando Bruno Vieira da Silva ◽

Luiz Henrique Vieira Lima

Keyword(s):

Large Scale ◽

State Of The Art ◽

Cost Effective ◽

Time Frame ◽

Contaminated Site ◽

Polluted Soils ◽

Mining Sites ◽

Current State ◽

Remedial Actions ◽

The Tropics

Soil contamination by metals threatens both the environment and human health and hence requires remedial actions. The conventional approach of removing polluted soils and replacing them with clean soils (excavation) is very costly for low-value sites and not feasible on a large scale. In this scenario, phytoremediation emerged as a promising cost-effective and environmentally-friendly technology to render metals less bioavailable (phytostabilization) or clean up metal-polluted soils (phytoextraction). Phytostabilization has demonstrable successes in mining sites and brownfields. On the other hand, phytoextraction still has few examples of successful applications. Either by using hyperaccumulating plants or high biomass plants induced to accumulate metals through chelator addition to the soil, major phytoextraction bottlenecks remain, mainly the extended time frame to remediation and lack of revenue from the land during the process. Due to these drawbacks, phytomanagement has been proposed to provide economic, environmental, and social benefits until the contaminated site returns to productive usage. Here, we review the evolution, promises, and limitations of these phytotechnologies. Despite the lack of commercial phytoextraction operations, there have been significant advances in understanding phytotechnologies' main constraints. Further investigation on new plant species, especially in the tropics, and soil amendments can potentially provide the basis to transform phytoextraction into an operational metal clean-up technology in the future. However, at the current state of the art, phytotechnology is moving the focus from remediation technologies to pollution attenuation and palliative cares.

Download Full-text

A Robust Context-Based Deep Learning Approach for Highly Imbalanced Hyperspectral Classification

Computational Intelligence and Neuroscience ◽

10.1155/2021/9923491 ◽

2021 ◽

Vol 2021 ◽

pp. 1-17

Author(s):

Juan F. Ramirez Rochac ◽

Nian Zhang ◽

Lara A. Thompson ◽

Tolessa Deksissa

Keyword(s):

Deep Learning ◽

State Of The Art ◽

Mineral Exploration ◽

Classification Models ◽

Noise Resistance ◽

Deep Convolutional Neural Networks ◽

Current State ◽

Feature Augmentation ◽

Active Research ◽

Hyperspectral Classification

Hyperspectral imaging is an area of active research with many applications in remote sensing, mineral exploration, and environmental monitoring. Deep learning and, in particular, convolution-based approaches are the current state-of-the-art classification models. However, in the presence of noisy hyperspectral datasets, these deep convolutional neural networks underperform. In this paper, we proposed a feature augmentation approach to increase noise resistance in imbalanced hyperspectral classification. Our method calculates context-based features, and it uses a deep convolutional neuronet (DCN). We tested our proposed approach on the Pavia datasets and compared three models, DCN, PCA + DCN, and our context-based DCN, using the original datasets and the datasets plus noise. Our experimental results show that DCN and PCA + DCN perform well on the original datasets but not on the noisy datasets. Our robust context-based DCN was able to outperform others in the presence of noise and was able to maintain a comparable classification accuracy on clean hyperspectral images.

Download Full-text

A Comprehensive Taxonomy of Dynamic Texture Representation

ACM Computing Surveys ◽

10.1145/3487892 ◽

2023 ◽

Vol 55 (1) ◽

pp. 1-39

Author(s):

Thanh Tuan Nguyen ◽

Thanh Phuong Nguyen

Keyword(s):

Large Scale ◽

Environmental Changes ◽

State Of The Art ◽

The State ◽

Future Research ◽

Research Activities ◽

Potential Applications ◽

Benchmark Datasets ◽

Negative Impacts ◽

Made In

Representing dynamic textures (DTs) plays an important role in many real implementations in the computer vision community. Due to the turbulent and non-directional motions of DTs along with the negative impacts of different factors (e.g., environmental changes, noise, illumination, etc.), efficiently analyzing DTs has raised considerable challenges for the state-of-the-art approaches. For 20 years, many different techniques have been introduced to handle the above well-known issues for enhancing the performance. Those methods have shown valuable contributions, but the problems have been incompletely dealt with, particularly recognizing DTs on large-scale datasets. In this article, we present a comprehensive taxonomy of DT representation in order to purposefully give a thorough overview of the existing methods along with overall evaluations of their obtained performances. Accordingly, we arrange the methods into six canonical categories. Each of them is then taken in a brief presentation of its principal methodology stream and various related variants. The effectiveness levels of the state-of-the-art methods are then investigated and thoroughly discussed with respect to quantitative and qualitative evaluations in classifying DTs on benchmark datasets. Finally, we point out several potential applications and the remaining challenges that should be addressed in further directions. In comparison with two existing shallow DT surveys (i.e., the first one is out of date as it was made in 2005, while the newer one (published in 2016) is an inadequate overview), we believe that our proposed comprehensive taxonomy not only provides a better view of DT representation for the target readers but also stimulates future research activities.

Download Full-text

Optimal Passive Damper Positioning Techniques

Design Optimization of Active and Passive Structural Control Systems ◽

10.4018/978-1-4666-2029-2.ch004 ◽

2013 ◽

pp. 85-111 ◽

Cited By ~ 1

Author(s):

Arun M. Puthanpurayil ◽

Rajesh P Dhakal ◽

Athol J. Carr

Keyword(s):

Seismic Event ◽

State Of The Art ◽

Sensitivity Study ◽

Optimal Distribution ◽

Comparative Sensitivity ◽

Numerical Studies ◽

Current State ◽

Passive Damper ◽

Damping Model ◽

Made In

A consolidated review of the current-state-of-the-art on optimal damper positioning techniques is presented in this chapter. The inherent assumptions made in previous research are discussed and substantiated with numerical studies. Earlier studies have shown that optimal distribution of dampers is sensitive to in-structure damping. In this chapter the significance of optimal distribution of dampers coupled with the necessity for the use of a more realistic in-structure damping model is qualitatively illustrated using a comparative sensitivity study. The effect of inherent assumption of linearity of the parent frame on the ‘optimality’ is also investigated. It is shown that linearity assumption imposed on the parent frame in a major seismic event may not be justified; thereby raising doubts on the scope of optimality techniques proposed in literature.

Download Full-text