An anisotropic heat diffusion model for enhancing the extraction of underground rail track fasteners under extremely low and uneven illumination conditions

During the past two decades, subway systems have become one of the most dominant infrastructural developments in China at an unprecedented pace and scale. More than 60 metro lines in 25 cities have been completed, transporting more than 70 million passengers daily. Operating the subway systems safely and efficiently is a continuously pressing demand from both the management companies and the public. Although many automated or semi-automated methods for extracting critical components of the rail track systems, e.g. rail, fastener, sleeper, etc., have significantly improved the productivity of routine inspection, the unique challenges posed by the subway systems have hindered these existing methods from successful implementation because of the extremely low illumination in the underground environment, whereas additional artificial lighting often poses extremely uneven illumination. In this study, a generalized local illumination adaptation model using an anisotropic heat equation is proposed to dynamically adjust the acquired rail track images with extremely low and uneven illumination conditions. An integration flow is then proposed to seamlessly incorporate the proposed model into the state-of-the-art automated fastener detection algorithms. The results show that the proposed local illumination adaptation model can significantly improve the performance of the tested state-of-the-art fastener detection algorithms when they are applied to the images collected in the environment with extremely low and uneven illumination conditions, e.g. subway systems.

Download Full-text

MRFF-YOLO: A Multi-Receptive Fields Fusion Network for Remote Sensing Target Detection

Remote Sensing ◽

10.3390/rs12193118 ◽

2020 ◽

Vol 12 (19) ◽

pp. 3118

Author(s):

Danqing Xu ◽

Yiquan Wu

Keyword(s):

Remote Sensing ◽

Object Detection ◽

Target Detection ◽

State Of The Art ◽

Receptive Fields ◽

Aerial Images ◽

Detection Rates ◽

Detection Algorithms ◽

Proposed Model ◽

Small Targets

High-altitude remote sensing target detection has problems related to its low precision and low detection rate. In order to enhance the performance of detecting remote sensing targets, a new YOLO (You Only Look Once)-V3-based algorithm was proposed. In our improved YOLO-V3, we introduced the concept of multi-receptive fields to enhance the performance of feature extraction. Therefore, the proposed model was termed Multi-Receptive Fields Fusion YOLO (MRFF-YOLO). In addition, to address the flaws of YOLO-V3 in detecting small targets, we increased the detection layers from three to four. Moreover, in order to avoid gradient fading, the structure of improved DenseNet was chosen in the detection layers. We compared our approach (MRFF-YOLO) with YOLO-V3 and other state-of-the-art target detection algorithms on an Remote Sensing Object Detection (RSOD) dataset and a dataset of Object Detection in Aerial Images (UCS-AOD). With a series of improvements, the mAP (mean average precision) of MRFF-YOLO increased from 77.10% to 88.33% in the RSOD dataset and increased from 75.67% to 90.76% in the UCS-AOD dataset. The leaking detection rates are also greatly reduced, especially for small targets. The experimental results showed that our approach achieved better performance than traditional YOLO-V3 and other state-of-the-art models for remote sensing target detection.

Download Full-text

Bi-Directional Generation for Unsupervised Domain Adaptation

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i04.6137 ◽

2020 ◽

Vol 34 (04) ◽

pp. 6615-6622 ◽

Cited By ~ 2

Author(s):

Guanglei Yang ◽

Haifeng Xia ◽

Mingli Ding ◽

Zhengming Ding

Keyword(s):

Domain Adaptation ◽

State Of The Art ◽

Target Domain ◽

Unsupervised Domain Adaptation ◽

Cross Domain ◽

Proposed Model ◽

Latent Space ◽

Domain Alignment ◽

Domain Information ◽

Adaptation Model

Unsupervised domain adaptation facilitates the unlabeled target domain relying on well-established source domain information. The conventional methods forcefully reducing the domain discrepancy in the latent space will result in the destruction of intrinsic data structure. To balance the mitigation of domain gap and the preservation of the inherent structure, we propose a Bi-Directional Generation domain adaptation model with consistent classifiers interpolating two intermediate domains to bridge source and target domains. Specifically, two cross-domain generators are employed to synthesize one domain conditioned on the other. The performance of our proposed method can be further enhanced by the consistent classifiers and the cross-domain alignment constraints. We also design two classifiers which are jointly optimized to maximize the consistency on target sample prediction. Extensive experiments verify that our proposed model outperforms the state-of-the-art on standard cross domain visual benchmarks.

Download Full-text

Enhanced context-aware recommendation using topic modeling and particle swarm optimization

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-210331 ◽

2021 ◽

pp. 1-16

Author(s):

Ibtissem Gasmi ◽

Mohamed Walid Azizi ◽

Hassina Seridi-Bouchelaghem ◽

Nabiha Azizi ◽

Samir Brahim Belhaouari

Keyword(s):

Topic Modeling ◽

Latent Dirichlet Allocation ◽

State Of The Art ◽

Weighting Function ◽

Contextual Factors ◽

Pearson Correlation ◽

Correlation Coefficients ◽

Pso Algorithm ◽

Context Aware ◽

Proposed Model

Context-Aware Recommender System (CARS) suggests more relevant services by adapting them to the user’s specific context situation. Nevertheless, the use of many contextual factors can increase data sparsity while few context parameters fail to introduce the contextual effects in recommendations. Moreover, several CARSs are based on similarity algorithms, such as cosine and Pearson correlation coefficients. These methods are not very effective in the sparse datasets. This paper presents a context-aware model to integrate contextual factors into prediction process when there are insufficient co-rated items. The proposed algorithm uses Latent Dirichlet Allocation (LDA) to learn the latent interests of users from the textual descriptions of items. Then, it integrates both the explicit contextual factors and their degree of importance in the prediction process by introducing a weighting function. Indeed, the PSO algorithm is employed to learn and optimize weights of these features. The results on the Movielens 1 M dataset show that the proposed model can achieve an F-measure of 45.51% with precision as 68.64%. Furthermore, the enhancement in MAE and RMSE can respectively reach 41.63% and 39.69% compared with the state-of-the-art techniques.

Download Full-text

RLC-GNN: An Improved Deep Architecture for Spatial-Based Graph Neural Network with Application to Fraud Detection

Applied Sciences ◽

10.3390/app11125656 ◽

2021 ◽

Vol 11 (12) ◽

pp. 5656

Author(s):

Yufan Zeng ◽

Jiashan Tang

Keyword(s):

Numerical Experiments ◽

State Of The Art ◽

Single Layer ◽

Fraud Detection ◽

Layer By Layer ◽

Residual Structure ◽

Detection Algorithms ◽

Deep Architecture ◽

Graph Neural Networks ◽

Node Embeddings

Graph neural networks (GNNs) have been very successful at solving fraud detection tasks. The GNN-based detection algorithms learn node embeddings by aggregating neighboring information. Recently, CAmouflage-REsistant GNN (CARE-GNN) is proposed, and this algorithm achieves state-of-the-art results on fraud detection tasks by dealing with relation camouflages and feature camouflages. However, stacking multiple layers in a traditional way defined by hop leads to a rapid performance drop. As the single-layer CARE-GNN cannot extract more information to fix the potential mistakes, the performance heavily relies on the only one layer. In order to avoid the case of single-layer learning, in this paper, we consider a multi-layer architecture which can form a complementary relationship with residual structure. We propose an improved algorithm named Residual Layered CARE-GNN (RLC-GNN). The new algorithm learns layer by layer progressively and corrects mistakes continuously. We choose three metrics—recall, AUC, and F1-score—to evaluate proposed algorithm. Numerical experiments are conducted. We obtain up to 5.66%, 7.72%, and 9.09% improvements in recall, AUC, and F1-score, respectively, on Yelp dataset. Moreover, we also obtain up to 3.66%, 4.27%, and 3.25% improvements in the same three metrics on the Amazon dataset.

Download Full-text

A Deep Learning Approach to Predict Autism Spectrum Disorder Using Multisite Resting-State fMRI

Applied Sciences ◽

10.3390/app11083636 ◽

2021 ◽

Vol 11 (8) ◽

pp. 3636

Author(s):

Faria Zarin Subah ◽

Kaushik Deb ◽

Pranab Kumar Dhar ◽

Takeshi Koshiba

Keyword(s):

Autism Spectrum Disorder ◽

Resting State ◽

State Of The Art ◽

Resting State Fmri ◽

Autism Spectrum ◽

Spectrum Disorder ◽

Bootstrap Analysis ◽

Proposed Model ◽

Art Methods ◽

The Mean

Autism spectrum disorder (ASD) is a complex and degenerative neuro-developmental disorder. Most of the existing methods utilize functional magnetic resonance imaging (fMRI) to detect ASD with a very limited dataset which provides high accuracy but results in poor generalization. To overcome this limitation and to enhance the performance of the automated autism diagnosis model, in this paper, we propose an ASD detection model using functional connectivity features of resting-state fMRI data. Our proposed model utilizes two commonly used brain atlases, Craddock 200 (CC200) and Automated Anatomical Labelling (AAL), and two rarely used atlases Bootstrap Analysis of Stable Clusters (BASC) and Power. A deep neural network (DNN) classifier is used to perform the classification task. Simulation results indicate that the proposed model outperforms state-of-the-art methods in terms of accuracy. The mean accuracy of the proposed model was 88%, whereas the mean accuracy of the state-of-the-art methods ranged from 67% to 85%. The sensitivity, F1-score, and area under receiver operating characteristic curve (AUC) score of the proposed model were 90%, 87%, and 96%, respectively. Comparative analysis on various scoring strategies show the superiority of BASC atlas over other aforementioned atlases in classifying ASD and control.

Download Full-text

Equivariant Adversarial Network for Image-to-image Translation

ACM Transactions on Multimedia Computing Communications and Applications ◽

10.1145/3458280 ◽

2021 ◽

Vol 17 (2s) ◽

pp. 1-14

Author(s):

Masoumeh Zareapoor ◽

Jie Yang

Keyword(s):

State Of The Art ◽

Generative Models ◽

Generative Model ◽

Target Domain ◽

Adversarial Network ◽

Proposed Model ◽

Image Translation ◽

Great Performance ◽

Representative Model ◽

The Ideal

Image-to-Image translation aims to learn an image from a source domain to a target domain. However, there are three main challenges, such as lack of paired datasets, multimodality, and diversity, that are associated with these problems and need to be dealt with. Convolutional neural networks (CNNs), despite of having great performance in many computer vision tasks, they fail to detect the hierarchy of spatial relationships between different parts of an object and thus do not form the ideal representative model we look for. This article presents a new variation of generative models that aims to remedy this problem. We use a trainable transformer, which explicitly allows the spatial manipulation of data within training. This differentiable module can be augmented into the convolutional layers in the generative model, and it allows to freely alter the generated distributions for image-to-image translation. To reap the benefits of proposed module into generative model, our architecture incorporates a new loss function to facilitate an effective end-to-end generative learning for image-to-image translation. The proposed model is evaluated through comprehensive experiments on image synthesizing and image-to-image translation, along with comparisons with several state-of-the-art algorithms.

Download Full-text

Rewiew of Fracture and Fatigue in Ceramic Matrix Composites

Applied Mechanics Reviews ◽

10.1115/1.3097345 ◽

2000 ◽

Vol 53 (6) ◽

pp. 147-174 ◽

Cited By ~ 14

Author(s):

Victor Birman ◽

Larry W. Byrd

Keyword(s):

High Temperature ◽

State Of The Art ◽

Ceramic Matrix Composites ◽

Ceramic Matrix ◽

Review Article ◽

Successful Implementation ◽

Matrix Composites ◽

Recent Developments ◽

Hostile Environments

A review of recent developments and state-of-the-art in research and understanding of damage and fatigue of ceramic matrix composites is presented. Both laminated as well as woven configurations are considered. The work on the effects of high temperature on fracture and fatigue of ceramic matrix composites is emphasized, because these materials are usually designed to operate in hostile environments. Based on a detailed discussion of the mechanisms of failure, the problems that have to be addressed for a successful implementation of ceramic matrix composites in design and practical operational structures are outlined. This review article includes 317 references.

Download Full-text

License Plate Tilt Correction: A Review

Engineering and Technology Journal ◽

10.30684/etj.v39i1b.1839 ◽

2021 ◽

Vol 39 (1B) ◽

pp. 101-116

Author(s):

Nada N. Kamal ◽

Enas Tariq

Keyword(s):

State Of The Art ◽

Recognition System ◽

License Plate ◽

License Plate Recognition ◽

Tilt Correction ◽

Detection Algorithms ◽

Essential Step ◽

Tilting Angle ◽

Different Types ◽

Line Fitting

Tilt correction is an essential step in the license plate recognition system (LPR). The main goal of this article is to provide a review of the various methods that are presented in the literature and used to correct different types of tilt that appear in the digital image of the license plates (LP). This theoretical survey will enable the researchers to have an overview of the available implemented tilt detection and correction algorithms. That’s how this review will simplify for the researchers the choice to determine which of the available rotation correction and detection algorithms to implement while designing their LPR system. This review also simplifies the decision for the researchers to choose whether to combine two or more of the existing algorithms or simply create a new efficient one. This review doesn’t recite the described models in the literature in a hard-narrative tale, but instead, it clarifies how the tilt correction stage is divided based on its initial steps. The steps include: locating the plate corners, finding the tilting angle of the plate, then, correcting its horizontal, vertical, and sheared inclination. For the tilt correction stage, this review clarifies how state-of-the-art literature handled each step individually. As a result, it has been noticed that line fitting, Hough transform, and Randon transform are the most used methods to correct the tilt of a LP.

Download Full-text

Improving Multilingual Sentence Embedding using Bi-directional Dual Encoder with Additive Margin Softmax

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2019/746 ◽

2019 ◽

Author(s):

Yinfei Yang ◽

Gustavo Hernandez Abrego ◽

Steve Yuan ◽

Mandy Guo ◽

Qinlan Shen ◽

...

Keyword(s):

United Nations ◽

State Of The Art ◽

Cosine Similarity ◽

Retrieval Task ◽

Parallel Corpus ◽

Similar Performance ◽

Second Stage ◽

Current State ◽

Proposed Model ◽

Document Level

In this paper, we present an approach to learn multilingual sentence embeddings using a bi-directional dual-encoder with additive margin softmax. The embeddings are able to achieve state-of-the-art results on the United Nations (UN) parallel corpus retrieval task. In all the languages tested, the system achieves P@1 of 86% or higher. We use pairs retrieved by our approach to train NMT models that achieve similar performance to models trained on gold pairs. We explore simple document-level embeddings constructed by averaging our sentence embeddings. On the UN document-level retrieval task, document embeddings achieve around 97% on P@1 for all experimented language pairs. Lastly, we evaluate the proposed model on the BUCC mining task. The learned embeddings with raw cosine similarity scores achieve competitive results compared to current state-of-the-art models, and with a second-stage scorer we achieve a new state-of-the-art level on this task.

Download Full-text

Multi-agent Attentional Activity Recognition

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2019/186 ◽

2019 ◽

Cited By ~ 3

Author(s):

Kaixuan Chen ◽

Lina Yao ◽

Dalin Zhang ◽

Bin Guo ◽

Zhiwen Yu

Keyword(s):

Activity Recognition ◽

State Of The Art ◽

Body Part ◽

Body Parts ◽

Temporal Attention ◽

Attention Model ◽

Proposed Model ◽

Collective Motions ◽

Multi Agent ◽

Real World Datasets

Multi-modality is an important feature of sensor based activity recognition. In this work, we consider two inherent characteristics of human activities, the spatially-temporally varying salience of features and the relations between activities and corresponding body part motions. Based on these, we propose a multi-agent spatial-temporal attention model. The spatial-temporal attention mechanism helps intelligently select informative modalities and their active periods. And the multiple agents in the proposed model represent activities with collective motions across body parts by independently selecting modalities associated with single motions. With a joint recognition goal, the agents share gained information and coordinate their selection policies to learn the optimal recognition model. The experimental results on four real-world datasets demonstrate that the proposed model outperforms the state-of-the-art methods.

Download Full-text