Instance-Level Facial Attributes Transfer with Geometry-Aware Flow

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33019111 ◽

2019 ◽

Vol 33 ◽

pp. 9111-9118 ◽

Cited By ~ 6

Author(s):

Weidong Yin ◽

Ziwei Liu ◽

Chen Change Loy

Keyword(s):

High Resolution ◽

Target Face ◽

Training Data ◽

Semantic Level ◽

Facial Landmarks ◽

Face Images ◽

Facial Attributes

We address the problem of instance-level facial attribute transfer without paired training data, e.g., faithfully transferring the exact mustache from a source face to a target face. This is a more challenging task than the conventional semantic-level attribute transfer, which only preserves the generic attribute style instead of instance-level traits. We propose the use of geometry-aware flow, which serves as a wellsuited representation for modeling the transformation between instance-level facial attributes. Specifically, we leverage the facial landmarks as the geometric guidance to learn the differentiable flows automatically, despite of the large pose gap existed. Geometry-aware flow is able to warp the source face attribute into the target face context and generate a warp-and-blend result. To compensate for the potential appearance gap between source and target faces, we propose a hallucination sub-network that produces an appearance residual to further refine the warp-and-blend result. Finally, a cycle-consistency framework consisting of both attribute transfer module and attribute removal module is designed, so that abundant unpaired face images can be used as training data. Extensive evaluations validate the capability of our approach in transferring instance-level facial attributes faithfully across large pose and appearance gaps. Thanks to the flow representation, our approach can readily be applied to generate realistic details on high-resolution images1.

Download Full-text

Assessing Riyadh’s Urban Change Utilizing High-Resolution Imagery

Land ◽

10.3390/land8120193 ◽

2019 ◽

Vol 8 (12) ◽

pp. 193

Author(s):

Ali Alghamdi ◽

Anthony R. Cummings

Keyword(s):

Maximum Likelihood ◽

High Resolution ◽

Accuracy Assessment ◽

Middle Eastern ◽

Object Oriented ◽

Training Data ◽

High Resolution Imagery ◽

Significant Research ◽

Desert Areas ◽

Spot 5

The implications of change on local processes have attracted significant research interest in recent times. In urban settings, green spaces and forests have attracted much attention. Here, we present an assessment of change within the predominantly desert Middle Eastern city of Riyadh, an understudied setting. We utilized high-resolution SPOT 5 data and two classification techniques—maximum likelihood classification and object-oriented classification—to study the changes in Riyadh between 2004 and 2014. Imagery classification was completed with training data obtained from the SPOT 5 dataset, and an accuracy assessment was completed through a combination of field surveys and an application developed in ESRI Survey 123 tool. The Survey 123 tool allowed residents of Riyadh to present their views on land cover for the 2004 and 2014 imagery. Our analysis showed that soil or ‘desert’ areas were converted to roads and buildings to accommodate for Riyadh’s rapidly growing population. The object-oriented classifier provided higher overall accuracy than the maximum likelihood classifier (74.71% and 73.79% vs. 92.36% and 90.77% for 2004 and 2014). Our work provides insights into the changes within a desert environment and establishes a foundation for understanding change in this understudied setting.

Download Full-text

Multimodal Low Resolution Face and Frontal Gait Recognition from Surveillance Video

Electronics ◽

10.3390/electronics10091013 ◽

2021 ◽

Vol 10 (9) ◽

pp. 1013

Author(s):

Sayan Maity ◽

Mohamed Abdel-Mottaleb ◽

Shihab S. Asfour

Keyword(s):

Face Recognition ◽

High Resolution ◽

Gait Cycle ◽

Gait Recognition ◽

Recognition System ◽

Multimodal Fusion ◽

Surveillance Video ◽

Low Resolution ◽

Model Free ◽

Face Images

Biometric identification using surveillance video has attracted the attention of many researchers as it can be applicable not only for robust identification but also personalized activity monitoring. In this paper, we present a novel multimodal recognition system that extracts frontal gait and low-resolution face images from frontal walking surveillance video clips to perform efficient biometric recognition. The proposed study addresses two important issues in surveillance video that did not receive appropriate attention in the past. First, it consolidates the model-free and model-based gait feature extraction approaches to perform robust gait recognition only using the frontal view. Second, it uses a low-resolution face recognition approach which can be trained and tested using low-resolution face information. This eliminates the need for obtaining high-resolution face images to create the gallery, which is required in the majority of low-resolution face recognition techniques. Moreover, the classification accuracy on high-resolution face images is considerably higher. Previous studies on frontal gait recognition incorporate assumptions to approximate the average gait cycle. However, we quantify the gait cycle precisely for each subject using only the frontal gait information. The approaches available in the literature use the high resolution images obtained in a controlled environment to train the recognition system. However, in our proposed system we train the recognition algorithm using the low-resolution face images captured in the unconstrained environment. The proposed system has two components, one is responsible for performing frontal gait recognition and one is responsible for low-resolution face recognition. Later, score level fusion is performed to fuse the results of the frontal gait recognition and the low-resolution face recognition. Experiments conducted on the Face and Ocular Challenge Series (FOCS) dataset resulted in a 93.5% Rank-1 for frontal gait recognition and 82.92% Rank-1 for low-resolution face recognition, respectively. The score level multimodal fusion resulted in 95.9% Rank-1 recognition, which demonstrates the superiority and robustness of the proposed approach.

Download Full-text

Drill-Core Mineral Abundance Estimation Using Hyperspectral and High-Resolution Mineralogical Data

Remote Sensing ◽

10.3390/rs12071218 ◽

2020 ◽

Vol 12 (7) ◽

pp. 1218

Author(s):

Laura Tuşa ◽

Mahdi Khodadadzadeh ◽

Cecilia Contreras ◽

Kasra Rafiezadeh Shahi ◽

Margret Fuchs ◽

...

Keyword(s):

Machine Learning ◽

High Resolution ◽

Ore Deposits ◽

Machine Learning Algorithms ◽

Training Data ◽

Support Vector ◽

Drill Core ◽

Data Types ◽

Mineralogical Characterization ◽

Core Samples

Due to the extensive drilling performed every year in exploration campaigns for the discovery and evaluation of ore deposits, drill-core mapping is becoming an essential step. While valuable mineralogical information is extracted during core logging by on-site geologists, the process is time consuming and dependent on the observer and individual background. Hyperspectral short-wave infrared (SWIR) data is used in the mining industry as a tool to complement traditional logging techniques and to provide a rapid and non-invasive analytical method for mineralogical characterization. Additionally, Scanning Electron Microscopy-based image analyses using a Mineral Liberation Analyser (SEM-MLA) provide exhaustive high-resolution mineralogical maps, but can only be performed on small areas of the drill-cores. We propose to use machine learning algorithms to combine the two data types and upscale the quantitative SEM-MLA mineralogical data to drill-core scale. This way, quasi-quantitative maps over entire drill-core samples are obtained. Our upscaling approach increases result transparency and reproducibility by employing physical-based data acquisition (hyperspectral imaging) combined with mathematical models (machine learning). The procedure is tested on 5 drill-core samples with varying training data using random forests, support vector machines and neural network regression models. The obtained mineral abundance maps are further used for the extraction of mineralogical parameters such as mineral association.

Download Full-text

Adapted discriminative coupled mappings for low-resolution face recognition with one-shot

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-212454 ◽

2021 ◽

pp. 1-15

Author(s):

Yongjie Chu ◽

Touqeer Ahmad ◽

Lindu Zhao

Keyword(s):

Face Recognition ◽

Law Enforcement ◽

High Resolution ◽

Domain Adaptation ◽

Face Image ◽

Discriminative Learning ◽

Low Resolution ◽

Face Images ◽

World Environment

Low-resolution face recognition with one-shot is a prevalent problem encountered in law enforcement, where it generally requires to recognize the low-resolution face images captured by surveillance cameras with the only one high-resolution profile face image in the database. The problem is very tough because the available samples is quite few and the quality of unknown images is quite low. To effectively address this issue, this paper proposes Adapted Discriminative Coupled Mappings (AdaDCM) approach, which integrates domain adaptation and discriminative learning. To achieve good domain adaptation performance for small size dataset, a new domain adaptation technique called Bidirectional Locality Matching-based Domain Adaptation (BLM-DA) is first developed. Then the proposed AdaDCM is formulated by unifying BLM-DA and discriminative coupled mappings into a single framework. AdaDCM is extensively evaluated on FERET, LFW, and SCface databases, which includes LR face images obtained in constrained, unconstrained, and real-world environment. The promising results on these datasets demonstrate the effectiveness of AdaDCM in LR face recognition with one-shot.

Download Full-text

Characterizing the Spatial and Temporal Availability of Very High Resolution Satellite Imagery in Google Earth and Microsoft Bing Maps as a Source of Reference Data

Land ◽

10.3390/land7040118 ◽

2018 ◽

Vol 7 (4) ◽

pp. 118 ◽

Cited By ~ 18

Author(s):

Myroslava Lesiv ◽

Linda See ◽

Juan Laso Bayas ◽

Tobias Sturn ◽

Dmitry Schepaschenko ◽

...

Keyword(s):

High Resolution ◽

Satellite Imagery ◽

Urban Areas ◽

Reference Data ◽

Temporal Distribution ◽

Google Earth ◽

Training Data ◽

Visual Interpretation ◽

The Usa ◽

Very High

Very high resolution (VHR) satellite imagery from Google Earth and Microsoft Bing Maps is increasingly being used in a variety of applications from computer sciences to arts and humanities. In the field of remote sensing, one use of this imagery is to create reference data sets through visual interpretation, e.g., to complement existing training data or to aid in the validation of land-cover products. Through new applications such as Collect Earth, this imagery is also being used for monitoring purposes in the form of statistical surveys obtained through visual interpretation. However, little is known about where VHR satellite imagery exists globally or the dates of the imagery. Here we present a global overview of the spatial and temporal distribution of VHR satellite imagery in Google Earth and Microsoft Bing Maps. The results show an uneven availability globally, with biases in certain areas such as the USA, Europe and India, and with clear discontinuities at political borders. We also show that the availability of VHR imagery is currently not adequate for monitoring protected areas and deforestation, but is better suited for monitoring changes in cropland or urban areas using visual interpretation.

Download Full-text

Efficient Patch-Wise Semantic Segmentation for Large-Scale Remote Sensing Images

Sensors ◽

10.3390/s18103232 ◽

2018 ◽

Vol 18 (10) ◽

pp. 3232 ◽

Cited By ~ 17

Author(s):

Yan Liu ◽

Qirui Ren ◽

Jiahui Geng ◽

Meng Ding ◽

Jiangyun Li

Keyword(s):

Remote Sensing ◽

High Resolution ◽

Large Scale ◽

Semantic Segmentation ◽

Remote Sensing Image ◽

Training Data ◽

Land Resources ◽

Remote Sensing Images ◽

Training Strategy ◽

The Impact

Efficient and accurate semantic segmentation is the key technique for automatic remote sensing image analysis. While there have been many segmentation methods based on traditional hand-craft feature extractors, it is still challenging to process high-resolution and large-scale remote sensing images. In this work, a novel patch-wise semantic segmentation method with a new training strategy based on fully convolutional networks is presented to segment common land resources. First, to handle the high-resolution image, the images are split as local patches and then a patch-wise network is built. Second, training data is preprocessed in several ways to meet the specific characteristics of remote sensing images, i.e., color imbalance, object rotation variations and lens distortion. Third, a multi-scale training strategy is developed to solve the severe scale variation problem. In addition, the impact of conditional random field (CRF) is studied to improve the precision. The proposed method was evaluated on a dataset collected from a capital city in West China with the Gaofen-2 satellite. The dataset contains ten common land resources (Grassland, Road, etc.). The experimental results show that the proposed algorithm achieves 54.96% in terms of mean intersection over union (MIoU) and outperforms other state-of-the-art methods in remote sensing image segmentation.

Download Full-text

High Resolution Ancestry Deconvolution for Next Generation Genomic Data

10.1101/2021.09.19.460980 ◽

2021 ◽

Author(s):

Helgi Hilmarsson ◽

Arvind S. Kumar ◽

Richa Rastogi ◽

Carlos D. Bustamante ◽

Daniel Mas Montserrat ◽

...

Keyword(s):

High Resolution ◽

Prediction Models ◽

Association Studies ◽

Training Data ◽

Genome Wide Association Studies ◽

Whole Genome ◽

Next Generation ◽

Genome Data ◽

Computational Performance ◽

Genetic Risk Prediction

ABSTRACTAs genome-wide association studies and genetic risk prediction models are extended to globally diverse and admixed cohorts, ancestry deconvolution has become an increasingly important tool. Also known as local ancestry inference (LAI), this technique identifies the ancestry of each region of an individual’s genome, thus permitting downstream analyses to account for genetic effects that vary between ancestries. Since existing LAI methods were developed before the rise of massive, whole genome biobanks, they are computationally burdened by these large next generation datasets. Current LAI algorithms also fail to harness the potential of whole genome sequences, falling well short of the accuracy that such high variant densities can enable. Here we introduce Gnomix, a set of algorithms that address each of these points, achieving higher accuracy and swifter computational performance than any existing LAI method, while also enabling portable models that are particularly useful when training data are not shareable due to privacy or other restrictions. We demonstrate Gnomix (and its swift phase correction counterpart Gnofix) on worldwide whole-genome data from both humans and canids and utilize its high resolution accuracy to identify the location of ancient New World haplotypes in the Xoloitzcuintle, dating back over 100 generations. Code is available at https://github.com/AI-sandbox/gnomix.

Download Full-text

Super-Resolution Based on Generative Adversarial Network for HRTEM Images

International Journal of Pattern Recognition and Artificial Intelligence ◽

10.1142/s0218001421540276 ◽

2021 ◽

pp. 2154027

Author(s):

Fuqi Mao ◽

Xiaohan Guan ◽

Ruoyu Wang ◽

Wen Yue

Keyword(s):

Deep Learning ◽

High Resolution ◽

Information Structure ◽

Rapid Development ◽

Super Resolution ◽

Training Data ◽

Learning Technology ◽

Hrtem Image ◽

Generative Adversarial Network ◽

Dual Regression

As an important tool to study the microstructure and properties of materials, High Resolution Transmission Electron Microscope (HRTEM) images can obtain the lattice fringe image (reflecting the crystal plane spacing information), structure image and individual atom image (which reflects the configuration of atoms or atomic groups in crystal structure). Despite the rapid development of HTTEM devices, HRTEM images still have limited achievable resolution for human visual system. With the rapid development of deep learning technology in recent years, researchers are actively exploring the Super-resolution (SR) model based on deep learning, and the model has reached the current best level in various SR benchmarks. Using SR to reconstruct high-resolution HRTEM image is helpful to the material science research. However, there is one core issue that has not been resolved: most of these super-resolution methods require the training data to exist in pairs. In actual scenarios, especially for HRTEM images, there are no corresponding HR images. To reconstruct high quality HRTEM image, a novel Super-Resolution architecture for HRTEM images is proposed in this paper. Borrowing the idea from Dual Regression Networks (DRN), we introduce an additional dual regression structure to ESRGAN, by training the model with unpaired HRTEM images and paired nature images. Results of extensive benchmark experiments demonstrate that the proposed method achieves better performance than the most resent SISR methods with both quantitative and visual results.

Download Full-text

Collecting training data to map forest management at global scale

10.5194/egusphere-egu21-15297 ◽

2021 ◽

Author(s):

Myroslava Lesiv ◽

Dmitry Schepaschenko ◽

Martina Dürauer ◽

Marcel Buchhorn ◽

Ivelina Georgieva ◽

...

Keyword(s):

Forest Management ◽

High Resolution ◽

Remotely Sensed ◽

Global Scale ◽

Google Earth ◽

Training Data ◽

Tree Cover ◽

Data Set ◽

The World ◽

Very High

Spatially explicit information on forest management at a global scale is critical for understanding the current status of forests for sustainable forest management and restoration. Whereas remotely sensed based datasets, developed by applying ML and AI algorithms, can successfully depict tree cover and other land cover types, it has not yet been used to depict untouched forest and different degrees of forest management. We show for the first time that with sufficient training data derived from very high-resolution imagery a differentiation within the tree cover class of various levels of forest management is possible.In this session, we would like to present our approach for labeling forest related training data by using Geo-Wiki application (https://www.geo-wiki.org/). Moreover, we would like to share a new open global training data set on forest management we collected from a series of Geo-Wiki campaigns. In February 2019, we organized an expert workshop to (1) discuss the variety of forest management practices that take place in different parts of the world; (2) generalize the definitions for the application at global scale; (3) finalize the Geo-Wiki interface for the crowdsourcing campaigns; and (4) build a data set of control points (or the expert data set), which we used later to monitor the quality of the crowdsourced contributions by the volunteers. We involved forest experts from different regions around the world to explore what types of forest management information could be collected from visual interpretation of very high-resolution images from Google Maps and Microsoft Bing, in combination with Sentinel time series and Normalized Difference Vegetation Index (NDVI) profiles derived from Google Earth Engine (GEE). Based on the results of this analysis, we expanded these campaigns by involving a broader group of participants, mainly people recruited from remote sensing, geography and forest research institutes and universities.In total, we collected forest data for approximately 230 000 locations globally. These data are of sufficient density and quality and therefore could be used in many ML and AI applications for forests at regional and local scale.&#160; We also provide an example of ML application, a remotely sensed based global forest management map at a 100 m resolution (PROBA-V) for the year 2015. It includes such classes as intact forests, forests with signs of human impact, including clear cuts and logging, replanted forest, woody plantations with a rotation period up to 15 years, oil palms and agroforestry. The results of independent statistical validation show that the map&#8217;s overall accuracy is 81%.

Download Full-text

Research on an Urban Building Area Extraction Method with High-Resolution PolSAR Imaging Based on Adaptive Neighborhood Selection Neighborhoods for Preserving Embedding

ISPRS International Journal of Geo-Information ◽

10.3390/ijgi9020109 ◽

2020 ◽

Vol 9 (2) ◽

pp. 109 ◽

Cited By ~ 1

Author(s):

Bo Cheng ◽

Shiai Cui ◽

Xiaoxiao Ma ◽

Chenbin Liang

Keyword(s):

High Resolution ◽

Extraction Method ◽

Texture Features ◽

Decomposition Methods ◽

Projection Matrix ◽

Training Data ◽

Support Vector ◽

Svm Classifier ◽

Building Area ◽

Adaptive Neighborhood

Feature extraction of an urban area is one of the most important directions of polarimetric synthetic aperture radar (PolSAR) applications. A high-resolution PolSAR image has the characteristics of high dimensions and nonlinearity. Therefore, to find intrinsic features for target recognition, a building area extraction method for PolSAR images based on the Adaptive Neighborhoods selection Neighborhood Preserving Embedding (ANSNPE) algorithm is proposed. First, 52 features are extracted by using the Gray level co-occurrence matrix (GLCM) and five polarization decomposition methods. The feature set is divided into 20 dimensions, 36 dimensions, and 52 dimensions. Next, the ANSNPE algorithm is applied to the training samples, and the projection matrix is obtained for the test image to extract the new features. Lastly, the Support Vector machine (SVM) classifier and post processing are used to extract the building area, and the accuracy is evaluated. Comparative experiments are conducted using Radarsat-2, and the results show that the ANSNPE algorithm could effectively extract the building area and that it had a better generalization ability; the projection matrix is obtained using the training data and could be directly applied to the new sample, and the building area extraction accuracy is above 80%. The combination of polarization and texture features provide a wealth of information that is more conducive to the extraction of building areas.

Download Full-text