Co-occurrence based texture synthesis

Anna Darzi; Itai Lang; Ashutosh Taklikar; Hadar Averbuch-Elor; Shai Avidan

doi:10.1007/s41095-021-0243-7

Co-occurrence based texture synthesis

Computational Visual Media ◽

10.1007/s41095-021-0243-7 ◽

2021 ◽

Vol 8 (2) ◽

pp. 289-302

Author(s):

Anna Darzi ◽

Itai Lang ◽

Ashutosh Taklikar ◽

Hadar Averbuch-Elor ◽

Shai Avidan

Keyword(s):

Texture Analysis ◽

Texture Synthesis ◽

Image Generation ◽

Generative Adversarial Network ◽

Local Characteristics ◽

Adversarial Network ◽

Input Condition ◽

End To End ◽

Smooth Texture

AbstractAs image generation techniques mature, there is a growing interest in explainable representations that are easy to understand and intuitive to manipulate. In this work, we turn to co-occurrence statistics, which have long been used for texture analysis, to learn a controllable texture synthesis model. We propose a fully convolutional generative adversarial network, conditioned locally on co-occurrence statistics, to generate arbitrarily large images while having local, interpretable control over texture appearance. To encourage fidelity to the input condition, we introduce a novel differentiable co-occurrence loss that is integrated seamlessly into our framework in an end-to-end fashion. We demonstrate that our solution offers a stable, intuitive, and interpretable latent representation for texture synthesis, which can be used to generate smooth texture morphs between different textures. We further show an interactive texture tool that allows a user to adjust local characteristics of the synthesized texture by directly using the co-occurrence values.

Download Full-text

CDL-GAN: Contrastive Distance Learning Generative Adversarial Network for Image Generation

Applied Sciences ◽

10.3390/app11041380 ◽

2021 ◽

Vol 11 (4) ◽

pp. 1380

Author(s):

Yingbo Zhou ◽

Pengcheng Zhao ◽

Weiqin Tong ◽

Yongxin Zhu

Keyword(s):

Distance Learning ◽

Feature Learning ◽

Image Synthesis ◽

Image Feature ◽

Generative Adversarial Networks ◽

Image Generation ◽

Generative Adversarial Network ◽

Feature Representations ◽

Adversarial Network ◽

Public Datasets

While Generative Adversarial Networks (GANs) have shown promising performance in image generation, they suffer from numerous issues such as mode collapse and training instability. To stabilize GAN training and improve image synthesis quality with diversity, we propose a simple yet effective approach as Contrastive Distance Learning GAN (CDL-GAN) in this paper. Specifically, we add Consistent Contrastive Distance (CoCD) and Characteristic Contrastive Distance (ChCD) into a principled framework to improve GAN performance. The CoCD explicitly maximizes the ratio of the distance between generated images and the increment between noise vectors to strengthen image feature learning for the generator. The ChCD measures the sampling distance of the encoded images in Euler space to boost feature representations for the discriminator. We model the framework by employing Siamese Network as a module into GANs without any modification on the backbone. Both qualitative and quantitative experiments conducted on three public datasets demonstrate the effectiveness of our method.

Download Full-text

Does Generative Adversarial Network (GAN) help in SRAF image generation?

10.1109/iwaps54037.2021.9671262 ◽

2021 ◽

Author(s):

Jialu Huang ◽

Ying Huang ◽

Yan-ting Lin ◽

Zi-yang Liu ◽

Yang Lin ◽

...

Keyword(s):

Image Generation ◽

Generative Adversarial Network ◽

Adversarial Network

Download Full-text

End-to-End Medical Image Denoising via Cycle-consistent Generative Adversarial Network

10.1109/ispds54097.2021.00012 ◽

2021 ◽

Author(s):

Chenggeng Yan ◽

Hu Chen ◽

Zhao Yang

Keyword(s):

Image Denoising ◽

Medical Image ◽

Generative Adversarial Network ◽

Adversarial Network ◽

End To End

Download Full-text

Presentation Attack Face Image Generation Based on a Deep Generative Adversarial Network

Sensors ◽

10.3390/s20071810 ◽

2020 ◽

Vol 20 (7) ◽

pp. 1810

Author(s):

Dat Tien Nguyen ◽

Tuyen Danh Pham ◽

Ganbayar Batchuluun ◽

Kyoung Jun Noh ◽

Kang Ryoung Park

Keyword(s):

Recognition Task ◽

Recognition System ◽

Attack Detection ◽

Image Generation ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Face Images ◽

Problem Presentation ◽

Recognition Systems ◽

Public Datasets

Although face-based biometric recognition systems have been widely used in many applications, this type of recognition method is still vulnerable to presentation attacks, which use fake samples to deceive the recognition system. To overcome this problem, presentation attack detection (PAD) methods for face recognition systems (face-PAD), which aim to classify real and presentation attack face images before performing a recognition task, have been developed. However, the performance of PAD systems is limited and biased due to the lack of presentation attack images for training PAD systems. In this paper, we propose a method for artificially generating presentation attack face images by learning the characteristics of real and presentation attack images using a few captured images. As a result, our proposed method helps save time in collecting presentation attack samples for training PAD systems and possibly enhance the performance of PAD systems. Our study is the first attempt to generate PA face images for PAD system based on CycleGAN network, a deep-learning-based framework for image generation. In addition, we propose a new measurement method to evaluate the quality of generated PA images based on a face-PAD system. Through experiments with two public datasets (CASIA and Replay-mobile), we show that the generated face images can capture the characteristics of presentation attack images, making them usable as captured presentation attack samples for PAD system training.

Download Full-text

Exocentric to Egocentric Image Generation Via Parallel Generative Adversarial Network

ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ◽

10.1109/icassp40776.2020.9053957 ◽

2020 ◽

Cited By ~ 3

Author(s):

Gaowen Liu ◽

Hao Tang ◽

Hugo Latapie ◽

Yan Yan

Keyword(s):

Image Generation ◽

Generative Adversarial Network ◽

Adversarial Network

Download Full-text

A Transfer Deep Generative Adversarial Network Model to Synthetic Brain CT Generation from MR Images

Wireless Communications and Mobile Computing ◽

10.1155/2021/9979606 ◽

2021 ◽

Vol 2021 ◽

pp. 1-10

Author(s):

Yi Gu ◽

Qiankun Zheng

Keyword(s):

Transfer Learning ◽

Network Model ◽

Medical Images ◽

Image Method ◽

Data Sets ◽

Generation Process ◽

Image Generation ◽

Generative Adversarial Network ◽

Model Based ◽

Adversarial Network

Background. The generation of medical images is to convert the existing medical images into one or more required medical images to reduce the time required for sample diagnosis and the radiation to the human body from multiple medical images taken. Therefore, the research on the generation of medical images has important clinical significance. At present, there are many methods in this field. For example, in the image generation process based on the fuzzy C-means (FCM) clustering method, due to the unique clustering idea of FCM, the images generated by this method are uncertain of the attribution of certain organizations. This will cause the details of the image to be unclear, and the resulting image quality is not high. With the development of the generative adversarial network (GAN) model, many improved methods based on the deep GAN model were born. Pix2Pix is a GAN model based on UNet. The core idea of this method is to use paired two types of medical images for deep neural network fitting, thereby generating high-quality images. The disadvantage is that the requirements for data are very strict, and the two types of medical images must be paired one by one. DualGAN model is a network model based on transfer learning. The model cuts the 3D image into multiple 2D slices, simulates each slice, and merges the generated results. The disadvantage is that every time an image is generated, bar-shaped “shadows” will be generated in the three-dimensional image. Method/Material. To solve the above problems and ensure the quality of image generation, this paper proposes a Dual3D&PatchGAN model based on transfer learning. Since Dual3D&PatchGAN is set based on transfer learning, there is no need for one-to-one paired data sets, only two types of medical image data sets are needed, which has important practical significance for applications. This model can eliminate the bar-shaped “shadows” produced by DualGAN’s generated images and can also perform two-way conversion of the two types of images. Results. From the multiple evaluation indicators of the experimental results, it can be analyzed that Dual3D&PatchGAN is more suitable for the generation of medical images than other models, and its generation effect is better.

Download Full-text

EnsNet: Ensconce Text in the Wild

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.3301801 ◽

2019 ◽

Vol 33 ◽

pp. 801-808 ◽

Cited By ~ 5

Author(s):

Shuaitao Zhang ◽

Yuliang Liu ◽

Lianwen Jin ◽

Yaoxiong Huang ◽

Songxuan Lai

Keyword(s):

Generative Adversarial Network ◽

Local Consistency ◽

Adversarial Network ◽

Image Patches ◽

Scene Text ◽

In The Wild ◽

Lateral Connection ◽

Previous State ◽

End To End ◽

General Object

A new method is proposed for removing text from natural images. The challenge is to first accurately localize text on the stroke-level and then replace it with a visually plausible background. Unlike previous methods that require image patches to erase scene text, our method, namely ensconce network (EnsNet), can operate end-to-end on a single image without any prior knowledge. The overall structure is an end-to-end trainable FCN-ResNet-18 network with a conditional generative adversarial network (cGAN). The feature of the former is first enhanced by a novel lateral connection structure and then refined by four carefully designed losses: multiscale regression loss and content loss, which capture the global discrepancy of different level features; texture loss and total variation loss, which primarily target filling the text region and preserving the reality of the background. The latter is a novel local-sensitive GAN, which attentively assesses the local consistency of the text erased regions. Both qualitative and quantitative sensitivity experiments on synthetic images and the ICDAR 2013 dataset demonstrate that each component of the EnsNet is essential to achieve a good performance. Moreover, our EnsNet can significantly outperform previous state-of-the-art methods in terms of all metrics. In addition, a qualitative experiment conducted on the SBMNet dataset further demonstrates that the proposed method can also preform well on general object (such as pedestrians) removal tasks. EnsNet is extremely fast, which can preform at 333 fps on an i5-8600 CPU device.

Download Full-text

Asymmetric Encoder-Decoder Structured FCN Based LiDAR to Color Image Generation

Sensors ◽

10.3390/s19214818 ◽

2019 ◽

Vol 19 (21) ◽

pp. 4818 ◽

Cited By ~ 2

Author(s):

Hyun-Koo Kim ◽

Kook-Yeol Yoo ◽

Ju H. Park ◽

Ho-Youl Jung

Keyword(s):

Color Image ◽

Ground Truth ◽

Sensor Data ◽

Image Generation ◽

Generative Adversarial Network ◽

Convolutional Network ◽

Adversarial Network ◽

Ground Truth Image ◽

And Performance ◽

Reflection Intensity

In this paper, we propose a method of generating a color image from light detection and ranging (LiDAR) 3D reflection intensity. The proposed method is composed of two steps: projection of LiDAR 3D reflection intensity into 2D intensity, and color image generation from the projected intensity by using a fully convolutional network (FCN). The color image should be generated from a very sparse projected intensity image. For this reason, the FCN is designed to have an asymmetric network structure, i.e., the layer depth of the decoder in the FCN is deeper than that of the encoder. The well-known KITTI dataset for various scenarios is used for the proposed FCN training and performance evaluation. Performance of the asymmetric network structures are empirically analyzed for various depth combinations for the encoder and decoder. Through simulations, it is shown that the proposed method generates fairly good visual quality of images while maintaining almost the same color as the ground truth image. Moreover, the proposed FCN has much higher performance than conventional interpolation methods and generative adversarial network based Pix2Pix. One interesting result is that the proposed FCN produces shadow-free and daylight color images. This result is caused by the fact that the LiDAR sensor data is produced by the light reflection and is, therefore, not affected by sunlight and shadow.

Download Full-text

Attribute-guided image generation of three-dimensional computed tomography images of lung nodules using a generative adversarial network

Computers in Biology and Medicine ◽

10.1016/j.compbiomed.2020.104032 ◽

2020 ◽

Vol 126 ◽

pp. 104032

Author(s):

Mizuho Nishio ◽

Chisako Muramatsu ◽

Shunjiro Noguchi ◽

Hirotsugu Nakai ◽

Koji Fujimoto ◽

...

Keyword(s):

Computed Tomography ◽

Three Dimensional ◽

Lung Nodules ◽

Image Generation ◽

Generative Adversarial Network ◽

Adversarial Network ◽

Computed Tomography Images

Download Full-text

SAR Image Generation Using Structural Bayesian Deep Generative Adversarial Network

2019 Photonics & Electromagnetics Research Symposium - Fall (PIERS - Fall) ◽

10.1109/piers-fall48861.2019.9021403 ◽

2019 ◽

Author(s):

Jia Zhai ◽

Xunwang Dang ◽

Feng Chen ◽

Xiaodan Xie ◽

Yong Zhu ◽

...

Keyword(s):

Image Generation ◽

Sar Image ◽

Generative Adversarial Network ◽

Adversarial Network

Download Full-text