CD-GAN: Commonsense-Driven Generative Adversarial Network with Hierarchical Refinement for Text-to-Image Synthesis
Synthesizing vivid images with descriptive texts is gradually emerging as a frontier cross-domain generation task.However, it is obviously inadequate to generate the high-quality image with one single sentence accurately due to the information asymmetry between modalities, which needs external knowledge to balance the process.Moreover, the limited