Stanislav Frolov(Wissensbasierte Systeme (Prof. Andreas Dengel))
hosted by PhD Program in CS @ TU KL
With the advent of generative adversarial networks, synthesising images has recently become an active research area. Given an input text description, Text-to-Image (T2I) is the task is to generate an image that correctly reflects the meaning of that description. It is a flexible and very intuitive way for conditional image synthesis. Although significant progress has been achieved in the last few years, generating images with multiple interacting objects is still very difficult. In this talk I will introduce the basic architecture of a T2I model, discuss challenges, and present a way to improve T2I models by leveraging Visual Question Answering.
|Time:||Monday, 14.12.2020, 15:45|