We synthesize total-physique human photos starting from a provided human pose with two dedicated actions. 1) With some texts describing the shapes of garments, the supplied human pose is to start with translated to your human parsing map. 2) The ultimate human picture is then generated by furnishing the procedure https://www.humanizingaitext.com/