render the input image using the mood of the reference image red boxes on the wall are tv screen green box is electronic white board in dark mode blue long box is worldview timezone digital clock photorealistic