In implementations of image composition instruction based on reference image perspective, a template image of a desired object is captured. Reference images including a similar or same object as the desired object are obtained. Directions from a first location of where a template image was captured to a second location where a selected reference image was captured are obtained and exposed in a user interface to allow a user to move to the second location. At the second location, interactive instructions are generated based on a live video stream of the desired to move a camera capturing the video stream until a composition of the video stream is aligned with a composition of the reference image. The camera is configured with settings based on the reference image to capture a composed image having a same composition as the reference image.
展开▼