Certain embodiments involve determining visual context associated with user behavior and associating the visual context with the user behavior. For example, a system captures a portion of a user interface provided to a user at a time of a user action on the user interface. The captured portion can include digital content and the system can detect the digital content. The system can also generate a digital representation of the digital content. The digital representation can indicate the digital and exclude the digital content. The system can determine a visual context associated with the user action based on the generated digital representation. The visual context describes the digital content displayed via the user interface at the time of the user action. The system can also determine subsequent digital content to output to the user to create a subsequent visual context to encourage a particular user behavior.
展开▼