A computer-implemented method for attention generation is provided. In this method, a plurality of image frames can be obtained from a video stream. An original attention for a first image frame of the plurality of image frames can be generated. Then, at least one interested area can be identified in the first image frame. A local attention for each of the at least one interested area can be generated. Moreover, a total attention for the first image frame can be generated based on the original attention of the first image frame and the local attention of each of the at least one interested area.
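The steps above can be illustrated with a minimal sketch. The abstract does not say how any attention is computed or how the original and local attentions are combined, so everything here is an assumption made for illustration: `contrast_attention` is a hypothetical stand-in for an attention generator, the interested areas are assumed to be supplied as `(top, left, height, width)` boxes lying inside the frame, and the weighted sum controlled by `local_weight` is just one possible way to combine the maps.

```python
from typing import List, Sequence, Tuple

Grid = List[List[float]]
Box = Tuple[int, int, int, int]  # (top, left, height, width), assumed to lie inside the frame


def normalize(grid: Grid) -> Grid:
    """Scale a non-negative grid so its largest value is 1 (all-zero grids are copied as-is)."""
    peak = max(max(row) for row in grid)
    if peak <= 0:
        return [row[:] for row in grid]
    return [[value / peak for value in row] for row in grid]


def contrast_attention(pixels: Grid) -> Grid:
    """Hypothetical attention generator: absolute deviation from the mean intensity.

    This is only a placeholder; the source does not specify the attention model.
    """
    count = sum(len(row) for row in pixels)
    mean = sum(sum(row) for row in pixels) / count
    return normalize([[abs(value - mean) for value in row] for row in pixels])


def total_attention(frame: Grid, areas: Sequence[Box], local_weight: float = 0.5) -> Grid:
    """Combine the original attention of a frame with a local attention per interested area.

    Assumed combination rule: add each weighted local map onto the original map
    at the position of its area, then rescale the result to [0, 1].
    """
    original = contrast_attention(frame)          # original attention for the whole frame
    total = [row[:] for row in original]
    for top, left, height, width in areas:
        # Local attention is computed on the cropped area only.
        crop = [row[left:left + width] for row in frame[top:top + height]]
        local = contrast_attention(crop)
        for r, local_row in enumerate(local):
            for c, value in enumerate(local_row):
                total[top + r][left + c] += local_weight * value
    return normalize(total)


# Usage: a 4x4 grayscale frame with one bright pixel and one interested area.
frame = [[0.0] * 4 for _ in range(4)]
frame[1][1] = 1.0
result = total_attention(frame, areas=[(0, 0, 2, 2)])
```

In this sketch, cells inside an interested area receive attention from both the frame-level map and the local map, so they end up weighted more heavily than comparable cells outside any area. Frames from a video stream would be processed one at a time by calling `total_attention` on each.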