A method implemented by a client device includes accessing a plurality of image frames captured by one or more cameras of the client device and generating a working image frame based at least in part on one or more of the plurality of image frames. The method further includes classifying one or more first objects detected in the working image frame based at least in part on a determined desirability of the one or more first objects. The one or more first objects are determined to be undesirable. The method further includes applying a pixel filtering process to the working image frame to replace one or more first pixel sets associated with the first objects with pixels from one or more image frames of the plurality of image frames to generate a final image frame, and displaying the final image frame on a display of the client device.
展开▼