Disclosed are a feature extraction model training method and apparatus, a computer device, and a computer-readable storage medium, relating to the technical field of video processing. The method comprises: detecting a plurality of images in one or more sample videos to acquire at least two images containing the same object; determining the at least two images containing the same object as sample images; and training on the basis of the determined sample images to obtain a feature extraction model, the feature extraction model being used for extracting video features of a video.
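
The sample-selection step described above can be illustrated with a minimal Python sketch. It is not the claimed implementation. It assumes that an object detector (not shown) has already produced records of the form (video id, frame index, object id), and that object identity is tracked per video; the abstract does not state either point. The names `select_sample_images` and `positive_pairs`, and the idea of forming image pairs for training, are hypothetical and introduced only for illustration.

```python
from collections import defaultdict
from itertools import combinations


def select_sample_images(detections, min_images=2):
    """Group frames by detected object; keep objects seen in >= min_images frames.

    `detections` is an iterable of (video_id, frame_index, object_id) tuples,
    a hypothetical detector output format assumed for this sketch.
    """
    frames_by_object = defaultdict(list)
    for video_id, frame_index, object_id in detections:
        key = (video_id, object_id)      # assumption: identity is per video
        frame = (video_id, frame_index)
        if frame not in frames_by_object[key]:
            frames_by_object[key].append(frame)
    # Only objects appearing in at least two images yield sample images.
    return {k: v for k, v in frames_by_object.items() if len(v) >= min_images}


def positive_pairs(samples):
    """Form every pair of frames that show the same object (one possible training input)."""
    pairs = []
    for frames in samples.values():
        pairs.extend(combinations(frames, 2))
    return pairs


# Toy detections from two sample videos (fabricated purely for illustration).
detections = [
    ("v1", 0, "car"),
    ("v1", 5, "car"),
    ("v1", 9, "dog"),   # seen only once, so it yields no sample images
    ("v2", 2, "car"),
    ("v2", 7, "car"),
    ("v2", 8, "car"),
]

samples = select_sample_images(detections)
pairs = positive_pairs(samples)
```

In this sketch, the selected frames would then be passed to whatever training procedure is used to obtain the feature extraction model; the abstract does not specify the loss function or the network architecture, so the training step is omitted.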