首页>
外国专利>
Video retrieval based on encoding temporal relationships among video frames
Video retrieval based on encoding temporal relationships among video frames
展开▼
机译:基于视频帧中的编码时间关系的视频检索
展开▼
页面导航
摘要
著录项
相似文献
摘要
Systems and methods for content-based video retrieval are described. The systems and methods may break a video into multiple frames, generate a feature vector from the frames based on the temporal relationship between them, and then embed the feature vector into a vector space along with a vector representing a search query. In some embodiments, the video feature vector is converted into a text caption prior to the embedding. In other embodiments, the video feature vector and a sentence vector are each embedded into a common space using a join video sentence embedding model. Once the video and the search query are embedded into a common vector space, a distance between them may be calculated. After calculating the distance between the search query and set of videos, the distances may be used to select a subset of the videos to present as the result of the search.
展开▼