In one embodiment, a method includes receiving a query; generating a query embedding representing the query corresponding to a point in an n-dimensional embedding space; identifying one or more image objects matching the query; accessing, for each of the identified image objects, an image embedding representing the image object corresponding to a point in an m-dimensional embedding space; transforming, using a relevance model, the query embedding and each of the image embeddings into a joint p-dimensional embedding space; calculating, for each identified image object, a relevance-score based on a similarity metric between the transformed query embedding and the transformed image embedding; generating search results based on the calculated relevance-scores; and sending, to the client system in response to the query, instructions for presenting a search-results interface to the user, wherein the search-results interface includes search results referencing identified image objects presented in ranked order based on the respective relevance-scores.
展开▼