As a large number of video-based applications become widely available, there is a need to extract the semantic data in videos. Manual techniques, which are inefficient, subjective, and costly in time, limit querying capabilities and bridge gap between