METHOD FOR CHINESE CONCEPT EMBEDDING GENERATION BASED ON WIKIPEDIA LINK STRUCTURE
Abstract
The present invention discloses a method and a device for generating Chinese concept embeddings based on the Wikipedia link structure. The method includes: Step (1): a link information database is constructed from the title concepts and/or link concepts in Chinese Wikipedia pages; Step (2): for each title concept, positive and negative training instances are constructed according to its link relationships with the link concepts in the link information database, and together these instances constitute the training dataset; Step (3): a concept embedding model is built, comprising an input layer, an embedding layer, a computational operation layer, and an output layer; Step (4): the concept embedding model is trained on the training dataset, and the Chinese concept embeddings are then extracted from the trained model. The present invention can accurately distinguish different concepts and overcomes the polysemy problem that troubles traditional embedding methods, which helps generate more accurate concept embedding representations.
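The four steps can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not specify how negative instances are chosen or what the computational operation layer computes, so the uniform negative sampling, the dot-product operation, the sigmoid output, and every name below (`LINK_DB`, `build_instances`, `train`) are assumptions made for illustration only. The toy link database is likewise invented.

```python
import math
import random

# Step (1), hypothetical toy link database:
# title concept -> set of link concepts found on its page.
LINK_DB = {
    "苹果公司": {"iPhone", "史蒂夫·乔布斯"},
    "苹果": {"水果", "蔷薇科"},
    "iPhone": {"苹果公司"},
}


def build_instances(link_db, negatives_per_positive=1, seed=0):
    """Step (2): build (title, concept, label) triples.

    Label 1 marks a linked pair; label 0 marks a pair sampled uniformly
    from concepts the title does not link to (an assumed strategy).
    """
    rng = random.Random(seed)
    vocab = sorted(set(link_db) | {c for links in link_db.values() for c in links})
    instances = []
    for title in sorted(link_db):
        links = link_db[title]
        candidates = [c for c in vocab if c not in links and c != title]
        for concept in sorted(links):
            instances.append((title, concept, 1))
            k = min(negatives_per_positive, len(candidates))
            for negative in rng.sample(candidates, k):
                instances.append((title, negative, 0))
    return vocab, instances


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def train(vocab, instances, dim=8, epochs=200, lr=0.1, seed=0):
    """Steps (3) and (4): train a tiny model and return the embedding table.

    Input layer: a pair of concept names. Embedding layer: table lookup.
    Operation layer: dot product (assumed). Output layer: sigmoid.
    """
    rng = random.Random(seed)
    emb = {c: [rng.uniform(-0.1, 0.1) for _ in range(dim)] for c in vocab}
    for _ in range(epochs):
        for a, b, label in instances:
            va, vb = emb[a], emb[b]
            score = sigmoid(sum(x * y for x, y in zip(va, vb)))
            grad = score - label  # gradient of log loss w.r.t. the dot product
            for i in range(dim):
                ga, gb = grad * vb[i], grad * va[i]
                va[i] -= lr * ga
                vb[i] -= lr * gb
    return emb


if __name__ == "__main__":
    vocab, instances = build_instances(LINK_DB)
    embeddings = train(vocab, instances)
    print(len(instances), "instances;", len(embeddings), "concept vectors")
```

In this sketch "苹果公司" (Apple Inc.) and "苹果" (apple, the fruit) are separate keys with separate vectors, which is the sense in which embedding concepts rather than surface words sidesteps polysemy.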