首页>
外国专利>
PRIVACY PRESERVATION IN A QUERYABLE DATABASE BUILT FROM UNSTRUCTURED TEXTS
PRIVACY PRESERVATION IN A QUERYABLE DATABASE BUILT FROM UNSTRUCTURED TEXTS
展开▼
机译:从非结构化文本构建的查询数据库中的隐私保存
展开▼
页面导航
摘要
著录项
相似文献
摘要
A computer-implemented method of generating a queryable database (109). The method receives a corpus of free text documents (120) containing confidential data, the free text documents being related to the same domain. A trained Natural Language Processing (NLP) system (104) assigns one or more abstract named entities to each free text document in the corpus. The abstract named entities of each free text document are stored in a queryable database configured to provide aggregated information regarding the named entities. The NLP system is configured such that the abstract named entities are recognised and disambiguated with a precision between 0.75 and less than 1 and a recall between 0.75 and less than 1, and such that the ratio of precision and recall is between 0.7 and 1.3; wherein the queryable database is free from the addition of artificial noise by an artificial noise generation algorithm.
展开▼