首页>
外国专利>
DATA SET CREATION WITH CROWD-BASED REINFORCEMENT
DATA SET CREATION WITH CROWD-BASED REINFORCEMENT
展开▼
机译:数据集创建与基于人群的加强
展开▼
页面导航
摘要
著录项
相似文献
摘要
A system and method for creation and expansion of high quality data set collections for training of machine learning algorithms via crowdsourced curation that utilizes a data marketplace which incentivizes data gatherers, publishers, and users to contribute to the creation of a vast resource of reliable data set collections. Data is automatically ingested from disparate sources and autonomously checked for data quality, provenance, and cyber-risks and subsequently given a reputation score. Data stewards curate a queue of low scoring real data as well as synthetically generated data. All reputable data is stored for user consumption and further iterative data generation.
展开▼
机译:用于创建和扩展高质量数据集集合的系统和方法,用于通过众包策策培训机器学习算法,该策划利用数据市场,这些策序激励数据收集器,发布者和用户为创建庞大资源的可靠数据集收藏品。数据从不同的来源自动摄取,并自主检查数据质量,出处和网络风险,随后给出了声誉分数。 Data Stewards策划低评分实际数据以及合成生成的数据的队列。所有信誉象败的数据都存储用于用户消费和进一步的迭代数据生成。
展开▼