首页> 美国卫生研究院文献>Frontiers in Chemistry >e-Sweet: A Machine-Learning Based Platform for the Prediction of Sweetener and Its Relative Sweetness
【2h】

e-Sweet: A Machine-Learning Based Platform for the Prediction of Sweetener and Its Relative Sweetness

机译:e-Sweet:基于机器学习的甜味剂及其相对甜度预测平台

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Artificial sweeteners (AS) can elicit the strong sweet sensation with the low or zero calorie, and are widely used to replace the nutritive sugar in the food and beverage industry. However, the safety issue of current AS is still controversial. Thus, it is imperative to develop more safe and potent AS. Due to the costly and laborious experimental-screening of AS, in-silico sweetener/sweetness prediction could provide a good avenue to identify the potential sweetener candidates before experiment. In this work, we curate the largest dataset of 530 sweeteners and 850 non-sweeteners, and collect the second largest dataset of 352 sweeteners with the relative sweetness (RS) from the literature. In light of these experimental datasets, we adopt five machine-learning methods and conformational-independent molecular fingerprints to derive the classification and regression models for the prediction of sweetener and its RS, respectively via the consensus strategy. Our best classification model achieves the 95% confidence intervals for the accuracy (0.91 ± 0.01), precision (0.90 ± 0.01), specificity (0.94 ± 0.01), sensitivity (0.86 ± 0.01), F1-score (0.88 ± 0.01), and NER (Non-error Rate: 0.90 ± 0.01) on the test set, which outperforms the model (NER = 0.85) of Rojas et al. in terms of NER, and our best regression model gives the 95% confidence intervals for the R2(test set) and ΔR2 [referring to |R2(test set)- R2(cross-validation)|] of 0.77 ± 0.01 and 0.03 ± 0.01, respectively, which is also better than the other works based on the conformation-independent 2D descriptors (e.g., 2D Dragon) according to R2(test set) and ΔR2. Our models are obtained by averaging over nineteen data-splitting schemes, and fully comply with the guidelines of Organization for Economic Cooperation and Development (OECD), which are not completely followed by the previous relevant works that are all on the basis of only one random data-splitting scheme for the cross-validation set and test set. Finally, we develop a user-friendly platform “e-Sweet” for the automatic prediction of sweetener and its corresponding RS. To our best knowledge, it is a first and free platform that can enable the experimental food scientists to exploit the current machine-learning methods to boost the discovery of more AS with the low or zero calorie content.
机译:人造甜味剂(AS)可以产生低热量或零卡路里的强烈甜味,并在食品和饮料行业中广泛用于替代营养糖。但是,当前AS的安全性问题仍然存在争议。因此,必须开发出更安全有效的AS。由于对AS进行昂贵且费力的实验筛选,因此,计算机模拟甜味剂/甜度预测可以为在实验之前确定潜在的甜味剂候选者提供一个很好的途径。在这项工作中,我们整理了530种甜味剂和850种非甜味剂的最大数据集,并从文献中收集了具有相对甜度(RS)的第二大数据集352种甜味剂。根据这些实验数据集,我们采用五种机器学习方法和与构象无关的分子指纹,分别通过共识策略得出用于预测甜味剂及其RS的分类和回归模型。我们的最佳分类模型在准确度(0.91±0.01),精度(0.90±0.01),特异性(0.94±0.01),灵敏度(0.86±0.01),F1评分(0.88±0.01)和测试集上的NER(无误码率:0.90±0.01)优于Rojas等人的模型(NER = 0.85)。就NER而言,我们最好的回归模型给出了R 2 (测试集)和ΔR 2 的95%置信区间[指| R 2 (测试集)-R 2 (交叉验证)|]分别为0.77±0.01和0.03±0.01,这也比其他基于构象无关的著作更好根据R 2 (测试集)和ΔR 2 的2D描述符(例如2D Dragon)。我们的模型是通过对19个以上的数据拆分方案进行平均而获得的,并且完全符合经济合作与发展组织(OECD)的准则,而先前的相关工作并没有完全遵循这些准则,而所有相关工作仅基于一个随机变量交叉验证集和测试集的数据拆分方案。最后,我们开发了一个用户友好的平台“ e-Sweet”,用于自动预测甜味剂及其相应的RS。据我们所知,它是第一个免费平台,可以使实验食品科学家利用当前的机器学习方法来促进发现更多低热量或零卡路里的AS。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号