首页> 外国专利> COMPUTER-IMPLEMENTED TRAINING OF A POLICY MODEL FOR SPECIFYING A CONFIGURABLE PARAMETER OF A TELECOMMUNICATIONS NETWORK, SUCH AS AN ANTENNA ELEVATION DEGREE OF A NETWORK NODE, BY SMOOTHED-LOSS INVERSE PROPENSITY

COMPUTER-IMPLEMENTED TRAINING OF A POLICY MODEL FOR SPECIFYING A CONFIGURABLE PARAMETER OF A TELECOMMUNICATIONS NETWORK, SUCH AS AN ANTENNA ELEVATION DEGREE OF A NETWORK NODE, BY SMOOTHED-LOSS INVERSE PROPENSITY

机译:通过平滑损失反向倾向指定电信网络的可配置参数的计算机实现的策略模型的培训,例如网络节点的天线高程程度

摘要

A computer implemented method for training a policy for operating a telecommunications network includes providing (602) a baseline dataset of performance indicator data for the telecommunications network, generating (612) a policy model that specifies actions to be taken on a configurable parameter of the telecommunications network given a context of the telecommunications network, generating (606) a loss model that estimates an expected loss experienced for execution in the telecommunications network of at least one action from a plurality of actions on the configurable parameter, training (608) the loss model to generate a trained loss model having a level of reduced noise, and performing (630) inverse propensity score learning on the policy model using the trained loss model to obtain a trained policy model. A method performed by a computer system for controlling an antenna elevation degree of an antenna of a network node in a telecommunications network is also provided.
机译:用于培训用于操作电信网络的策略的计算机实现的方法包括提供(602)电信网络的性能指示符数据的基线数据集,生成(612)策略模型,该策略模型指定要在电信的可配置参数上采取的动作给出网络给定电信网络的上下文,生成(606)估计在可配置参数上的多个动作中至少一个动作在电信网络中执行的预期损失,训练(608)损耗模型为了生成具有降低噪声水平的培训损耗模型,并且使用培训的损耗模型对策略模型执行(630)逆倾向评分学习以获得训练策略模型。还提供了一种用于控制电信网络中的网络节点的天线的天线高度的计算机系统执行的方法。

著录项

相似文献

  • 专利
  • 外文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号