Apparatus and method for estimating, from sparse data, the probability that a particular one of a set of events is the next event in a string of events
Abstract
Apparatus and method for evaluating the likelihood of an event (such as a word) following a string of known events, based on event sequence counts derived from sparse sample data. Event sequences, or m-grams, include a key and a subsequent event. For each m-gram which was counted in the sample data, there is stored a discounted probability P generated by applying, for example, a modified Turing's estimate to a count-based probability. For each key occurring in the sample data there is stored a normalization constant α which (a) adjusts the discounted probabilities for multiple counting, if any, and (b) includes a freed probability mass allocated to m-grams which do not occur in the sample data. To determine the likelihood of a selected event following a string of known events, a "backing off" scheme is employed in which successively shorter included keys (of known events) followed by the selected event (representing m-grams) are searched (302, 308) until an m-gram is found having a discounted probability stored therefor. The normalization constants (306, 312) of the longer searched keys, for which the corresponding m-grams have no stored discounted probability, are combined together with the found discounted probability to produce (304, 310, 314) the likelihood of the selected event being next.
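The back-off lookup described in the abstract can be sketched roughly as follows. This is an illustrative sketch only, not the patented implementation: the function name `backoff_probability`, the dictionary layout, and all numeric values are hypothetical. It also assumes that "combined" means multiplied, that a key absent from the sample data contributes a factor of 1, and that an event never seen at all gets probability 0. The discounting step itself (the modified Turing's estimate) is not shown; the tables are taken as given.

```python
from typing import Dict, Sequence, Tuple

Key = Tuple[str, ...]


def backoff_probability(
    key: Sequence[str],
    event: str,
    discounted: Dict[Tuple[Key, str], float],
    alpha: Dict[Key, float],
) -> float:
    """Estimate the likelihood of `event` following `key` by backing off.

    `discounted` maps (key, event) to a stored discounted probability.
    `alpha` maps a key to its stored normalization constant.
    Both tables are assumed to have been built beforehand.
    """
    weight = 1.0
    context = tuple(key)
    while True:
        stored = discounted.get((context, event))
        if stored is not None:
            # Found an m-gram with a stored discounted probability:
            # scale it by the constants of the longer keys already tried.
            return weight * stored
        if not context:
            # Assumption: the event was never seen, even on its own.
            return 0.0
        # Assumption: a key missing from the table contributes a factor of 1.
        weight *= alpha.get(context, 1.0)
        # Shorten the key by dropping its earliest event.
        context = context[1:]


# Hypothetical tables; the numbers are made up for illustration.
discounted = {
    (("the", "black"), "cat"): 0.5,
    (("black",), "dog"): 0.2,
    ((), "bird"): 0.05,
}
alpha = {("the", "black"): 0.4, ("black",): 0.5}

p_cat = backoff_probability(["the", "black"], "cat", discounted, alpha)
p_dog = backoff_probability(["the", "black"], "dog", discounted, alpha)
p_bird = backoff_probability(["the", "black"], "bird", discounted, alpha)
```

Here `p_cat` is read directly from the longest m-gram, `p_dog` backs off once (one normalization constant times the stored two-event probability), and `p_bird` backs off twice before reaching a stored single-event probability.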