With the popularity of social media,the huge volume of the social media text becomes the important research object of natural language processing and data mining communities.However,the informal features of social text are significant obstacles to process and mine the text.One way to overcome the obstacles is to transform the informal text into formal one.This process is called social text normalization.This paper reviews the basic methods and recent advances of social text normalization research.And it discusses the future research direction of social text normalization.%社交媒体中的海量文本已经成为自然语言处理和数据挖掘领域的重点研究对象.然而社交文本中存在的不规范特征是对文本进行处理和挖掘的重要障碍.消除这一障碍的方法之一是将不规范的文本转化成规范的形式,即社交文本规范化.本文将对社交文本规范化的基本方法和研究现状进行综合介绍,同时也对社交文本规范化未来的研究方向进行了讨论.
展开▼