Customer reviews and comments on web pages are important information n our daily life. For example, we prefer to choose a hotel with positive comments rom previous customers. As the huge amounts of such information demonstrate the haracteristics of big data, it places heavy burdens on the assimilation of the customercontributed pinions. To overcoming this problem, we study an efficient opinion ummarization approach for a set of massive user reviews and comments associated ith an online resource, to summarize the opinions into two categories, i.e., positive nd negative. In this paper, we proposed a framework including: (1) overcoming the ig data problem of online comments using the efficient online-LDA approach; (2) electing meaningful topics from the imbalanced data; (3) summarizing the opinion f comments with high precision and recall. This framework is different from much f the previous work in that the topics are pre-defined and selected the topics for etter opinion summarization. To evaluate the proposed framework, we perform the xperiments on a dataset of hotel reviews for the variety of topics contained. The esults show that our framework can gain a significant performance improvement on pinion summarization.
展开▼