In recent years, the Web has become a popular medium for disseminating the information, news, ideas, and opinions of modern society. As a result, information on the Web reflects current events and trends in the real world, which in turn has attracted considerable interest in using the Web as a sociological research tool for detecting emerging topics and social trends. To facilitate this kind of sociological research, in this paper we study the characteristics of socio-topical web keywords sampled from a series of Thai web snapshots. A socio-topical web keyword is a keyword, extracted from the content of web pages, that relates to a topic of interest in real-world society. The study was conducted as follows. First, socio-topical keywords were sampled from the inverted index of each Thai web snapshot. Then, for each sampled keyword, we observed the pattern of changes in the number of documents containing the keyword and in its inverse document frequency (IDF) score. Finally, we tried to relate the observed patterns of change to corresponding real-world events in Thai society.
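The per-keyword measurements described above can be illustrated with a minimal sketch. It assumes each snapshot's inverted index is available as a mapping from keyword to the set of document IDs containing it, and it uses the common IDF variant log(N / df); the abstract does not state which IDF formula or index layout was actually used. The function names, snapshot labels, keywords, and counts below are all hypothetical toy values, not data from the study.

```python
import math


def idf(num_docs, doc_freq):
    """Common IDF variant log(N / df) (an assumption; the paper may use another).

    Returns None when the keyword does not occur in the snapshot.
    """
    if doc_freq == 0:
        return None
    return math.log(num_docs / doc_freq)


def keyword_trend(snapshots, keyword):
    """Return (label, document count, IDF) for one keyword in each snapshot.

    `snapshots` is a list of (label, total_docs, inverted_index) tuples, where
    inverted_index maps a keyword to the set of document IDs containing it.
    """
    rows = []
    for label, total_docs, index in snapshots:
        doc_freq = len(index.get(keyword, ()))
        rows.append((label, doc_freq, idf(total_docs, doc_freq)))
    return rows


# Toy data, invented purely for illustration.
snapshots = [
    ("snapshot-1", 8, {"flood": {1}, "election": {2, 3}}),
    ("snapshot-2", 8, {"flood": {1, 4, 5, 6}, "election": {2}}),
]

trend = keyword_trend(snapshots, "flood")
for label, doc_freq, score in trend:
    print(label, doc_freq, score)
```

In this toy example the document count for the keyword rises between snapshots while its IDF falls, which is the kind of change pattern one would then try to match against a real-world event.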