The purpose of this paper is to research integrating web text resources and mine its emergence.
With the understanding of characteristics of internet resources, this paper will focus on solving the problem of text resource aggregation in open environment and its emergence showed during aggregation over time. The authors process these text resources, both in space and time dimension, through viewing them as an event stream evolving over time, and attempt to discover the evolutionary event patterns and furthermore, to mine the emergence of text content.
The proposed methods are generally applicable to text stream data and have many potential applications in text resource aggregation in open environment.
The main limitation is availability of data.
The paper presents a very useful method for text resource aggregation in an open environment.
The paper presents a new method to integrate web text resources and mine its emergence.
