Multi-Dimensional Analysis of Massive Text Corpora
[视频介绍]
简介:The real-world big data are largely unstructured and interconnected, in the form of natural language text. It is highly desirable to view and analyze massive text data from multi-dimensional angles. This poses a major challenge on how to transform unstructured text data into structured text and analyze such data in multidimensional space. To facilitate such analytical functionality, we propose a textcube modeling and discuss how to construct such cubes from massive text corpora and how to conduct multidimensional OLAP analysis using such textcubes. In the past several years, we have developed a text mining approach that only needs distant or minimal supervision but relies on massive data.
播放778次
收藏
视频介绍
讲师:Jiawei Han
关键词:
课程简介:The real-world big data are largely unstructured and interconnected, in the form of natural language text. It is highly desirable to view and analyze massive text data from multi-dimensional angles. This poses a major challenge on how to transform unstructured text data into structured text and analyze such data in multidimensional space. To facilitate such analytical functionality, we propose a textcube modeling and discuss how to construct such cubes from massive text corpora and how to conduct multidimensional OLAP analysis using such textcubes. In the past several years, we have developed a text mining approach that only needs distant or minimal supervision but relies on massive data.