OLAP Analyzing System of Coal Sale Data Based on Hadoop Platform
Abstract: To address the problem that coal sales data is large in volume but low in information content, an OLAP analysis system for coal sales data was developed on the Hadoop platform. The paper introduces the design ideas and architecture of the system and, taking sales volume statistics as an example, describes the specific process by which deep, fast data mining and intuitive display of the data are implemented. The system uses the Hadoop cloud platform for ETL processing of the data, creates a Hive distributed data warehouse, and uses Hive's HQL language for OLAP statistical analysis. It can quickly and accurately perform multi-level, multi-angle, in-depth data mining, statistics, and analysis of sales volume information, and it presents the analysis results intuitively from multiple angles.
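As a rough illustration of the multi-level sales volume statistics described in the abstract, the sketch below sums sales volume at several levels of granularity. It is written in plain Python rather than HQL and runs on a small in-memory list, so it only mirrors the grouping idea, not the paper's Hive implementation. The record layout (`region`, `year`, `month`, tons sold), the sample values, and the `roll_up` helper are all assumptions made for illustration and do not come from the paper.

```python
from collections import defaultdict

# Hypothetical sales records: (region, year, month, tons sold).
# The fields and values are illustrative assumptions, not data from the paper.
SALES = [
    ("North", 2013, 1, 120.0),
    ("North", 2013, 2, 80.0),
    ("South", 2013, 1, 200.0),
    ("South", 2014, 1, 50.0),
]

# Names of the dimension columns, in the order they appear in each record.
DIMENSIONS = ("region", "year", "month")


def roll_up(records, group_by):
    """Sum sales volume over the chosen dimensions (similar to a GROUP BY)."""
    idx = [DIMENSIONS.index(d) for d in group_by]
    totals = defaultdict(float)
    for rec in records:
        key = tuple(rec[i] for i in idx)
        totals[key] += rec[3]  # column 3 holds the tons sold
    return dict(totals)


# The same data viewed at three levels of detail.
by_region = roll_up(SALES, ["region"])
by_region_year = roll_up(SALES, ["region", "year"])
grand_total = roll_up(SALES, [])
```

Grouping by fewer dimensions gives a coarser summary (a roll-up), and grouping by more gives a finer one (a drill-down). In a Hive warehouse the equivalent step would typically be an aggregate query with a `GROUP BY` clause over the sales table.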