智能化煤矿数据仓库建模方法

王霖; 方乾; 张晓霞; 苏上海; 施展; 王雅琨

doi:10.13272/j.issn.1671-251x.2021120007

摘要: 煤矿海量数据存在“数据孤岛”、关联性弱、因缺乏数据管理体系而导致数据质量差等问题，难以充分利用，无法为煤矿智能化提供分析决策支撑。数据仓库可满足煤矿多源异构数据集成需求，为煤矿智能化应用提供数据基础。通过分析煤矿数据类型、特点及实际数据智能化应用需求，研究了智能化煤矿数据仓库建模方法。首先，构建了智能化煤矿数据仓库分层架构，分析了原始数据层、明细数据层、基础指标层、服务数据层、公共维度层数据模型特点；其次，以综采工作面数据为例，从业务数据分析、应用需求分析、分层架构设计等方面阐述了数据仓库建模过程；再次，介绍了煤矿数据仓库中数据模型构建方法，即通过维度对齐、维度关联、维度化指标聚合等将原始数据转换为数据仓库维度模型，解决了不同维度的煤矿数据关联应用问题；最后，为解决煤矿数据仓库的可迁移性问题，提出了煤炭行业通用数据仓库+参数化ETL（抽取、转换、加载）方法的煤矿参数化数据仓库设计思路。在实验室环境下搭建了煤矿数据仓库平台，对山西天地王坡煤业有限公司综采工作面数据进行处理，并基于处理数据辅助机理模型分析、实现可视化管理驾驶舱，验证了智能化煤矿数据仓库的实用性；对比了原始数据模型与智能化煤矿数据仓库的性能指标，结果表明智能化煤矿数据仓库的数据组织度、模型复用度和迭代难易度均优于原始数据模型，且数据查询响应时间缩短50%以上。

Abstract: The coal mine massive data has problems such as 'data island', weak correlation, poor data quality due to lack of data management system. It is difficult to make full use of the data and provide analysis and decision-making support for coal mine intelligence. The data warehouse can meet the requirements of multi-source heterogeneous data integration in coal mine, and provide data basis for intelligent application in coal mine. By analyzing the coal mine data types, characteristics and intelligent application requirements of actual data, the intelligent coal mine data warehouse modeling method is studied. Firstly, the layered architecture of intelligent coal mine data warehouse is constructed, and the characteristics of data model of original data layer, detailed data layer, basic index layer, service data layer and public dimension layer are analyzed. Secondly, taking the data of fully mechanized working face as an example, the modeling process of data warehouse is expounded from the aspects of business data analysis, application demand analysis and layered architecture design. Thirdly, the construction method of data model in coal mine data warehouse is introduced. The original data is transformed into data warehouse dimensional model through dimension alignment, dimension association and dimensional index aggregation. The method solves the application problem of coal mine data association in different dimensions. Finally, in order to solve the problem of portability of coal mine data warehouse, the design idea of coal mine parametric data warehouse based on general data warehouse in coal mine industry + parametric ETL (extraction-transformation-load) method is proposed. The platform of coal mine data warehouse in the laboratory environment is set up to process the data of fully mechanized working face of Shanxi Tiandi Wangpo Coal Industry Co., Ltd. The auxiliary mechanism model analysis and visual management cockpit are realized based on the processing data, which verifies the practicability of intelligent coal mine data warehouse. The performance indexes of the original data model and the intelligent coal mine data warehouse are compared. The results show that the data organization, model reuse and iteration difficulty of the intelligent coal mine data warehouse are better than those of the original data model, and the data query response time is shortened by more than 50%.

智能化煤矿数据仓库建模方法

Intelligent coal mine data warehouse modeling method