Big data cleaning modeling of operation status of coal mine fully-mechanized coal mining equipment
Abstract
To address the large data volume, noise, and missing values in the operation status data of coal mine fully-mechanized coal mining equipment, a MapReduce-based big data cleaning model for this data was established. The model consists of two MapReduce jobs. The first job corrects noise points and missing values in the data and outputs multiple cleaned data files. The second job sorts these cleaned files by collection date and time and combines them into a single data file. The experimental results show that the model effectively eliminates noisy data and fills in missing data, giving a good data cleaning effect.
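The two-stage flow described above can be illustrated with a minimal in-memory sketch in Python. It is not the authors' implementation. The abstract does not state how noise points are detected or how missing values are filled, so the median-absolute-deviation threshold `k`, the neighbour-averaging fill, the `(timestamp, sensor, value)` record layout, and all function names are assumptions made for illustration only. A real deployment would run each stage as a Hadoop MapReduce job rather than as plain Python functions.

```python
from collections import defaultdict
from statistics import median


def map_by_sensor(records):
    """Stage 1 map: key each (timestamp, sensor, value) record by sensor."""
    for ts, sensor, value in records:
        yield sensor, (ts, value)


def clean_series(points, k=3.0):
    """Stage 1 reduce: correct noise points and missing values for one sensor.

    Assumed rule (not from the paper): a value is noise if it lies more than
    k median-absolute-deviations from the median. Noise and missing values
    (None) are replaced by the mean of the nearest valid neighbours.
    Assumes the series contains at least one valid value.
    """
    points = sorted(points, key=lambda p: p[0])  # order by timestamp only
    known = [v for _, v in points if v is not None]
    med = median(known)
    mad = median(abs(v - med) for v in known) or 1e-9  # avoid a zero threshold
    vals = [v if v is not None and abs(v - med) <= k * mad else None
            for _, v in points]
    cleaned = []
    for i, v in enumerate(vals):
        if v is None:
            left = next((vals[j] for j in range(i - 1, -1, -1)
                         if vals[j] is not None), None)
            right = next((vals[j] for j in range(i + 1, len(vals))
                          if vals[j] is not None), None)
            neighbours = [x for x in (left, right) if x is not None]
            v = sum(neighbours) / len(neighbours)
        cleaned.append((points[i][0], v))
    return cleaned


def run_stage1(records):
    """First job: group by sensor, clean, and emit one cleaned 'file' per sensor."""
    groups = defaultdict(list)
    for sensor, point in map_by_sensor(records):
        groups[sensor].append(point)
    return {sensor: clean_series(pts) for sensor, pts in groups.items()}


def run_stage2(cleaned_files):
    """Second job: key every cleaned record by timestamp and merge into one list."""
    keyed = [(ts, sensor, v)
             for sensor, pts in cleaned_files.items()
             for ts, v in pts]
    return sorted(keyed)  # ISO-8601 timestamps sort chronologically as strings
```

Calling `run_stage2(run_stage1(records))` on a list of raw records yields a single time-ordered list, mirroring the single output file of the second job. Grouping by sensor in the first stage is one possible partitioning; the paper may split the data differently.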