Interference and precursor signal analysis based on random forest and improved logistic regression model
DOI:
https://doi.org/10.71451/m417wa34Keywords:
Random forest, ADF and MK test, Autocorrelation function, Logistic regression, Sigmoid functionAbstract
Coal is the main energy and industrial raw material in China. In order to prevent the risks of coal mining and ensure the safety and efficiency of coal mining, this paper established random forest model, ADF and MK test model and logistic regression model, and used autocorrelation function algorithm, Fourier transform and sliding window transformation of time series data to analyze and optimize the characteristics of interference signals and precursor characteristics. For the first work, the huge data is preprocessed, tested, and outliers are removed, and the time characteristics are taken to aggregate and classify the data. For work (1.1), data parameters were sorted out, Fourier transform prediction signal model was initially established, and characteristic values were extracted by wavelet transform. Finally, autocorrelation function algorithm was used to optimize and analyze the results. The final results were shown in Table 4 for the data characteristics of electromagnetic radiation (EMR) interference signals and Table 5 for the interference signals of acoustic propagation signals. For work (1.2), a random forest model is established, with sliding window transformation of time series data, window size is specified, boundary processing mode is specified, and label value is defined as interference and normal. The data set is divided into training set and test set in the way of eighty-two allocation. After AE training, there are three stages of training. Finally, random forest algorithm is used to calculate the electromagnetic radiation interference signal interval. For work (2.1), the model and algorithm of work 1 are used to identify the trend characteristics of the data before the occurrence of electromagnetic radiation and acoustic emission signals, and the signal data are judged to have a "slightly rising" trend. The statistical values are compared by KS trend test, and the trend characteristics of normal and precursor signals are calculated by using the autocorrelation function algorithm. A trend feature table is obtained for the precursory feature data of electromagnetic radiation and acoustic propagation signal. For work (2.2), Augmented dickey-Fuller and MK test models of the system were established. For work 3, a logistic regression model is established to predict the probability of precursor feature data appearing at the last moment of multiple time periods. By using maximum likelihood estimation to train model parameters, the characteristics of the last data collection moment of each time period are predicted, and the probability of precursor feature appearing at each moment is output. As shown in Table 14, the probability of precursor features appearing at the time when the data of multi-classification logistic regression is located is obtained.
**************** ACKNOWLEDGEMENTS****************
This work is supported by ministry of education industry-university cooperative education project (Grant No.: 231106441092432), the research and practice of integrating "curriculum thought and politics" into the whole process of graduation design of Mechanical engineering major: (Grant. No.: 30120300100-23-yb-jgkt03), research on the integration mechanism of "course-training-competition-creation-production" for innovation and entrepreneurship of mechanical engineering majors in applied local universities (Grant. No.: CXKT202405), Mechanical manufacturing equipment design school-level "gold class" construction project (Grant. No.: 30120324001).
References
[1] Wang, L. N., & Chen, L. (2017). Research on the application of support vector machine (SVM) combined with object-oriented method in information extraction of open pit mining area. The 19th Academic Exchange Meeting of six Provinces and one City of Surveying and Mapping Society in East China and the 2017 Cross-Straits Surveying and mapping Technology Exchange and Academic Seminar, 201-206.
[2] Hu, H. B., & Zhan, Y. L. (2018). Change characteristics of land use landscape pattern based on decision tree classification of remote sensing image. Green Technology, 24(072): 200-205.
[3] Wang, D. Y. (2021). Research on data reconstruction method of ground penetrating radar based on time series analysis. Shandong Technology and Business University.
[4] Wu, J. F. (2021). Research on anomaly detection and fault warning technology for in orbit satellites based on LSTM. National University of Defense Technology.
[5] Zhou,G. Y., Wan, S. P., & Chen.Y. N. (2022). Research on denoising algorithm for phase sensitive time domain reflectometer based on moving variance average algorithm. Journal of Instruments and Meters, 43(10): 233-240.
[6] Zhang, J. L., Guo, S. Y., & Ren, C. P. (2024). Research on personal credit rating card model based on logistic regression. Modern Information Technology, 8(05): 12-16.
[7] Du, B. Y., Gao, J. H., & Zhang, G. Z. (2024). Seismic prediction method and application of fracture density inversion for shale reservoir in-situ stress. Petroleum Geophysical Exploration, 59(2): 279-289.
[8] Zhou, Z. X., Zhen, X. J., & Liang, Y. G. (2024). Study on acoustic emission source location of ancient timber based on wavelet packet transform and cross-correlation. Shanxi Architecture, 50(9): 1-5.
[9] Yao, J. W., Li, Y., & Lv, Y. J. (2024). Research on the trend of water quality evolution in drinking water source areas based on water quality index method and M-K Test. Environmental Science and Management, 49(04): 43-48.
[10] Nadir, B., Azzeddine, D., & Ahmed, B. (2024). Exploring the effects of overvoltage unbalances on three phase induction motors: Insights from motor current spectral analysis and discrete wavelet transform energy assessment. Computers and Electrical Engineering, 117109242.
[11] Zhang, X. G., & Luo R. (2024). Prediction and analysis of TCM constitution based on ARIMA time series model. Asia-pacific traditional medicine, 20(04): 156-16.
[12] Li, Y. H., Zhou, Y., & Wu, X. F. (2024). A study on the potential supply evaluation method of regional landscape recreation services based on geodetectors and random forest models - taking the subtropical moist huaiyang low mountain landscape area as an example. Journal of Ecology, 13: 1-17.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 International Scientific Technical and Economic Research

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
This work is licensed under the Creative Commons Attribution International License (CC BY 4.0).