Interference and precursor signal analysis based on random forest and improved logistic regression model

Authors

  • Dongping Sheng, Changzhou Institute of Technology, Changzhou, China
  • Jie Yang, Changzhou Institute of Technology, Changzhou, China
  • Hongliang Wang, Changzhou Institute of Technology, Changzhou, China
  • Chun Su Zhao, Changzhou Institute of Technology, Changzhou, China
  • Chun Su, Changzhou Institute of Technology, Changzhou, China

DOI:

https://doi.org/10.71451/m417wa34

Keywords:

Random forest, ADF and MK test, Autocorrelation function, Logistic regression, Sigmoid function

Abstract

Coal is China's main energy source and industrial raw material. To reduce the risks of coal mining and ensure its safety and efficiency, this paper establishes a random forest model, Augmented Dickey-Fuller (ADF) and Mann-Kendall (MK) test models, and a logistic regression model, and uses the autocorrelation function, the Fourier transform, and a sliding-window transformation of the time-series data to analyze the characteristics of interference signals and precursor signals. For Task 1, the large data set is preprocessed, tested, and cleaned of outliers, and the data are aggregated and classified by their time characteristics. For Task 1.1, the data parameters are sorted out, a preliminary Fourier-transform signal prediction model is established, and characteristic values are extracted by wavelet transform. The autocorrelation function is then used to optimize and analyze the results. The final results are given in Table 4 for the characteristics of electromagnetic radiation (EMR) interference signals and in Table 5 for the interference signals of acoustic propagation signals. For Task 1.2, a random forest model is established on a sliding-window transformation of the time-series data, with a specified window size and boundary-handling mode, and with each label defined as either interference or normal. The data set is divided into a training set and a test set in an 80/20 ratio. After AE training, three stages of training follow. Finally, the random forest algorithm is used to identify the EMR interference signal intervals.

For Task 2.1, the model and algorithm of Task 1 are used to identify the trend characteristics of the data preceding the occurrence of EMR and acoustic emission signals, and the signal data are judged to have a "slightly rising" trend. The statistical values are compared by the KS trend test, and the trend characteristics of normal and precursor signals are calculated with the autocorrelation function. This yields a trend feature table for the precursor feature data of EMR and acoustic propagation signals. For Task 2.2, Augmented Dickey-Fuller and MK test models of the system are established. For Task 3, a logistic regression model is established to predict the probability that precursor features appear at the last moment of each of several time periods. The model parameters are trained by maximum likelihood estimation, the features at the last data collection moment of each time period are evaluated, and the probability of a precursor feature appearing at each such moment is output. Table 14 lists the resulting probabilities, as obtained from the multi-class logistic regression.
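The logistic regression step described for Task 3 can be illustrated with a minimal sketch. It assumes a binary (precursor vs. normal) label rather than the multi-class setting mentioned above, and fits the parameters by plain gradient ascent on the log-likelihood, which is one common way to carry out maximum likelihood estimation and not necessarily the authors' procedure. The two features per sample, the toy values, and the function names `fit_logistic` and `predict_proba` are hypothetical and chosen only for illustration.

```python
import math


def sigmoid(z):
    """Numerically stable logistic (sigmoid) function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def fit_logistic(X, y, lr=0.1, epochs=2000):
    """Estimate weights and bias by gradient ascent on the mean log-likelihood."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            # Gradient of the log-likelihood for one sample is (y - p) * x.
            err = yi - sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b)
            for j in range(d):
                grad_w[j] += err * xi[j]
            grad_b += err
        w = [wj + lr * gj / n for wj, gj in zip(w, grad_w)]
        b += lr * grad_b / n
    return w, b


def predict_proba(w, b, x):
    """Probability that sample x belongs to the precursor class."""
    return sigmoid(sum(wj * xj for wj, xj in zip(w, x)) + b)


# Hypothetical toy data: [mean of last window, slope of last window];
# label 1 marks a precursor period, label 0 a normal period.
X = [[0.1, -0.2], [0.2, 0.0], [0.3, 0.1], [0.8, 0.5], [0.9, 0.7], [1.0, 0.6]]
y = [0, 0, 0, 1, 1, 1]

w, b = fit_logistic(X, y)
p_low = predict_proba(w, b, [0.15, -0.1])   # resembles the normal samples
p_high = predict_proba(w, b, [0.95, 0.65])  # resembles the precursor samples
print(f"p_low={p_low:.3f}, p_high={p_high:.3f}")
```

In this sketch the sigmoid maps the linear score to a probability between 0 and 1, so the output for the last moment of each period can be read directly as the estimated chance that a precursor feature is present. A multi-class version would replace the sigmoid with a softmax over the classes.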

 

Acknowledgements

This work is supported by the Ministry of Education Industry-University Cooperative Education Project (Grant No. 231106441092432); the project "Research and practice of integrating 'curriculum thought and politics' into the whole process of graduation design for the mechanical engineering major" (Grant No. 30120300100-23-yb-jgkt03); the project "Research on the integration mechanism of 'course-training-competition-creation-production' for innovation and entrepreneurship of mechanical engineering majors in applied local universities" (Grant No. CXKT202405); and the school-level "gold class" construction project for Mechanical Manufacturing Equipment Design (Grant No. 30120324001).

Published

2025-02-09 — Updated on 2025-02-09

Section

Research Article

How to Cite

Interference and precursor signal analysis based on random forest and improved logistic regression model. (2025). International Scientific Technical and Economic Research, 8-24. https://doi.org/10.71451/m417wa34
