Research on emotion analysis of Ctrip hotel reviews and user demand identification based on big data
DOI:
https://doi.org/10.71451/s7pbw235Keywords:
Ctrip Hotel Review; Sentiment analysis; Kano model; LDA model; User requirements identificationAbstract
With the rise of e-commerce economy, more and more people choose to book hotel travel online. As Ctrip is in a leading position in the e-commerce travel platform industry, this paper takes Ctrip hotel review as the representative for research. First, this article selects the top 40 hotels in the Beijing recommendation page to collect user reviews and related comments. A total of 34,988 comments were obtained after data consolidation. Then, the text data is cleaned, word segmentation, stop word and other preprocessing work. Finally, the processed data were classified. The emotion classification adopted in this paper mainly includes the emotion dictionary and the machine learning algorithms. The classification of the emotion dictionary is used to define the general classification of the text data and to visualize the display, then the word vector is constructed and the data set is divided using the TF-IDF algorithm. According to the corresponding classification algorithm to train and test the training set and test set respectively, the classifier with the best emotion classification is the logistic regression model. It can be seen from the classification results of emotional polarity that the overall emotional tendency of users to Beijing hotel experience is positive. It is best to visually analyze the conclusions and results obtained, which not only has certain reference value for the hotel merchants and other subjects, but also provides theoretical reference for users.
In addition, this paper proposes a user demand identification and evolution analysis model based on online comment mining based on the data of Beijing hotel reviews on Ctrip APP. Using the Kano model and the LDA model, the review underwent classification, identification, feature emotion pair analysis as well as time series analysis. The results show that according to the emotional trend prediction, the emotional value of type 1, type 2 and type 3 of users showed an upward trend, while the emotional value of type 4 showed a downward trend. Users mainly focus on hotel services and environmental experience. The research has improved the time dimension analysis method and model of online reviews, and provided a reference value for analyzing user needs and predicting the emotional trend of hotel selection.
References
[1] Shi Xin. Based on the emotional dictionary in the hotel field [D]. Hebei University, 2014.
[2] Ding Wei. Emotion analysis based on a combination of dictionary and machine learning [D]. Xi'an University of Posts and Telecommunications, 2017.
[3] Ma Zirui. Recommendation algorithm for tourist attractions based on the emotional dictionary [J]. Computer programming skills and maintenance, 2022, (04): 20-21 + 37.
[4] The Hao Glacier. Research on text classification algorithm based on corpus features [D]. Yanshan University, 2019.
[5] Peng Mei, Hu Bibo. Emotion analysis of e-commerce user comments based on big data and artificial intelligence [J]. Computer Programming Skills and Maintenance, 2022, (06): 123-126.
[6] Xu Dehua, Yang Zhiling. Correlation study between emotional tendency and ratings in online product review text [J]. Library and Information Guide, 2022,7 (02): 59-65.
[7] Liu Haiou, Yao Sumei, He Xutao, etc. Deep learning-based online health community depression user portrait study [J]. Small Microcomputer system, 2021,42 (03): 572-577.
[8] Zhang Cui, Zhou Mengjie. A text emotion classification method based on the fusion of CNN and bidirectional LSTM [J]. Computer Age, 2019, (12): 38-41.
[9] Yang Deqing, Zhang Jing, Guo Wei, et al. Kanano Model Building based on the online product community [J]. Mechanical Design, 2018,35 (03): 12-19.
[10] Wang Roihuan. Analysis and improvement of hotel customer satisfaction based on text mining [D]. Dongbei University of Finance and Economics, 2021.
Downloads
Published
Issue
Section
License
Copyright (c) 2023 International Scientific Technical and Economic Research

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
This work is licensed under the Creative Commons Attribution International License (CC BY 4.0).