Issue |
Wuhan Univ. J. Nat. Sci.
Volume 26, Number 6, December 2021
|
|
---|---|---|
Page(s) | 464 - 472 | |
DOI | https://doi.org/10.1051/wujns/2021266464 | |
Published online | 17 December 2021 |
Computer Science
CLC number: TP305
Online Latent Dirichlet Allocation Model Based on Sentiment Polarity Time Series
1 School of Electronic and Electrical Engineering, Shanghai University of Engineering Science, Shanghai 201620, China
2 Henan Costar Group Ltd, Nanyang 450061, Henan, China
3 School of Computer Engineering and Science, Shanghai University, Shanghai 200244, China
4 Shanghai Key Laboratory of Integrated Administration Technologies for Information Security, Shanghai 200240, China
Received: 26 July 2021
The Product Sensitive Online Dirichlet Allocation model (PSOLDA) proposed in this paper mainly uses the sentiment polarity of topic words in the review text to improve the accuracy of topic evolution. First, we use Latent Dirichlet Allocation (LDA) to obtain the distribution of topic words in the current time window. Second, the word2vec word vector is used as auxiliary information to determine the sentiment polarity and obtain the sentiment polarity distribution of the current topic. Finally, the sentiment polarity changes of the topics in the previous and next time window are mapped to the sentiment factors, and the distribution of topic words in the next time window is controlled through them. The experimental results show that the PSOLDA model decreases the probability distribution by 0.160 1, while Online Twitter LDA only increases by 0.069 9. The topic evolution method that integrates the sentimental information of topic words proposed in this paper is better than the traditional model.
Key words: topic evolution / sentiment factors / word vector / Latent Dirichlet Allocation (LDA)
Biography: HUANG Bo, male, Ph.D., Associate professor, research directions: software engineering, artificial intelligence. E-mail: huangbosues@sues.edu.cn
Foundation item: Supported by the Opening Project of Shanghai Key Laboratory of Integrated Administration Technologies for Information Security (AGK2019004) , Songjiang District Science and Technology Research Project (19SJKJGG83) , National Natural Science Foundation of China (61802251)
© Wuhan University 2021
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.