An Approach to Summarizing Product Reviews
https://doi.org/10.25205/1818-7935-2022-20-4-90-106
Abstract
Product reviews are an important feature of e-commerce because they influence the communicative behavior of a potential addressee (customers). Users often read online product reviews to get either general or specific information about a product or service. Besides, companies analyze customer reviews to improve their product quality or to adjust their marketing strategy. However, many reviews are unstructured and long. With the number of product reviews growing rapidly, reading a large number of reviews becomes a time-consuming process for both users and companies. Therefore, review summarization becomes a serious issue. In this paper, we propose a new approach to summarizing reviews of electronics and household appliances. The proposed approach makes it possible to structure information for every aspect category; it provides calculation sentiment score for each aspect category, and shows the most relevant sentences for each aspect. We used product reviews from Yandex.Market as target data. Our task was performed in five main phases: 1) expert identification of thematic aspect categories; 2) classifying sentences into the predefined aspect categories; 3) sentence classification into two classes—positive and negative—with calculating the number of both type sentences within each aspect category; 4) sentence ranking; 5) visualization of the results obtained in the previous phases. The quality of the algorithm for creating a resume from a large collection of reviews has been tested on five models of products from the following categories: coffee machines, robot vacuum cleaners, e-books, TV-sets, and washing machines.
Keywords
About the Author
N. S. ChechnevaRussian Federation
Nadezhda S. Chechneva, PhD student
Saint Petersburg
References
1. Akhtar, N. et al. Aspect based sentiment oriented summarization of hotel reviews. Procedia computer science, 2017, vol. 115, pp. 563–571.
2. Bojanowski, P. et al. Enriching word vectors with subword information. Transactions of the association for computational linguistics, 2017, vol. 5, pp. 135–146.
3. Borisova, E. G. Marketing linguistics: prospects and trends. Verhnevolzhski philological bulletin, 2016, no. 4, pp. 140–143.
4. Brokos, G. I., Malakasiotis, P., Androutsopoulos, I. Using centroids of word embeddings and word mover’s distance for biomedical document retrieval in question answering. Proceedings of the 15th Workshop on Biomedical Natural Language Processing, 2016, pp. 114–118.
5. Condori, R. E. L., Pardo, T. A. S. Opinion summarization methods: Comparing and extending extractive and abstractive approaches. Expert Systems with Applications, 2017, vol. 78, pp. 124– 134.
6. d’Sa, A. G. et al. Label propagation-based semi-supervised learning for hate speech classification. Insights from Negative Results Workshop, EMNLP, 2020.
7. Ellson, J. et al. Graphviz and dynagraph—static and dynamic graph drawing tools. Graph drawing software. Springer, Berlin, Heidelberg, 2004, pp. 127–148.
8. Fishbein, M. An investigation of the relationships between beliefs about an object and the attitude toward that object. Human relations, 1963, vol. 16, no. 3, pp. 233–239.
9. Howard, J. A. Consumer behavior: Application of theory. McGraw-Hill Companies. 1977.
10. Johnson, E. J., Tversky, A. Representations of Perceptions of Risk. Journal of Experimental Psychology: General, 1984, pp. 55–70.
11. Hu, M., Liu, B. Mining and summarizing customer reviews. Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, 2004, pp. 168–177.
12. Joulin, A., Grave, E., Bojanowski, P., Mikolov, T. Bag of Tricks for Efficient Text Classification. Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017, pp. 427–431.
13. Kim, S. et al. A hierarchical aspect-sentiment model for online reviews. Proceedings of the AAAI Conference on Artificial Intelligence, 2013, vol. 27, pp. 526–533.
14. Kotler, F. Marketing Management: Analysis, Planning, and Control. Prentice-Hall, 1967, 628 p.
15. Lancaster, K. J. A new approach to consumer theory. Journal of political economy, 1966, vol. 74, no. 2, pp. 132–157.
16. Mihalcea, R., Tarau, P. Textrank: Bringing order into text. Proceedings of the 2004 conference on empirical methods in natural language processing. 2004, pp. 404–411.
17. Mithun, S., Kosseim, L. Summarizing blog entries versus news texts. Proceedings of the Workshop on Events in Emerging Text Types. 2009, pp. 1–8
18. Mukherjee, A., Liu, B. Aspect extraction through semi-supervised modeling. Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2012, pp. 339–348.
19. RD 50-165-79 Consumer goods. Selection of the nomenclature of consumer properties and quality indicators. The main features [Online]. URL: https://files.stroyinf.ru/Index2/1/4293762/4293762287.htm (accesed on: 28.05.2022) (in Russ.)
20. Rosenberg, M. J. Cognitive structure and attitudinal affect. The Journal of abnormal and social psychology, 1956, vol. 53, no. 3, p. 367.
21. Rubashkin, V. Ontological semantics. Knowledge. Ontologies. Ontology-based approach to information analysis of text. Moscow, Fizmatlit, 2012, 348 p. (in Russ.)
22. Thaler, R. Toward a positive theory of consumer choice. Journal of economic behavior & organization, 1980, vol. 1, no. 1, pp. 39–60.
23. Tversky, A., Kahneman, D. Judgment under uncertainty: Heuristics and biases. Science, 1974, vol. 185, no. 4157, pp. 1124–1131.
24. Zhai, Z. et al. Clustering product features for opinion mining. Proceedings of the fourth ACM international conference on Web search and data mining, 2011, pp. 347–354
25. Yu, J. et al. Domain-assisted product aspect hierarchy generation: towards hierarchical organization of unstructured consumer reviews. Proceedings of the 2011 conference on empirical methods in natural language processing, 2011, pp. 140–150.
Review
For citations:
Chechneva N.S. An Approach to Summarizing Product Reviews. NSU Vestnik. Series: Linguistics and Intercultural Communication. 2022;20(4):90-106. (In Russ.) https://doi.org/10.25205/1818-7935-2022-20-4-90-106