Advanced product review summarization in e-commerce marketplaces: elevating beyond tf-idf and lexrank method

(1) Rita Melina Anggraeni Mail (Informatics Department, Universitas Ahmad Dahlan, Indonesia)
(2) * Dewi Pramudi Ismi Mail (Informatics Department, Universitas Ahmad Dahlan, Indonesia)
*corresponding author

Abstract


In the fiercely competitive domain of online product sales, wherein engendering trust among prospective buyers assumes paramount significance, the role of product reviews cannot be understated. However, a prevailing issue in online marketplaces resides in the presence of product reviews that do not consistently align with the overall product rating. Furthermore, the sheer abundance of comments often leads potential consumers to confine their scrutiny to the initial comments, thus leaving a substantial volume of reviews unexplored. To rectify this challenge, this study introduces an automated text summarization system for product reviews, leveraging the LexRank methodology. This system underwent rigorous evaluation using the Rouge metric, with results manifesting substantial promise. At a threshold of 0.1, Rouge-1 exhibited an accuracy of 16.67%, while Rouge-2 scored 3.01%, and Rouge-L reached 16.50%. At a threshold of 0.2, Rouge-1 yielded a score of 16.08%, Rouge-2 registered 2.64%, and Rouge-L scored 16.57%. The second evaluation, performed with a distinct test dataset, notably excelled, emphasizing the system's competence. Specifically, at the 0.2 threshold, the system displayed superior performance, underscoring its efficacy in refining product review summarization within online marketplaces

Keywords


Text summarization; Review summarization; LexRank method; E-commerce marketplace; ROUGE evaluation

   

DOI

https://doi.org/10.31763/sitech.v3i2.1225
      

Article metrics

10.31763/sitech.v3i2.1225 Abstract views : 385 | PDF views : 150

   

Cite

   

Full Text

Download

References


[1] B. De Quero Navarro and A. Joy, “Confronting the Postmodern Intersection of Markets, Development, Globalization, and Technology: The Necessary Radical Vision of Nikhilesh Dholakia,” Mark. Glob. Dev. Rev., vol. 7, no. 2, pp. 1-13, 2022, doi: 10.23860/MGDR-2022-07-02-02.

[2] B. Barann, J. H. Betzing, M. Niemann, B. Hoffmeister, and J. Becker, “Exploring customers’ likeliness to use e-service touchpoints in brick and mortar retail,” Electron. Mark., vol. 32, no. 2, pp. 523–545, Jun. 2022, doi: 10.1007/s12525-020-00445-0.

[3] I. Bubanja and M. Vidas-Bubanja, “Managing trade transactions in the covid era: The rise of e-commerce,” J. Eng. Manag. Compet., vol. 12, no. 1, pp. 20–34, 2022, doi: 10.5937/jemc2201020B.

[4] R. Chai, “Leveraging Corporate Social Innovation for Value Co-Creation: The Case of Superapps, an Integrative Literature Review,” SSRN Electron. J., p. 20, 2021, doi: 10.2139/ssrn.4004373.

[5] Y. K. Dwivedi et al., “Setting the future of digital and social media marketing research: Perspectives and research propositions,” Int. J. Inf. Manage., vol. 59, p. 102168, Aug. 2021, doi: 10.1016/j.ijinfomgt.2020.102168.

[6] Y. Kim and S. Suh, “The Core Value of Sustainable Fashion: A Case Study on ‘Market Gredit,’” Sustainability, vol. 14, no. 21, p. 14423, Nov. 2022, doi: 10.3390/su142114423.

[7] Y. Wang, F. Jia, T. Schoenherr, and Y. Gong, “Supply Chain-Based Business Model Innovation: The Case of a Cross-Border E-Commerce Company,” Sustainability, vol. 10, no. 12, p. 4362, Nov. 2018, doi: 10.3390/su10124362.

[8] M. Matarazzo, L. Penco, G. Profumo, and R. Quaglia, “Digital transformation and customer value creation in Made in Italy SMEs: A dynamic capabilities perspective,” J. Bus. Res., vol. 123, pp. 642–656, Feb. 2021, doi: 10.1016/j.jbusres.2020.10.033.

[9] N. . Saravana Kumar, K. Hariprasath, N. Kaviyavarshini, and A. Kavinya, “A study on forecasting bigmart sales using optimized machine learning techniques,” Sci. Inf. Technol. Lett., vol. 1, no. 2, pp. 52–59, Nov. 2020, doi: 10.31763/sitech.v1i2.167.

[10] P.-S. Lo, Y. K. Dwivedi, G. Wei-Han Tan, K.-B. Ooi, E. Cheng-Xi Aw, and B. Metri, “Why do consumers buy impulsively during live streaming? A deep learning-based dual-stage SEM-ANN analysis,” J. Bus. Res., vol. 147, pp. 325–337, Aug. 2022, doi: 10.1016/j.jbusres.2022.04.013.

[11] R. Bandara, M. Fernando, and S. Akter, “Privacy concerns in E-commerce: A taxonomy and a future research agenda,” Electron. Mark., vol. 30, no. 3, pp. 629–647, Sep. 2020, doi: 10.1007/s12525-019-00375-6.

[12] B. Lu and S. Zhang, “A Conjoint Approach to Understanding Online Buyers’ Decisions towards Online Marketplaces,” J. Theor. Appl. Electron. Commer. Res., vol. 15, no. 3, pp. 69–83, Sep. 2020, doi: 10.4067/S0718-18762020000300106.

[13] M. Nachar, “Factors that predict the adoption of online shopping in Saudi Arabia,” Walden University, 2019. [Online]. Available at: https://scholarworks.waldenu.edu/dissertations/6630/#:~:text.

[14] J. F. Gaski, “A history of brand misdefinition – with corresponding implications for mismeasurement and incoherent brand theory,” J. Prod. Brand Manag., vol. 29, no. 4, pp. 517–530, Mar. 2020, doi: 10.1108/JPBM-11-2018-2124.

[15] L. Yang, Y. Li, J. Wang, and R. S. Sherratt, “Sentiment Analysis for E-Commerce Product Reviews in Chinese Based on Sentiment Lexicon and Deep Learning,” IEEE Access, vol. 8, pp. 23522–23530, 2020, doi: 10.1109/ACCESS.2020.2969854.

[16] L. Zheng, P. Jin, J. Zhao, and L. Yue, “Multi-dimensional Sentiment Analysis for Large-Scale E-commerce Reviews,” Decker, H., Lhotská, L., Link, S., Spies, M., Wagner, R.R. Database Expert Syst. Appl. DEXA 2014. Lect. Notes Comput. Sci., vol. 8645, pp. 449–463, 2014, doi: 10.1007/978-3-319-10085-2_41.

[17] M. R. Ramadhan, S. N. Endah, and A. B. J. Mantau, “Implementation of Textrank Algorithm in Product Review Summarization,” in 2020 4th International Conference on Informatics and Computational Sciences (ICICoS), Nov. 2020, pp. 1–5, doi: 10.1109/ICICoS51170.2020.9299005.

[18] S. Novgorodov, I. Guy, G. Elad, and K. Radinsky, “Generating Product Descriptions from User Reviews,” in The World Wide Web Conference, May 2019, pp. 1354–1364, doi: 10.1145/3308558.3313532.

[19] S. Novgorodov, I. Guy, G. Elad, and K. Radinsky, “Descriptions from the Customers: Comparative Analysis of Review-based Product Description Generation Methods,” ACM Trans. Internet Technol., vol. 20, no. 4, pp. 1–31, Nov. 2020, doi: 10.1145/3418202.

[20] S. Deo and D. Banik, “Text Summarization using Textrank and Lexrank through Latent Semantic analysis,” in 2022 OITS International Conference on Information Technology (OCIT), Dec. 2022, pp. 113–118, doi: 10.1109/OCIT56763.2022.00031.

[21] N. H. Cahyana, S. Saifullah, Y. Fauziah, A. S. Aribowo, and R. Drezewski, “Semi-supervised Text Annotation for Hate Speech Detection using K-Nearest Neighbors and Term Frequency-Inverse Document Frequency,” Int. J. Adv. Comput. Sci. Appl., vol. 13, no. 10, pp. 147-151, 2022, doi: 10.14569/IJACSA.2022.0131020.

[22] S. Saifullah, Y. Fauziah, and A. S. Aribowo, “Comparison of Machine Learning for Sentiment Analysis in Detecting Anxiety Based on Social Media Data,” Jurnal Informatika, vol 15, no.1, pp. 45-55, Jan. 2021, doi: 10.26555/jifo.v15i1.a20111.

[23] D. Yadav et al., “Qualitative Analysis of Text Summarization Techniques and Its Applications in Health Domain,” Comput. Intell. Neurosci., vol. 2022, pp. 1–14, Feb. 2022, doi: 10.1155/2022/3411881.

[24] M. S M, R. M P, A. R E, and E. S. G SR, “Text Summarization Using Text Frequency Ranking Sentence Prediction,” in 2020 4th International Conference on Computer, Communication and Signal Processing (ICCCSP), Sep. 2020, pp. 1–5, doi: 10.1109/ICCCSP49186.2020.9315203.

[25] M. Al-Dhelaan and A. Al-Suhaim, “Sentiment diversification for short review summarization,” in Proceedings of the International Conference on Web Intelligence, Aug. 2017, pp. 723–729, doi: 10.1145/3106426.3106525.

[26] N. Modani, E. Khabiri, H. Srinivasan, and J. Caverlee, “Creating Diverse Product Review Summaries: A Graph Approach,” n Wang, J., al. Web Inf. Syst. Eng. – WISE 2015. WISE 2015. Lect. Notes Comput. Sci., vol. 9418, pp. 169–184, 2015, doi: 10.1007/978-3-319-26190-4_12.

[27] T. Shaik et al., “A Review of the Trends and Challenges in Adopting Natural Language Processing Methods for Education Feedback Analysis,” IEEE Access, vol. 10, pp. 56720–56739, 2022, doi: 10.1109/ACCESS.2022.3177752.

[28] U. Singh, A. Saraswat, H. K. Azad, K. Abhishek, and S. Shitharth, “Towards improving e-commerce customer review analysis for sentiment detection,” Sci. Rep., vol. 12, no. 1, p. 21983, Dec. 2022, doi: 10.1038/s41598-022-26432-3.

[29] S. Kumar, A. K. Kar, and P. V. Ilavarasan, “Applications of text mining in services management: A systematic literature review,” Int. J. Inf. Manag. Data Insights, vol. 1, no. 1, p. 100008, Apr. 2021, doi: 10.1016/j.jjimei.2021.100008.

[30] C. Fan, M. Chen, X. Wang, J. Wang, and B. Huang, “A Review on Data Preprocessing Techniques Toward Efficient and Reliable Knowledge Discovery From Building Operational Data,” Front. Energy Res., vol. 9, Mar. pp. 1-17, 2021, doi: 10.3389/fenrg.2021.652801.

[31] D. Antons, E. Grünwald, P. Cichy, and T. O. Salge, “The application of text mining methods in innovation research: current state, evolution patterns, and development priorities,” R&D Manag., vol. 50, no. 3, pp. 329–351, Jun. 2020, doi: 10.1111/radm.12408.

[32] Kauffmann, Peral, Gil, Ferrández, Sellers, and Mora, “Managing Marketing Decision-Making with Sentiment Analysis: An Evaluation of the Main Product Features Using Text Data Mining,” Sustainability, vol. 11, no. 15, p. 4235, Aug. 2019, doi: 10.3390/su11154235.

[33] Y. Wang and C. Zhang, “Using the full-text content of academic articles to identify and evaluate algorithm entities in the domain of natural language processing,” J. Informetr., vol. 14, no. 4, p. 101091, Nov. 2020, doi: 10.1016/j.joi.2020.101091.

[34] Y. Fauziah, S. Saifullah, and A. S. Aribowo, “Design Text Mining for Anxiety Detection using Machine Learning based-on Social Media Data during COVID-19 pandemic,” in Proceeding of LPPM UPN “Veteran” Yogyakarta Conference Series 2020–Engineering and Science Series, 2020, vol. 1, no. 1, pp. 253–261. [Online]. Available at: https://proceeding.researchsynergypress.com/index.php/ess/article/view/117.

[35] S. Zaware, D. Patadiya, A. Gaikwad, S. Gulhane, and A. Thakare, “Text Summarization using TF-IDF and Textrank algorithm,” in 2021 5th International Conference on Trends in Electronics and Informatics (ICOEI), Jun. 2021, pp. 1399–1407, doi: 10.1109/ICOEI51242.2021.9453071.

[36] M. Y. Saeed, M. Awais, R. Talib, and M. Younas, “Unstructured Text Documents Summarization With Multi-Stage Clustering,” IEEE Access, vol. 8, pp. 212838–212854, 2020, doi: 10.1109/ACCESS.2020.3040506.

[37] Z. Nasar, S. W. Jaffry, and M. K. Malik, “Textual keyword extraction and summarization: State-of-the-art,” Inf. Process. Manag., vol. 56, no. 6, p. 102088, Nov. 2019, doi: 10.1016/j.ipm.2019.102088.

[38] G. N. H., R. Siautama, A. C. I. A., and D. Suhartono, “Extractive Hotel Review Summarization based on TF/IDF and Adjective-Noun Pairing by Considering Annual Sentiment Trends,” Procedia Comput. Sci., vol. 179, pp. 558–565, 2021, doi: 10.1016/j.procs.2021.01.040.

[39] C. Ma, W. E. Zhang, M. Guo, H. Wang, and Q. Z. Sheng, “Multi-document Summarization via Deep Learning Techniques: A Survey,” ACM Comput. Surv., vol. 55, no. 5, pp. 1–37, May 2023, doi: 10.1145/3529754.


Refbacks

  • There are currently no refbacks.


Copyright (c) 2022 Rita Melina Anggraeni, Dewi Pramudi Ismi

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

___________________________________________________________
Science in Information Technology Letters
ISSN 2722-4139
Published by Association for Scientific Computing Electrical and Engineering (ASCEE)
W : http://pubs2.ascee.org/index.php/sitech
E : sitech@ascee.org, andri@ascee.org, andri.pranolo.id@ieee.org

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0

View My Stats