Using machine learning to develop customer insights from user-generated content




Mustak, Mekhail; Hallikainen, Heli; Laukkanen, Tommi; Plé, Loïc; Hollebeek, Linda D.; Aleem, Majid

PublisherElsevier

2024

Journal of Retailing and Consumer Services

104034

81

0969-6989

1873-1384

DOIhttps://doi.org/10.1016/j.jretconser.2024.104034

https://doi.org/10.1016/j.jretconser.2024.104034

https://research.utu.fi/converis/portal/detail/Publication/457564510



Uncovering customer insights (CI) is indispensable for contemporary marketing strategies. The widespread availability of user-generated content (UGC) presents a unique opportunity for firms to gain a nuanced understanding of their customers. However, the size and complexity of UGC datasets pose significant challenges for traditional market research methods, limiting their effectiveness in this context. To address this challenge, this study leverages natural language processing (NLP) and machine learning (ML) techniques to extract nuanced insights from UGC. By integrating sentiment analysis and topic modeling algorithms, we analyzed a dataset of approximately four million X posts (formerly tweets) encompassing 20 global brands across industries. The findings reveal primary brand-related emotions and identify the top 10 keywords indicative of brand-related sentiment. Using FedEx as a case study, we identify five prominent areas of customer concern: parcel tracking, small business services, the firm's comparative performance, package delivery dynamics, and customer service. Overall, this study offers a roadmap for academics to navigate the complex landscape of generating CI from UGC datasets. It thus raises pertinent practical implications, including boosting customer service, refining marketing strategies, and better understanding customer needs and preferences, thereby contributing to more effective, more responsive business strategies.


Last updated on 2025-27-01 at 19:49