TY - JOUR
T1 - Mining the determinants of review helpfulness
T2 - a novel approach using intelligent feature engineering and explainable AI
AU - Kim, Jiho
AU - Lee, Hanjun
AU - Lee, Hongchul
N1 - Funding Information:
This research was supported by Brain Korea 21 FOUR.
Publisher Copyright:
© 2022, Emerald Publishing Limited.
PY - 2022
Y1 - 2022
N2 - Purpose: This paper aims to find determinants that can predict the helpfulness of online customer reviews (OCRs) with a novel approach. Design/methodology/approach: The approach consists of feature engineering using various text mining techniques including BERT and machine learning models that can classify OCRs according to their potential helpfulness. Moreover, explainable artificial intelligence methodologies are used to identify the determinants for helpfulness. Findings: The important result is that the boosting-based ensemble model showed the highest prediction performance. In addition, it was confirmed that the sentiment features of OCRs and the reputation of reviewers are important determinants that augment the review helpfulness. Research limitations/implications: Each online community has different purposes, fields and characteristics. Thus, the results of this study cannot be generalized. However, it is expected that this novel approach can be integrated with any platform where online reviews are used. Originality/value: This paper incorporates feature engineering methodologies for online reviews, including the latest methodology. It also includes novel techniques to contribute to ongoing research on mining the determinants of review helpfulness.
AB - Purpose: This paper aims to find determinants that can predict the helpfulness of online customer reviews (OCRs) with a novel approach. Design/methodology/approach: The approach consists of feature engineering using various text mining techniques including BERT and machine learning models that can classify OCRs according to their potential helpfulness. Moreover, explainable artificial intelligence methodologies are used to identify the determinants for helpfulness. Findings: The important result is that the boosting-based ensemble model showed the highest prediction performance. In addition, it was confirmed that the sentiment features of OCRs and the reputation of reviewers are important determinants that augment the review helpfulness. Research limitations/implications: Each online community has different purposes, fields and characteristics. Thus, the results of this study cannot be generalized. However, it is expected that this novel approach can be integrated with any platform where online reviews are used. Originality/value: This paper incorporates feature engineering methodologies for online reviews, including the latest methodology. It also includes novel techniques to contribute to ongoing research on mining the determinants of review helpfulness.
KW - BERT
KW - Explainable artificial intelligence
KW - Information extraction
KW - Online customer reviews
KW - Review helpfulness
KW - Text mining
UR - http://www.scopus.com/inward/record.url?scp=85134172533&partnerID=8YFLogxK
U2 - 10.1108/DTA-12-2021-0359
DO - 10.1108/DTA-12-2021-0359
M3 - Article
AN - SCOPUS:85134172533
SN - 0033-0337
SP - 1
EP - 23
JO - Data Technologies and Applications
JF - Data Technologies and Applications
ER -