Enhancing Chatbot Intent Classification using Active Learning Pipeline for Optimized Data Preparation
Enhancing Chatbot Intent Classification using Active Learning Pipeline for Optimized Data Preparation
Author(s): Karolina KULIGOWSKA, Bartłomiej KOWALCZUKSubject(s): Economy, Accounting - Business Administration, ICT Information and Communications Technologies
Published by: Centrul European de Studii Manageriale și Administrarea Afacerii - CESMAA
Keywords: chatbot; intent classification; active learning; sentence-transformers; cluster label propagation
Summary/Abstract: This study presents a novel approach to enhancing chatbot intent classification through an optimized data preparation combined with Active Learning. We applied the clustering mechanism using a state-of-the-art sentence-transformers model with cosine similarity for cluster detection in order to categorize messages. This process was further refined through a dedicated Active Learning Pipeline, which focused on the most essential observations for labeling. Incorporating externally sourced labeled data from Scale AI, the labeling process was fine-tuned iteratively, until the model's performance stabilized. This approach shows promise for various datasets and tasks, suggesting a scalable solution for preparing data for supervised modeling and achieving optimal model performance in real-world commercial chatbot scenarios.
Journal: Journal of Applied Economic Sciences (JAES)
- Issue Year: XIX/2024
- Issue No: 3(85)
- Page Range: 317-325
- Page Count: 9
- Language: English