Enhancing Chatbot Intent Classification using Active Learning Pipeline for Optimized Data Preparation Cover Image

Enhancing Chatbot Intent Classification using Active Learning Pipeline for Optimized Data Preparation
Enhancing Chatbot Intent Classification using Active Learning Pipeline for Optimized Data Preparation

Author(s): Karolina KULIGOWSKA, Bartłomiej KOWALCZUK
Subject(s): Economy, Accounting - Business Administration, ICT Information and Communications Technologies
Published by: Centrul European de Studii Manageriale și Administrarea Afacerii - CESMAA
Keywords: chatbot; intent classification; active learning; sentence-transformers; cluster label propagation

Summary/Abstract: This study presents a novel approach to enhancing chatbot intent classification through an optimized data preparation combined with Active Learning. We applied the clustering mechanism using a state-of-the-art sentence-transformers model with cosine similarity for cluster detection in order to categorize messages. This process was further refined through a dedicated Active Learning Pipeline, which focused on the most essential observations for labeling. Incorporating externally sourced labeled data from Scale AI, the labeling process was fine-tuned iteratively, until the model's performance stabilized. This approach shows promise for various datasets and tasks, suggesting a scalable solution for preparing data for supervised modeling and achieving optimal model performance in real-world commercial chatbot scenarios.

  • Issue Year: XIX/2024
  • Issue No: 3(85)
  • Page Range: 317-325
  • Page Count: 9
  • Language: English
Toggle Accessibility Mode