In this work we investigate several machine learning methods to tackle the problem of intent classification for dialogue utterances. We start with Bag-of-Words (BoW) in combination with Naïve Bayes (NB). After that, we employ Continuous Bag-of-Words (CBoW) coupled with Support Vector Machines (SVM). Then follow Long Short-Term Memory (LSTM) networks, which are made bidirectional. The best performing model is hierarchical, such that it can take advantage of the natural taxonomy within classes. The main experiments are a comparison between these methods on an open sourced academic dataset. In the first experiment we consider the full dataset. We also consider the given subsets of data separately, in order to compare our results with state-of-the-art vendor solutions. In general we find that the SVM models outperform the LSTM models. The former models achieve the highest macro-F1 for the full dataset, and in most of the individual datasets. We also found out that the incorporation of the hierarchical structure in the intents improves the performance.

Additional Metadata
Persistent URL dx.doi.org/10.1109/MIS.2019.2954966, hdl.handle.net/1765/122153
Journal IEEE Intelligent Systems
Rights no access
Citation
Schuurmans, J. (Jetze), & Frasincar, F. (2019). Intent Classification for Dialogue Utterances. IEEE Intelligent Systems. doi:10.1109/MIS.2019.2954966