High Accuracy Rule-based Question Classification using Question Syntax and Semantics
High Accuracy Rule-based Question Classification using Question Syntax and Semantics
{Mark Lee Harish Tayyar Madabushi}

Abstract
We present in this paper a purely rule-based system for Question Classification which we divide into two parts: The first is the extraction of relevant words from a question by use of its structure, and the second is the classification of questions based on rules that associate these words to Concepts. We achieve an accuracy of 97.2{%}, close to a 6 point improvement over the previous State of the Art of 91.6{%}. Additionally, we believe that machine learning algorithms can be applied on top of this method to further improve accuracy.
Benchmarks
| Benchmark | Methodology | Metrics |
|---|---|---|
| text-classification-on-trec-50 | Rules | Error: 2.8 |
Build AI with AI
From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.