The goal of this thesis is the implementation of an essay grading system.
We lean heavily on the methodology of an existing system, which, besides using syntactical measurements, also uses coherence and semantic cosistency measures.
We implement the methodology in the Orange data mining tool, with a firendly user interface, optional use of word embeddings for word representation and the possibility for further developments of the system.
The system is evaluated on public datasets from the Kaggle website. The results are to the most possible extent compared with the results of the existing methodology and analyzed in detail.
We also compare several attribute selection methods, which improve our results. Main contributions of this work are comprised of (1) implementation of the system, (2) ease of use and (3) improvements upon previous work, including additional computing options and detailed attribute selection analysis.
|