Automated approaches for detecting integration in student essays
Simon Hughes, Peter Hastings, Joseph Magliano, Susan Goldman, and Kimberly Lawless. Automated approaches for detecting integration in student essays. In Proceedings of Intelligent Tutoring Systems 2012, Xania, Crete, 2012.
Download
Abstract
Integrating information across multiple sources is an important literacy skill, yet there has been little research into automated methods for measuring integration in written text. This study investigated the efficacy of three different algorithms at classifying student essays according to an expert model of the essay topic which categorized statements by argument function, including claims and integration. A novel classification algorithm is presented which uses multi-word regular expressions. Its performance is compared to that of Latent Semantic Analysis and several variants of the Support Vector Machine algorithm at the same classification task. One variant of the SVM approach worked better overall, but another proved more successful at detecting integration within and across texts. This research has important implications for systems that can gauge the level of integration demonstrated through the written word.
BibTeX
@InProceedings{Hughes:its2012, author = name:spjsk, title = {Automated approaches for detecting integration in student essays}, booktitle = {Proceedings of Intelligent Tutoring Systems 2012}, year = 2012, editor = {S. Cerri and W. Clancey and G. Papadourakis and K. Panourgia}, address = {Xania, Crete}, cvnote = {Acceptance rate: 44\%. CORE Conference Rank: A.}, abstract = {Integrating information across multiple sources is an important literacy skill, yet there has been little research into automated methods for measuring integration in written text. This study investigated the efficacy of three different algorithms at classifying student essays according to an expert model of the essay topic which categorized statements by argument function, including claims and integration. A novel classification algorithm is presented which uses multi-word regular expressions. Its performance is compared to that of Latent Semantic Analysis and several variants of the Support Vector Machine algorithm at the same classification task. One variant of the SVM approach worked better overall, but another proved more successful at detecting integration within and across texts. This research has important implications for systems that can gauge the level of integration demonstrated through the written word.} }