Classification of extremist text on the web using sentiment analysis approach

Owoeye, Kolade Olawande and Weir, George R. S. (2018) Classification of extremist text on the web using sentiment analysis approach. In: IEEE 5th International Conference on Computational Science and Computational Intelligence, 2018-12-13 - 2018-12-15.

[thumbnail of Owoeye-Weir-CSCI2018-Classification-of-extremist-text-on-the-web-using-sentiment-analysis-approach]
Preview
Text. Filename: Owoeye_Weir_CSCI2018_Classification_of_extremist_text_on_the_web_using_sentiment_analysis_approach.pdf
Accepted Author Manuscript

Download (194kB)| Preview

Abstract

The high volume of extremist materials online makes manual classification impractical. However, there is a need for automated classification techniques. One set of extremist web pages obtained by the TENE Web-crawler was initially subjected to manual classification. A sentiment-based classification model was then developed to automate the classification of such extremist Websites. The classification model measures how well the pages could be automatically matched against their appropriate classes. The method also identifies particular data items that differ in manual classification from their automated classification. The results from our method showed that overall web pages were correctly matched against the manual classification with a 93% success rate. In addition, a feature selection algorithm was able to reduce the original 26-feature set by one feature to attain a better overall performance of 94% in classifying the Web data.