DOI:10.2298/CSIS100420034V

Ontology-based multi-label classification of economic articles

Sergeja Vogrinčič1 and Zoran Bosnić2

  1. Jožef Stefan International Postgraduate School
    Jamova 39, 1000 Ljubljana, Slovenia
    sergeja.sabo@mps.si
  2. University of Ljubljana, Faculty of Computer and Information Science
    Tržaška cesta 25, 1000 Ljubljana, Slovenia
    zoran.bosnic@fri.uni-lj.si

Abstract

The paper presents an approach to the task of automatic document categorization in the field of economics. Since the documents can be annotated with multiple keywords (labels), we approach this task by applying and evaluating multi-label classification methods of supervised machine learning. We describe forming a test corpus of 1015 economic documents that we automatically classify using a tool which integrates ontology construction with text mining methods. In our experimental work, we evaluate three groups of multi-label classification approaches: transformation to single-class problems, specialized multi-label models, and hierarchical/ranking models. The classification accuracies of all tested classification models indicate that there is a potential for using all of the evaluated methods to solve this task. The results show the benefits of using complex groups of approaches which benefit from exploiting dependence between the labels. A good alternative to these approaches is also single-class naive Bayes classifiers coupled with the binary relevance transformation approach.

Key words

ontology, multi-label classification, machine learning, text categorization, economics, document classification

Digital Object Identifier (DOI)

https://doi.org/10.2298/CSIS100420034V

Publication information

Volume 8, Issue 1 (January 2011)
Year of Publication: 2011
ISSN: 1820-0214 (Print) 2406-1018 (Online)
Publisher: ComSIS Consortium

Full text

DownloadAvailable in PDF
Portable Document Format

How to cite

Vogrinčič, S., Bosnić, Z.: Ontology-based multi-label classification of economic articles. Computer Science and Information Systems, Vol. 8, No. 1, 101-119. (2011)