Adiyaman University Repository

IRText: An Item Response Theory-Based Approach for Text Categorization

Show simple item record

dc.contributor.author Çoban, Önder
dc.date.accessioned 2025-12-15T11:26:36Z
dc.date.available 2025-12-15T11:26:36Z
dc.date.issued 2021
dc.identifier.issn 2193-567X
dc.identifier.uri http://dspace.adiyaman.edu.tr:8080/xmlui/handle/20.500.12414/7019
dc.description.abstract Text categorization (TC) is a machine learning task that tries to assign a text to one of the predefined categories. In a nutshell, texts are converted into numerical feature vectors in which each feature is bounded with a weight value. Afterward, a classifier is trained on vectorized texts and is used to classify previously unseen documents. Feature selection (FS) is also optionally applied to achieve better classification accuracy by using a lower number of features. Item response theory (IRT), on the other hand, is a set of statistical models designed to understand persons based on their responses to questions by assuming that responses on a given item are a function of both person and item properties. Even though there exist many studies devoted to understand, explore, and improve methods, there is not any previous study that aims at combining powers of these fields. As such, in this study, an IRT-based approach is proposed that suggests using the IRT score of a feature in both term weighting and FS that are important inter-steps of TC. The efficiency of the proposed approach is measured on two well-known benchmark datasets by comparing it with its two traditional peers. Experimental results show that the IRT-based approach can be used for text FS and there is open room for possible improvements. To the best of our knowledge, this study is the first of its kind which tries to adapt IRT for classical TC. tr
dc.language.iso en tr
dc.publisher SPRINGER HEIDELBERG tr
dc.subject Item response theory tr
dc.subject Text categorization tr
dc.subject Term weighting tr
dc.subject Feature selection tr
dc.title IRText: An Item Response Theory-Based Approach for Text Categorization tr
dc.type Article tr
dc.contributor.department Adiyaman Univ, Dept Comp Engn tr
dc.identifier.endpage 9439 tr
dc.identifier.issue 8 tr
dc.identifier.startpage 9423 tr
dc.identifier.volume 47 tr
dc.source.title ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING tr


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account