Reliability in ICA-based text classification

Research output: Book chapterChapterpeer-review

6 Citations (Scopus)

Abstract

This paper introduces a novel approach for improving the reliability of ICA-based text classifiers, attempting to make the most of the independent components of the text data. In this framework, two issues are adressed: firstly, a relative relevance measure for category assignment is presented. And secondly, a reliability control process is included in the classifier, avoiding the classification of documents belonging to none of the categories defined during the training stage. The experiments have been conducted on a journalistic-style text corpus in Catalan, achieving encouraging results in terms of rejection accuracy. However, similar results are obtained when comparing the proposed relevance measure to the classic magnitude-based technique for category assignment.

Original languageEnglish
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
EditorsCarlos G. Puntonet, Alberto Prieto
PublisherSpringer Verlag
Pages1213-1220
Number of pages8
ISBN (Electronic)3540230564, 9783540230564
DOIs
Publication statusPublished - 2004

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume3195
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Fingerprint

Dive into the research topics of 'Reliability in ICA-based text classification'. Together they form a unique fingerprint.

Cite this