A hierarchical consensus architecture for robust document clustering

Research output: Book chapterConference contributionpeer-review

1 Citation (Scopus)

Abstract

A major problem encountered by text clustering practitioners is the difficulty of determining a priori which is the optimal text representation and clustering technique for a given clustering problem. As a step towards building robust document partitioning systems, we present a strategy based on a hierarchical consensus clustering architecture that operates on a wide diversity of document representations and partitions. The conducted experiments show that the proposed method is capable of yielding a consensus clustering that is comparable to the best individual clustering available even in the presence of a large number of poor individual labelings, outperforming classic non-hierarchical consensus approaches in terms of performance and computational cost.

Original languageEnglish
Title of host publicationAdvances in Information Retrieval - 29th European Conference on IR Research, ECIR 2007, Proceedings
PublisherSpringer Verlag
Pages741-744
Number of pages4
ISBN (Print)3540714944, 9783540714941
DOIs
Publication statusPublished - 2007
Event29th European Conference on IR Research, ECIR 2007 - Rome, Italy
Duration: 2 Apr 20075 Apr 2007

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume4425 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference29th European Conference on IR Research, ECIR 2007
Country/TerritoryItaly
CityRome
Period2/04/075/04/07

Fingerprint

Dive into the research topics of 'A hierarchical consensus architecture for robust document clustering'. Together they form a unique fingerprint.

Cite this