Biclustering is an intrinsically challenging andhighly complex problem, particularly studied in thebiology field, where the goal is to simultaneouslycluster genes and samples of an expression data matrix.In this paper we present a novel approach togene expression biclustering by providing a binaryFactor Graph formulation to such problem. In moredetail, we reformulate biclustering as a sequentialsearch for single biclusters and use an efficient optimizationprocedure based on the Max Sum algorithm.Such approach, drastically alleviates thescaling issues of previous approaches for biclusteringbased on Factor Graphs obtaining significantlymore accurate results on synthetic datasets. A furtheranalysis on two real-world datasets confirmsthe potentials of the proposed methodology whencompared to alternative state of the art methods.
Biclustering gene expressions using factor graphs and the max-sum algorithm
Denitto, Matteo;FARINELLI, Alessandro;BICEGO, Manuele
2015-01-01
Abstract
Biclustering is an intrinsically challenging andhighly complex problem, particularly studied in thebiology field, where the goal is to simultaneouslycluster genes and samples of an expression data matrix.In this paper we present a novel approach togene expression biclustering by providing a binaryFactor Graph formulation to such problem. In moredetail, we reformulate biclustering as a sequentialsearch for single biclusters and use an efficient optimizationprocedure based on the Max Sum algorithm.Such approach, drastically alleviates thescaling issues of previous approaches for biclusteringbased on Factor Graphs obtaining significantlymore accurate results on synthetic datasets. A furtheranalysis on two real-world datasets confirmsthe potentials of the proposed methodology whencompared to alternative state of the art methods.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.