View article

[PDF] from psu.edu

Associative clustering for exploring dependencies between functional genomics data sets

Authors

Samuel Kaski, Janne Nikkila, Janne Sinkkonen, Leo Lahti, Juha EA Knuuttila, Christophe Roos

Publication date

2005/9/6

Journal

IEEE/ACM Transactions on Computational Biology and bioinformatics

Volume

Issue

Pages

203-216

Publisher

IEEE

Description

High-throughput genomic measurements, interpreted as cooccurring data samples from multiple sources, open up a fresh problem for machine learning: What is in common in the different data sets, that is, what kind of statistical dependencies are there between the paired samples from the different sets? We introduce a clustering algorithm for exploring the dependencies. Samples within each data set are grouped such that the dependencies between groups of different sets capture as much of pairwise dependencies between the samples as possible. We formalize this problem in a novel probabilistic way, as optimization of a Bayes factor. The method is applied to reveal commonalities and exceptions in gene expression between organisms and to suggest regulatory interactions in the form of dependencies between gene expression profiles and regulator binding patterns.

Total citations

Cited by 47

20042005200620072008200920102011201220132014201520162017201820192020202120222023202420251 6 5 6 4 4 3 1 2 2 4 4 2 1 1

Scholar articles

Associative clustering for exploring dependencies between functional genomics data sets

S Kaski, J Nikkila, J Sinkkonen, L Lahti, JEA Knuuttila… - IEEE/ACM Transactions on Computational Biology and …, 2005

zproxy.org