Kernelized bayesian matrix factorization

Mehmet Gönen; Suleiman A. Khan; Samuel Kaski

Kernelized bayesian matrix factorization

Mehmet Gönen, Suleiman A. Khan, Samuel Kaski

Research output: Contribution to conference › Paper › peer-review

Abstract

We extend kernelized matrix factorization with a fully Bayesian treatment and with an ability to work with multiple side information sources expressed as different kernels. Kernel functions have been introduced to matrix factorization to integrate side information about the rows and columns (e.g., objects and users in recommender systems), which is necessary for making out-of-matrix (i.e., cold start) predictions. We discuss specifically bipartite graph inference, where the output matrix is binary, but extensions to more general matrices are straightforward. We extend the state of the art in two key aspects: (i) A fully conjugate probabilistic formulation of the kernelized matrix factorization problem enables an efficient variational approximation, whereas fully Bayesian treatments are not computationally feasible in the earlier approaches, (ii) Multiple side information sources are ineluded, treated as different kernels in multiple kernel learning that additionally reveals which side information sources are informative. Our method outperforms alternatives in predicting drug-protein interactions on two data sets. We then show that our framework can also be used for solving multilabel learning problems by considering samples and labels as the two domains where matrix factorization operates on. Our algorithm obtains the lowest Hamming loss values on 10 out of 14 multilabel classification data sets compared to five state-of-the-art multilabel learning algorithms.

Original language	English (US)
Pages	1901-1909
Number of pages	9
State	Published - 2013
Externally published	Yes
Event	30th International Conference on Machine Learning, ICML 2013 - Atlanta, GA, United States Duration: Jun 16 2013 → Jun 21 2013

Other

Other	30th International Conference on Machine Learning, ICML 2013
Country/Territory	United States
City	Atlanta, GA
Period	6/16/13 → 6/21/13

ASJC Scopus subject areas

Human-Computer Interaction
Sociology and Political Science

Cite this

@conference{e9aae3250bc148b8b58792ac0f2c13ec,

title = "Kernelized bayesian matrix factorization",

abstract = "We extend kernelized matrix factorization with a fully Bayesian treatment and with an ability to work with multiple side information sources expressed as different kernels. Kernel functions have been introduced to matrix factorization to integrate side information about the rows and columns (e.g., objects and users in recommender systems), which is necessary for making out-of-matrix (i.e., cold start) predictions. We discuss specifically bipartite graph inference, where the output matrix is binary, but extensions to more general matrices are straightforward. We extend the state of the art in two key aspects: (i) A fully conjugate probabilistic formulation of the kernelized matrix factorization problem enables an efficient variational approximation, whereas fully Bayesian treatments are not computationally feasible in the earlier approaches, (ii) Multiple side information sources are ineluded, treated as different kernels in multiple kernel learning that additionally reveals which side information sources are informative. Our method outperforms alternatives in predicting drug-protein interactions on two data sets. We then show that our framework can also be used for solving multilabel learning problems by considering samples and labels as the two domains where matrix factorization operates on. Our algorithm obtains the lowest Hamming loss values on 10 out of 14 multilabel classification data sets compared to five state-of-the-art multilabel learning algorithms.",

author = "Mehmet G{\"o}nen and Khan, {Suleiman A.} and Samuel Kaski",

year = "2013",

language = "English (US)",

pages = "1901--1909",

note = "30th International Conference on Machine Learning, ICML 2013 ; Conference date: 16-06-2013 Through 21-06-2013",

}

TY - CONF

T1 - Kernelized bayesian matrix factorization

AU - Gönen, Mehmet

AU - Khan, Suleiman A.

AU - Kaski, Samuel

PY - 2013

Y1 - 2013

N2 - We extend kernelized matrix factorization with a fully Bayesian treatment and with an ability to work with multiple side information sources expressed as different kernels. Kernel functions have been introduced to matrix factorization to integrate side information about the rows and columns (e.g., objects and users in recommender systems), which is necessary for making out-of-matrix (i.e., cold start) predictions. We discuss specifically bipartite graph inference, where the output matrix is binary, but extensions to more general matrices are straightforward. We extend the state of the art in two key aspects: (i) A fully conjugate probabilistic formulation of the kernelized matrix factorization problem enables an efficient variational approximation, whereas fully Bayesian treatments are not computationally feasible in the earlier approaches, (ii) Multiple side information sources are ineluded, treated as different kernels in multiple kernel learning that additionally reveals which side information sources are informative. Our method outperforms alternatives in predicting drug-protein interactions on two data sets. We then show that our framework can also be used for solving multilabel learning problems by considering samples and labels as the two domains where matrix factorization operates on. Our algorithm obtains the lowest Hamming loss values on 10 out of 14 multilabel classification data sets compared to five state-of-the-art multilabel learning algorithms.

AB - We extend kernelized matrix factorization with a fully Bayesian treatment and with an ability to work with multiple side information sources expressed as different kernels. Kernel functions have been introduced to matrix factorization to integrate side information about the rows and columns (e.g., objects and users in recommender systems), which is necessary for making out-of-matrix (i.e., cold start) predictions. We discuss specifically bipartite graph inference, where the output matrix is binary, but extensions to more general matrices are straightforward. We extend the state of the art in two key aspects: (i) A fully conjugate probabilistic formulation of the kernelized matrix factorization problem enables an efficient variational approximation, whereas fully Bayesian treatments are not computationally feasible in the earlier approaches, (ii) Multiple side information sources are ineluded, treated as different kernels in multiple kernel learning that additionally reveals which side information sources are informative. Our method outperforms alternatives in predicting drug-protein interactions on two data sets. We then show that our framework can also be used for solving multilabel learning problems by considering samples and labels as the two domains where matrix factorization operates on. Our algorithm obtains the lowest Hamming loss values on 10 out of 14 multilabel classification data sets compared to five state-of-the-art multilabel learning algorithms.

UR - http://www.scopus.com/inward/record.url?scp=84897531872&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84897531872&partnerID=8YFLogxK

M3 - Paper

AN - SCOPUS:84897531872

SP - 1901

EP - 1909

T2 - 30th International Conference on Machine Learning, ICML 2013

Y2 - 16 June 2013 through 21 June 2013

ER -

Kernelized bayesian matrix factorization

Abstract

Other

ASJC Scopus subject areas

Other files and links

Fingerprint

Cite this