Convex optimization for the planted k-disjoint-clique problem

成果类型:
Article
署名作者:
Ames, Brendan P. W.; Vavasis, Stephen A.
署名单位:
California Institute of Technology; University of Waterloo
刊物名称:
MATHEMATICAL PROGRAMMING
ISSN/ISSBN:
0025-5610
DOI:
10.1007/s10107-013-0733-1
发表日期:
2014
页码:
299-337
关键词:
large hidden clique NORM
摘要:
We consider the -disjoint-clique problem. The input is an undirected graph in which the nodes represent data items, and edges indicate a similarity between the corresponding items. The problem is to find within the graph disjoint cliques that cover the maximum number of nodes of . This problem may be understood as a general way to pose the classical 'clustering' problem. In clustering, one is given data items and a distance function, and one wishes to partition the data into disjoint clusters of data items, such that the items in each cluster are close to each other. Our formulation additionally allows 'noise' nodes to be present in the input data that are not part of any of the cliques. The -disjoint-clique problem is NP-hard, but we show that a convex relaxation can solve it in polynomial time for input instances constructed in a certain way. The input instances for which our algorithm finds the optimal solution consist of disjoint large cliques (called 'planted cliques') that are then obscured by noise edges inserted either at random or by an adversary, as well as additional nodes not belonging to any of the planted cliques.
来源URL: