Approximate clustering of incomplete fingerprintsShow others and affiliations
2008 (English)In: Journal of Discrete Algorithms, ISSN 1570-8667, E-ISSN 1570-8675, Vol. 6, no 1, p. 103-108Article in journal (Refereed) Published
Abstract [en]
We study the problem of clustering fingerprints with at most p missing values (CMV(p) for short) naturally arising in oligonucleotide fingerprinting, which is an efficient method for characterizing DNA clone libraries. We show that already CMV(2) is NP-hard. We also show that a greedy algorithm yields a min(1+ln n , 2+pln l) approximation for CMV(p), and can be implemented to run in O(nl2^p) time. We also introduce other variants of the problem of clustering incomplete fingerprints based on slightly different optimization criteria and show that they can be approximated in polynomial time with ratios 2^(2p−1) and 2(1-1/(2^(2p))), respectively.
Place, publisher, year, edition, pages
Elsevier, 2008. Vol. 6, no 1, p. 103-108
Keywords [en]
approximation algorithms, oligonucleotide fingerprinting, clustering, NP-hardness
National Category
Engineering and Technology
Identifiers
URN: urn:nbn:se:mau:diva-2517DOI: 10.1016/j.jda.2007.01.004ISI: 000213937500013Scopus ID: 2-s2.0-39149127023Local ID: 16086OAI: oai:DiVA.org:mau-2517DiVA, id: diva2:1399280
2020-02-272020-02-272024-11-28Bibliographically approved