NACluster: A Non-Supervised Clustering Algorithm for Matching Multi Catalogues

Vinicius P. Freire 1 José A. F. de Macêdo 1 Fábio Porto 2, * Reza Akbarinia 3
* Corresponding author
3 ZENITH - Scientific Data Management
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier, CRISAM - Inria Sophia Antipolis - Méditerranée
Abstract : Astronomy surveys use powerful instruments to browse the sky and identify objects of interest within the surveyed region. Sky objects are individually characterized with spatial coordinates, identifying their position in the sky, in addition to other descriptive attributes. Composing an integrated view of the sky based on catalogues produced by different surveys faces a hard problem of matching objects that have been captured in various catalogues. Due to variations on capturing instruments calibration, the sky position of a single sky object may vary from a catalog to the other. Moreover, in particular dense regions of the sky this problem is exacerbated by a huge number of candidate matches for each given object. Traditional approaches for dealing with this problem use a threshold distance of to reduce the number of matching candidates. Additionally, they adopt a pairwise approach for matching n catalogues inferring transitivity among matches, which not always hold. In this paper, we present NACluster a non-supervised clustering algorithm for dealing with sky object matching in multiple catalogues. NACluster matching strategy extends the traditional k-means clustering algorithm by relaxing the number k of cluster (i.e. matched sky objects). We experiment NACluster with real and synthetic catalogues and show that the results present better accuracy than state of the art solutions.
Document type :
Conference papers
Complete list of metadatas

Cited literature [4 references]  Display  Hide  Download

https://hal-lirmm.ccsd.cnrs.fr/lirmm-01076107
Contributor : Reza Akbarinia <>
Submitted on : Tuesday, October 21, 2014 - 10:50:20 AM
Last modification on : Monday, August 19, 2019 - 9:54:02 AM
Long-term archiving on: Thursday, January 22, 2015 - 10:17:46 AM

File

2014-IEEE e-science.pdf
Files produced by the author(s)

Identifiers

Collections

Citation

Vinicius P. Freire, José A. F. de Macêdo, Fábio Porto, Reza Akbarinia. NACluster: A Non-Supervised Clustering Algorithm for Matching Multi Catalogues. International Conference on e-Science, Oct 2014, Guarujá, SP, Brazil. pp.83-86, ⟨10.1109/eScience.2014.61⟩. ⟨lirmm-01076107⟩

Share

Metrics

Record views

496

Files downloads

691