Clustering and Geodesic Scaling of Dissimilarities on the Spherical Surface
Metadatos
Mostrar el registro completo del ítemEditorial
Springer Nature
Materia
Multidimensional scaling Dissimilarity Geodesic distance
Fecha
2024-01-30Referencia bibliográfica
Fernando Vera, J., Subiabre, R. & Macías, R. Clustering and Geodesic Scaling of Dissimilarities on the Spherical Surface. JABES (2024). https://doi.org/10.1007/s13253-023-00597-4
Patrocinador
Grant RTI2018-099723-B-I00, Ministry of Science, Innovation and Universities of Spain, co-financed by FEDER; Grant B-CTS-184-UGR20 funded by ERDF, EU/Ministry of Economic Transformation, Industry, Knowledge and Universities of Andalusia; Austral University of Chile; The Pacific Alliance Grant of the Carolina FoundationResumen
Spherical embedding is an important tool in several fields of data analysis, including
environmental data, spatial statistics, text mining, gene expression analysis, medical
research and, in general, areas in which the geodesic distance is a relevant factor. Many
data acquisition technologies are related to massive data acquisition, and these highdimensional
vectors are often normalised and transformed into spherical data. In this
representation of data on spherical surfaces, multidimensional scaling plays an important
role. Traditionally, the methods of clustering and representation have been combined,
since the precision of the representation tends to decrease when a large number of objects
are involved, which makes interpretation difficult. In this paper, we present a model that
partitions objects into classes while simultaneously representing the cluster centres on a
spherical surface based on geodesic distances. The model combines a partition algorithm
based on the approximation of dissimilarities to geodesic distances with a representation
procedure for geodesic distances. In this process, the dissimilarities are transformed in
order to optimise the radius of the sphere. The efficiency of the procedure described
is analysed by means of an extensive Monte Carlo experiment, and its usefulness is
illustrated for real data sets.