Dealing with weighting scheme in composite indicators: An unsupervised distance-machine learning proposal for quantitative data Jiménez Fernández, Eduardo Sánchez Domínguez, María Ángeles Composite indicator P2 distance Unsupervised machine learning Benchmarking Weighting scheme MARS PACS C02 C15 C44 C43 There is increasing interest in the construction of composite indicators to benchmark units. However, the mathematical approach on which the most commonly used techniques are based does not allow benchmarking in a reliable way. Additionally, the determination of the weighting scheme in the composite indicators remains one of the most troubling issues. Using the vector space formed by all the observations, we propose a new method for building composite indicators: a distance or metric that considers the concept of proximity among units. This approach enables comparisons between the units being studied, which are always quantitative. To this end, we take the P2 Distance method of Pena Trapero as a starting point and improve its limitations. The proposed methodology eliminates the linear dependence on the model and seeks functional relationships that enable constructing the most efficient model. This approach reduces researcher subjectivity by assigning the weighting scheme with unsupervised machine learning techniques. Monte Carlo simulations confirm that the proposed methodology is robust. 2022-10-18T10:43:29Z 2022-10-18T10:43:29Z 2022-05-21 journal article Eduardo Jiménez-Fernández, Angeles Sánchez, Mario Ortega-Pérez, Dealing with weighting scheme in composite indicators: An unsupervised distance-machine learning proposal for quantitative data, Socio-Economic Planning Sciences, Volume 83, 2022, 101339, ISSN 0038-0121, [https://doi.org/10.1016/j.seps.2022.101339] https://hdl.handle.net/10481/77377 10.1016/j.seps.2022.101339 eng http://creativecommons.org/licenses/by-nc-nd/4.0/ open access Attribution-NonCommercial-NoDerivatives 4.0 Internacional Elsevier