The performance of machine learning models depends on several factors, including data normalization, which can significantly improve its accuracy. There are many standardization techniques, and none is universally suitable; the choice depends on the characteristics of the problem, the predictive task, and the needs of the model used. This study analyzes how normalization techniques influence the outcomes of real estate price regression models using machine learning to uncover complex relationships between urban and economic factors. Six normalization techniques are employed to assess how they affect the estimation of relationships between property value and factors like social degradation, resident population, per capita income, green spaces, building conditions, and degraded neighborhood presence. The study’s findings underscore the pivotal role of normalization in shaping the perception of variables, accentuating critical thresholds, or distorting anticipated functional relationships. The work is the first application of a methodological approach to define the best technique on the basis of two criteria: statistical reliability and empirical evidence of the functional relationships obtainable with each standardization technique. Notably, the study underscores the potential of machine-learning-based regression to circumvent the limitations of conventional models, thereby yielding more robust and interpretable results.

The Interpretative Effects of Normalization Techniques on Complex Regression Modeling: An Application to Real Estate Values Using Machine Learning / Anelli, Debora; Morano, Pierluigi; Tajani, Francesco; Guarini, Maria Rosaria. - In: INFORMATION. - ISSN 2078-2489. - 16:6(2025). [10.3390/info16060486]

The Interpretative Effects of Normalization Techniques on Complex Regression Modeling: An Application to Real Estate Values Using Machine Learning

Morano, Pierluigi;
2025

Abstract

The performance of machine learning models depends on several factors, including data normalization, which can significantly improve its accuracy. There are many standardization techniques, and none is universally suitable; the choice depends on the characteristics of the problem, the predictive task, and the needs of the model used. This study analyzes how normalization techniques influence the outcomes of real estate price regression models using machine learning to uncover complex relationships between urban and economic factors. Six normalization techniques are employed to assess how they affect the estimation of relationships between property value and factors like social degradation, resident population, per capita income, green spaces, building conditions, and degraded neighborhood presence. The study’s findings underscore the pivotal role of normalization in shaping the perception of variables, accentuating critical thresholds, or distorting anticipated functional relationships. The work is the first application of a methodological approach to define the best technique on the basis of two criteria: statistical reliability and empirical evidence of the functional relationships obtainable with each standardization technique. Notably, the study underscores the potential of machine-learning-based regression to circumvent the limitations of conventional models, thereby yielding more robust and interpretable results.
2025
The Interpretative Effects of Normalization Techniques on Complex Regression Modeling: An Application to Real Estate Values Using Machine Learning / Anelli, Debora; Morano, Pierluigi; Tajani, Francesco; Guarini, Maria Rosaria. - In: INFORMATION. - ISSN 2078-2489. - 16:6(2025). [10.3390/info16060486]
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11589/298130
Citazioni
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 0
social impact