Speaking of correlated variables: in the book "Discovering Knowledge in Data: An Introduction to Data Mining", Daniel Larose writes:
"One should take care to avoid feeding correlated variables to one’s data mining and
statistical models. At best, using correlated variables will overemphasize one data
component; at worst, using correlated variables will cause the model to become
unstable and deliver unreliable results."
In your practical experience, is this true?
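To make the "unstable" part of the quote concrete, here is a minimal sketch of what I think it refers to, assuming an ordinary least-squares model with two predictors. Everything in it is my own illustration and not from the book: the function name `coef_spread`, the correlation values 0.0 and 0.99, the sample size and the number of repetitions. It refits the same model on many freshly simulated samples and measures how much the fitted coefficient of the first predictor varies.

```python
import numpy as np

def coef_spread(rho, n=200, trials=300, seed=0):
    """Std. dev. of the fitted coefficient of x1 across repeated samples.

    rho is the (assumed) correlation between the two predictors.
    """
    rng = np.random.default_rng(seed)
    cov = np.array([[1.0, rho], [rho, 1.0]])
    coefs = []
    for _ in range(trials):
        # Draw two predictors with the requested correlation.
        X = rng.multivariate_normal([0.0, 0.0], cov, size=n)
        # True model: y = 1*x1 + 1*x2 + noise (no intercept).
        y = X[:, 0] + X[:, 1] + rng.normal(size=n)
        # Ordinary least squares fit.
        beta, *_ = np.linalg.lstsq(X, y, rcond=None)
        coefs.append(beta[0])
    return float(np.std(coefs))

spread_uncorrelated = coef_spread(rho=0.0)
spread_correlated = coef_spread(rho=0.99)
print("spread with rho=0.00:", spread_uncorrelated)
print("spread with rho=0.99:", spread_correlated)
```

In this toy setup the coefficient estimates should vary noticeably more from sample to sample when the predictors are strongly correlated, even though the true coefficients are the same in both cases. Whether this matters as much for other kinds of models, or on real data, is exactly what I am asking about.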
Lucian