I have been working with the San Diego water quality data project: https://www.sandiegodata.org/2018/04/summer-water-quality-data-proj… Here are the data sets: https://data.sandiegodata.org/dataset?tags=water-project Regretfully, my complete works do not fit into… Read More »San Diego Water Pollution Map by Stations
Neural networks are considered complicated, and they are always explained in terms of neurons and brain function. But we do not need to learn how to… Read More »Neural Networks as a Corporation Chain of Command
In this post I will sometimes use the term “variable” for “feature” (“predictor”) or “outcome” (“predicted value”). The question of variable dependencies for a particular data set is quite… Read More »Measuring Dependence of Variables with Confidence Intervals.
Choosing features to improve the performance of a particular algorithm is a difficult problem. Currently there is PCA, which is difficult to understand (although it… Read More »Improving performance of random forests for a particular value of outcome by adding chosen features
There are many ways to choose features from given data, and it is always a challenge to pick the ones with which a particular… Read More »Choosing features for random forests algorithm