VLG Data Engineering
  • Blog
  • About
VLG Data Engineering

dplyr


Back to basics: Scaling train and test samples.

 Posted on October 12, 2020

Splitting and scaling a dataset seems easy. Well, it is admittedly not that hard, however it can be tricky. Today we will see how to properly split and scale a dataset, as this step if often necessary before any ML wizardry. Let us do this with a few R & Python packages/modules. [Read More]
scaling  normalize  standardize  spark  pyspark  python  r  dplyr  caret 

Vincent Le Goualher  • © 2023  •  VLG Data Engineering

Hugo v0.110.0 powered  •  Theme Beautiful Hugo adapted from Beautiful Jekyll