In this chapter we'll create and use a Jupyter notebook using Kubeflow on EKS to prepare a data set for training.