In the problem statement it is mentioned as below.
In order to solve this problem, two different data sets are provided:
Training dataset
Prediction dataset
Only the data dictionary is available for download. Are there data sets available or they need to created on our own?