To rectify the problem, the testing set must have stopped at that particular dataset its referencing.
If you can reshuffle the whole dataset and rerun, if it still reface the same data, then something is wrong with that particular dataset.Delete it and check it out.
If that doesn't work check your pipeline for bottleneck.