1 min readNov 10, 2018

You aren’t checking trends with the real test data, but with validation data. And to mimic real life, your validation data should be from a time period different than train data. Say, first 1 month for train, next 10 days for validation. A feature not holding trend between these two is very likely to not hold trend consistently across different comparisons including your real test data.

Abhay Pawar
Abhay Pawar

Written by Abhay Pawar

ML @ StitchFix, Instacart. Columbia and IIT Madras alum

Responses (1)