用10-fold cross-validation 之后怎么挑Model?# DataSciences - 数据科学
t*i
1 楼
一个不大的数据,十几万个record, 一百个变量,用random forest作 binary
classification
因为有over-fitting, 决定用 10-fold cross-validation
做完之后,有十个 random forest Models
下一步 怎么做?
之后 是挑 validation error (on its set-aside 10th hold-out set) 最小的那个
Model吗?(需要一个final model 放进 production system)
Thanks!
classification
因为有over-fitting, 决定用 10-fold cross-validation
做完之后,有十个 random forest Models
下一步 怎么做?
之后 是挑 validation error (on its set-aside 10th hold-out set) 最小的那个
Model吗?(需要一个final model 放进 production system)
Thanks!