avatar
DS in my understanding# DataSciences - 数据科学
w*2
1
Not sure if this is right (I did similar thing for some biomedical projects):
You are facing a chunk of data without any previous knowledge from
literature or any other means, then ask yourself:
1. Is there any interesting question in this data set?
2. If so, how many groups/types can be formed?
3. If so, is there any difference among these groups?
4. If so, what is the difference?
5. If so, what are the differentiators?
6. If so, identify all of the major differentiators?
7. If so, build a model or index combining all of the differentiators and
predict the training data set. Do not over fit. Validate with another
independent data set. Apply the model to new data sets for prediction.
8. Reiterate to optimize the model.
9. Interpret the results according to different backgrounds/professions.
Is this logic/workflow right for DS? Thanks.
avatar
w*2
2
no response?
so ds is about data collection and cleaning?
not yet for later steps?
avatar
j*n
3
这些都是浮云,没啥好吹得,找个本科生就能做,大部分都是call个啥 function就完
了。code code code, can u scale the algo to PB? Can u decrease the latency
to ms? 这些才是有意义得工作
相关阅读
logo
联系我们隐私协议©2024 redian.news
Redian新闻
Redian.news刊载任何文章,不代表同意其说法或描述,仅为提供更多信息,也不构成任何建议。文章信息的合法性及真实性由其作者负责,与Redian.news及其运营公司无关。欢迎投稿,如发现稿件侵权,或作者不愿在本网发表文章,请版权拥有者通知本网处理。