A few basic questions about data scientist work # DataSciences - Data Science
y*a
1
I have a master's degree in statistics and would like to gradually learn more
about data science. Could the experienced folks here share some basic ideas? Thanks.
In data science jobs, how big are the data sets you usually work with (how
many parameters/fields do they contain)?
Were there dirty data problems? Was there sampling bias? How did you solve
those problems, and is there anything in particular I should learn?
How do you usually get your data? What languages have you used for
extraction?
What do you use to clean and/or analyze the data? What quantitative methods
are usually used?
d*n
2
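High-dimensional, sparse, with dimensions that keep changing, or with no simple definition of dimension at all (e.g., SN data).
Typically somewhere between a few thousand and a few M; anything beyond a few M is an engineering problem.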
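
To make "high-dimensional, sparse, with dimensions that keep changing" concrete, here is a minimal sketch in plain Python. The record layout, the feature names, and the helpers `build_sparse_rows` and `density` are all hypothetical and only for illustration; they are not taken from any poster's actual data or tooling.

```python
from collections import defaultdict

# Hypothetical records: each user touches only a handful of features
# out of a much larger, still-growing set of possible features.
records = [
    {"user": "u1", "features": ["item_17", "item_902"]},
    {"user": "u2", "features": ["item_902", "item_40031", "item_5"]},
    {"user": "u3", "features": ["item_777777"]},  # a feature not seen before
]


def build_sparse_rows(records):
    """Assign column indices to feature names on the fly and store each
    row as {column_index: count}, keeping only the nonzero entries."""
    feature_index = {}
    rows = []
    for rec in records:
        row = defaultdict(int)
        for name in rec["features"]:
            # An unseen feature gets the next free column, so the number
            # of dimensions grows as new features show up in the data.
            col = feature_index.setdefault(name, len(feature_index))
            row[col] += 1
        rows.append(dict(row))
    return rows, feature_index


def density(rows, n_cols):
    """Fraction of cells that are nonzero in the implied rows x n_cols matrix."""
    nonzero = sum(len(r) for r in rows)
    return nonzero / (len(rows) * n_cols)


rows, feature_index = build_sparse_rows(records)
print(len(feature_index), density(rows, len(feature_index)))
```

With realistic data the number of columns can reach thousands or millions while each row keeps only a few nonzero entries, which is why storing just the nonzero cells matters.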
m*a
3
Could you elaborate a bit, or give an example?

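[d****n wrote:]
: High-dimensional, sparse, with dimensions that keep changing, or with no simple definition of dimension at all (e.g., SN data).
: Typically somewhere between a few thousand and a few M; anything beyond a few M is an engineering problem.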

E*s
4
Interviewing for Google?

[y****a wrote:]
: I have a master's degree in statistics and would like to gradually learn more
: about data science. Could the experienced folks here share some basic ideas? Thanks.
: In data science jobs, how big are the data sets you usually work with (how
: many parameters/fields do they contain)?
: Were there dirty data problems? Was there sampling bias? How did you solve
: those problems, and is there anything in particular I should learn?
: How do you usually get your data? What languages have you used for
: extraction?
: What do you use to clean and/or analyze the data? What quantitative methods
: are usually used?
