其实以前鹊桥也常吵架 - 未名空间MITBBS历史存档

国际科技财经博客移民网络热点娱乐民生时事公众号

Redian新闻

>未名空间

>Piebridge - 鹊桥

其实以前鹊桥也常吵架

其实以前鹊桥也常吵架# Piebridge - 鹊桥

s*c2012-06-10 07:06

1 楼

求助万能的学术版
从500个样品中确定了mean = m standard deviation=s
问，随机取9个样品取平均，平均值大于m+s的几率是多少。

I*D2012-06-10 07:06

2 楼

但那种骂大街似的好像还真从没发生过。。。。

H*72012-06-10 07:06

3 楼

0.5

b*z2012-06-10 07:06

4 楼

现在是女女炒，以前是女男炒。

s*c2012-06-10 07:06

5 楼

对呀。可有本书的答案告我说要用students test。然后给了1% 做答案
我凌乱了

【在 H******7 的大作中提到】

: 0.5

I*D2012-06-10 07:06

6 楼

看来吵架上男人还是差个档次，难怪男人喜欢用拳头解决问题。。。

【在 b****z 的大作中提到】

: 现在是女女炒，以前是女男炒。

M*P2012-06-10 07:06

7 楼

书上写的未必都对吧？
什么书？

【在 s*****c 的大作中提到】

: 对呀。可有本书的答案告我说要用students test。然后给了1% 做答案
: 我凌乱了

s*02012-06-10 07:06

8 楼

这次可以真正见识到那些美女的淑女风范，
什么b呀，操呀，都成了日常词汇。
什么虚伪了，小人呀，都成了评价一个从未谋面的真凭实据。

l*o2012-06-10 07:06

9 楼

怎么可能

【在 H******7 的大作中提到】

: 0.5

I*D2012-06-10 07:06

10 楼

lol。。。

【在 s********0 的大作中提到】

: 这次可以真正见识到那些美女的淑女风范，
: 什么b呀，操呀，都成了日常词汇。
: 什么虚伪了，小人呀，都成了评价一个从未谋面的真凭实据。

y*c2012-06-10 07:06

11 楼

就是3个sigma之外的几率吧。

【在 s*****c 的大作中提到】

: 求助万能的学术版
: 从500个样品中确定了mean = m standard deviation=s
: 问，随机取9个样品取平均，平均值大于m+s的几率是多少。

b*z2012-06-10 07:06

12 楼

基本上这个版的大部分是剩女
剩女年级大，又找不到男人，又在美国压力大，国内的人又催，所以易怒是正常的。
这里的男人嘛。。很多都来看热闹的，或者来征个女人玩。男人毕竟看的开；毕竟结
婚也不是什么必需品。

【在 I*D 的大作中提到】

: 看来吵架上男人还是差个档次，难怪男人喜欢用拳头解决问题。。。

G*Y2012-06-10 07:06

13 楼

对个P呀

【在 s*****c 的大作中提到】

: 对呀。可有本书的答案告我说要用students test。然后给了1% 做答案
: 我凌乱了

C*k2012-06-10 07:06

14 楼

吵架也好，骂大街也好，事情都已经过去了，没有人在纠结了。
你发这种不痛不痒的帖子到底想说什么？

【在 I*D 的大作中提到】

: 但那种骂大街似的好像还真从没发生过。。。。

g*a2012-06-10 07:06

15 楼

你得先说明population是什么distribution。n（0，1）之类的。
[在 springc (阿泉) 的大作中提到：]
：求助万能的学术版
：从500个样品中确定了mean = m standard deviation=s
：...........

h*m2012-06-10 07:06

16 楼

你在鹊版骂大街用的脏字还少吗？

【在 I*D 的大作中提到】

: 但那种骂大街似的好像还真从没发生过。。。。

s*c2012-06-10 07:06

17 楼

So what's your answer?

【在 G**Y 的大作中提到】

: 对个P呀

D*G2012-06-10 07:06

18 楼

关键是听她们B来B去的~众WSN谁也没见着真东西呀！
太TM虐待大家了~！

【在 s********0 的大作中提到】

: 这次可以真正见识到那些美女的淑女风范，
: 什么b呀，操呀，都成了日常词汇。
: 什么虚伪了，小人呀，都成了评价一个从未谋面的真凭实据。

s*c2012-06-10 07:06

19 楼

You can assume it is normal distribution.
Actually, based on central limit theorem, 9 samples average will be very
close to normal distribution regardless.

【在 g**a 的大作中提到】

: 你得先说明population是什么distribution。n（0，1）之类的。
: [在 springc (阿泉) 的大作中提到：]
: ：求助万能的学术版
: ：从500个样品中确定了mean = m standard deviation=s
: ：...........

l*g2012-06-10 07:06

20 楼

嘻嘻　谁骂大街了？　把你吵醒了？

【在 I*D 的大作中提到】

: 但那种骂大街似的好像还真从没发生过。。。。

g*a2012-06-10 07:06

21 楼

谁说的？9不够。尤其是你这种要看tail的情况，9太小了。

【在 s*****c 的大作中提到】

: You can assume it is normal distribution.
: Actually, based on central limit theorem, 9 samples average will be very
: close to normal distribution regardless.

C*k2012-06-10 07:06

22 楼

不管你是有意还是无意，你很有添油加醋，煽风点火的潜质。
可能你不知道我在说什么，但是很有id 能看懂。

【在 l**********g 的大作中提到】

: 嘻嘻　谁骂大街了？　把你吵醒了？

s*c2012-06-10 07:06

23 楼

The original question says to assume the data from 500 individual are normal
distribution.

【在 g**a 的大作中提到】

: 谁说的？9不够。尤其是你这种要看tail的情况，9太小了。

l*g2012-06-10 07:06

24 楼

好吧
不要这么认真吧，我开玩笑的呢　大周末的不要太认真啊，亲～

【在 C********k 的大作中提到】

: 不管你是有意还是无意，你很有添油加醋，煽风点火的潜质。
: 可能你不知道我在说什么，但是很有id 能看懂。

h*e2012-06-10 07:06

25 楼

赌场扔骰子连续九次都输的也不是少数。

【在 s*****c 的大作中提到】

: You can assume it is normal distribution.
: Actually, based on central limit theorem, 9 samples average will be very
: close to normal distribution regardless.

S*82012-06-10 07:06

26 楼

围观吵架。。。

s*c2012-06-10 07:06

27 楼

Well, that has nothing to do with what I said.
I am also not sure about your definition of 少数. I can only assert that 连
续九次都输 will occur less frequently than otherwise.

【在 h******e 的大作中提到】

:
: 赌场扔骰子连续九次都输的也不是少数。

h*e2012-06-10 07:06

28 楼

找500个赌徒做个问卷，连续输9+把的目测一半以上。

【在 s*****c 的大作中提到】

: Well, that has nothing to do with what I said.
: I am also not sure about your definition of 少数. I can only assert that 连
: 续九次都输 will occur less frequently than otherwise.

g*a2012-06-10 07:06

29 楼

你这不回到我原来说的了吗？对population distribution需要一个假设。你原题里面
没有写population distribution是什么。如果你限制了population只有500，那完全可
以把mean distribution simulate出来。choose（500，9）也就是5e18个点，均匀取1/
10e10就能看出来了。

normal

【在 s*****c 的大作中提到】

: The original question says to assume the data from 500 individual are normal
: distribution.

s*c2012-06-10 07:06

30 楼

Good idea. If I have 500000 samples, I might apply some funding to get a
good computer to run this simulation.

1/

【在 g**a 的大作中提到】

: 你这不回到我原来说的了吗？对population distribution需要一个假设。你原题里面
: 没有写population distribution是什么。如果你限制了population只有500，那完全可
: 以把mean distribution simulate出来。choose（500，9）也就是5e18个点，均匀取1/
: 10e10就能看出来了。
:
: normal

g*a2012-06-10 07:06

31 楼

哇靠，你搞清楚sample和population的区别没有？前面的问题是population是500，
sample是9，
你现在就变成了sample是500000？那population是多少？你有500000 sample,干嘛还要
做simulation?直接CLT啊。mean=mu，var=delta^2/n。
你不要告诉我，你连population和sample这个基本统计概念都没搞清楚就在这瞎嚷嚷！

【在 s*****c 的大作中提到】

: Good idea. If I have 500000 samples, I might apply some funding to get a
: good computer to run this simulation.
:
: 1/

s*c2012-06-10 07:06

32 楼

No, my population is really the population on earth. The assumption given
by the question is that the quantity I measure (say height) follows normal
distribution. Now I measured 500 individuals (500 samples). From that we
computed the mean (m) and standard deviation (s) from the 500 samples. The
question is the probability of the average of the 9 randomly sampled
individual to be greater than m+s.
The solution you provided is basically monte carlo method. To get accurate
answer, I have to run many iterations to reduce the variance. Actually what
you described will contain bias from the 500 sample itself. What you
really need to do is to get another 500 samples and simulate. I am just
pointing out that it is absurd to resort to this brute-force approach if we
can analytically solve the problem.

【在 g**a 的大作中提到】

: 哇靠，你搞清楚sample和population的区别没有？前面的问题是population是500，
: sample是9，
: 你现在就变成了sample是500000？那population是多少？你有500000 sample,干嘛还要
: 做simulation?直接CLT啊。mean=mu，var=delta^2/n。
: 你不要告诉我，你连population和sample这个基本统计概念都没搞清楚就在这瞎嚷嚷！

g*a2012-06-10 07:06

33 楼

9 samples out of 500 or 9 samples out of population?
If it is the second one, assuming your 500 samples are iid and random, you
should be able to use m and s to estimate sample mean and variance. The dist
for 9 samples mean should also follow normal dist with mean and variance
calculated from estimated population mean and variance.

The
accurate
what

【在 s*****c 的大作中提到】

: No, my population is really the population on earth. The assumption given
: by the question is that the quantity I measure (say height) follows normal
: distribution. Now I measured 500 individuals (500 samples). From that we
: computed the mean (m) and standard deviation (s) from the 500 samples. The
: question is the probability of the average of the 9 randomly sampled
: individual to be greater than m+s.
: The solution you provided is basically monte carlo method. To get accurate
: answer, I have to run many iterations to reduce the variance. Actually what
: you described will contain bias from the 500 sample itself. What you
: really need to do is to get another 500 samples and simulate. I am just

n*r2012-06-10 07:06

34 楼

是问sample mean > u + s的概率，不是大于u的，所以显然不是0.5

【在 s*****c 的大作中提到】

: 对呀。可有本书的答案告我说要用students test。然后给了1% 做答案
: 我凌乱了

s*c2012-06-10 07:06

35 楼

0.5%
basically 9 sample mean will have standard deviation of s/3.
So 3 sigma gives you 99%. they only ask you the part that is greater. So 0
.5%.
I don't think we need to use t-distribution because the m and s from 500
samples (of a normal distribution) should be good enough estimation of the u
and sigma of the underlying population. I will need to use t-distribution
if I get m and s from the 9 samples.

【在 n******r 的大作中提到】

: 是问sample mean > u + s的概率，不是大于u的，所以显然不是0.5

s*c2012-06-10 07:06

36 楼

That part I am fuzzing about. I don't have that book with me. What's the
difference? If it is randomly drawn from the 500 samples (I guess we can
assume it is our finite population).

dist

【在 g**a 的大作中提到】

: 9 samples out of 500 or 9 samples out of population?
: If it is the second one, assuming your 500 samples are iid and random, you
: should be able to use m and s to estimate sample mean and variance. The dist
: for 9 samples mean should also follow normal dist with mean and variance
: calculated from estimated population mean and variance.
:
: The
: accurate
: what

n*r2012-06-10 07:06

37 楼

这题用t-test是对的
在实际中假定知道population distribution的情况很少，除了题目明确给定的理想情况
如果population u 和 s 未知，sample size较小（30以下），应该用t-test而不是z-
test
你找本统计推断的书或者google一下t-test和z-test的选择

0
u
distribution

【在 s*****c 的大作中提到】

: 0.5%
: basically 9 sample mean will have standard deviation of s/3.
: So 3 sigma gives you 99%. they only ask you the part that is greater. So 0
: .5%.
: I don't think we need to use t-distribution because the m and s from 500
: samples (of a normal distribution) should be good enough estimation of the u
: and sigma of the underlying population. I will need to use t-distribution
: if I get m and s from the 9 samples.

s*c2012-06-10 07:06

38 楼

I know.
But the m and s are estimated from 500 samples!
My understanding is that if we obtain m and s from <30 samples, then we have
to use t-test.

情况

【在 n******r 的大作中提到】

: 这题用t-test是对的
: 在实际中假定知道population distribution的情况很少，除了题目明确给定的理想情况
: 如果population u 和 s 未知，sample size较小（30以下），应该用t-test而不是z-
: test
: 你找本统计推断的书或者google一下t-test和z-test的选择
:
: 0
: u
: distribution

n*r2012-06-10 07:06

39 楼

你的理解不对
不管你的estimate是从500还是50000个sample里来的，都不能直接当成population
estimate
只有假定population分布已知的情况才能用z-test
你去找本书看看就明白了，好过在这里干争

have

【在 s*****c 的大作中提到】

: I know.
: But the m and s are estimated from 500 samples!
: My understanding is that if we obtain m and s from <30 samples, then we have
: to use t-test.
:
: 情况

s*c2012-06-10 07:06

40 楼

but my sample size is 500.
Or do you think my population is the 500 cases, and my sample size is 9? If
so, my population's u and s is known.

【在 n******r 的大作中提到】

: 你的理解不对
: 不管你的estimate是从500还是50000个sample里来的，都不能直接当成population
: estimate
: 只有假定population分布已知的情况才能用z-test
: 你去找本书看看就明白了，好过在这里干争
:
: have

s*c2012-06-10 07:06

41 楼

I read those books. It just I interpret it differently from you. I really
like to know where my mistakes are.
Seems you believe 9 is the sample size and you have to look up the t-test
table with degree of freedom of 8. And it doesn't matter whether my
estimate is from how many samples. I really can't agree.
If I got my estimate from 5 billion samples, wouldn't the mean and standard
deviation really close to population? And you still tell me I can't use z-
test but to use t-test and will get the same answer?

【在 n******r 的大作中提到】

s*c2012-06-10 07:06

42 楼

so 500 measurements is my population. The randomly selected 9 cases are
from the population.
Given that we know the sigma of the population, we should use z test and the
answer is the 3 sigma.
I made a mistake earlier regarding the probability of 3 sigma. Should be 99
.97%. So the answer should be 0.015%. Is that right?

dist

【在 g**a 的大作中提到】