A Bayesian model for predicting lesbian sub-gender type: PHT# LES - 同女之舞
s*n
Floor 1
Proposed here is a Bayesian model for predicting whether a particular
person is P, H, or T.
Let Type denote a discrete variable over the predictions P, H, or T.
Let s denote an observed variable of appearance or attributes.
P(Type) is the probability of T, P, or H based on common sense and statistics,
without knowing anything about the particular person.
P(s|Type) is the probability of seeing characteristics s (long hair, shorts,
red Jeep, worrying about marrying a man, etc.) given the Type.
Bayesian inference tells us:
P(Type|s) = P(Type) * P(s|Type) / P(s)
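As a minimal sketch of this rule (all numbers below are made up for illustration), the unnormalized posterior is just the prior times the likelihood, renormalized over the three types so we never need P(s) explicitly:

```python
# Minimal sketch of Bayes' rule above; all numbers are hypothetical.
prior = {"P": 0.25, "H": 0.25, "T": 0.50}        # hypothetical P(Type)
likelihood = {"P": 0.05, "H": 0.20, "T": 0.60}   # hypothetical P(s|Type) for one observed s

# Unnormalized posterior: P(Type) * P(s|Type)
unnorm = {t: prior[t] * likelihood[t] for t in prior}

# Normalize so the posterior sums to 1 over {P, H, T};
# the normalizer z plays the role of P(s).
z = sum(unnorm.values())
posterior = {t: v / z for t, v in unnorm.items()}

print(posterior)  # T comes out most probable under these numbers
```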
If we want to find out whether our hypothesis holds (say, that a given ID
is T), we just have to write down the prior distribution and the likelihood.
P(Type) is the prior distribution of the type. In North America, we can assume
a skewed distribution in which there are many more Ts than Ps and Hs combined.
We can also put a prior over geographical distribution. For example, there are
more Ts in Southern California than in New York, etc.
How do we get the prior distribution? Very easy: we just take some data from
the IDs on this BBS, and that will give us a distribution.
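For instance, counting self-reported types from BBS IDs gives an empirical prior (the list of reports below is purely hypothetical):

```python
# Hypothetical self-reported types collected from BBS IDs; data is made up.
from collections import Counter

reported = ["T", "T", "P", "T", "H", "T", "P", "T", "H", "T"]

# Empirical prior: relative frequency of each type in the sample.
counts = Counter(reported)
n = len(reported)
prior = {t: counts[t] / n for t in ("P", "H", "T")}

print(prior)  # skewed toward T, as the post assumes for North America
```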
P(s|Type) is the likelihood. It defines, given a particular Type such as T,
how we describe the corresponding attributes. The attributes could include
measurable appearance and personality characteristics. For example, if we
know a person is T, we can describe THE PROBABILITY of which clothes she
would wear, which hairstyle she would have, whether she would carry heavy
bags for her lady friends, etc. In other words, given that a person is T,
how likely is she to prefer dressing in shorts and driving a red Jeep?
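One simple way to write this likelihood down is a conditional probability table per attribute, with the extra (naive Bayes) assumption that attributes are independent given the Type. The tables and probabilities below are entirely hypothetical:

```python
# Hypothetical conditional probability tables P(attribute | Type).
# Naive Bayes assumption: attributes are independent given the Type.
p_shorts = {"P": 0.10, "H": 0.40, "T": 0.80}        # P(wears shorts | Type)
p_red_jeep = {"P": 0.02, "H": 0.10, "T": 0.30}      # P(drives red Jeep | Type)
p_carries_bags = {"P": 0.05, "H": 0.50, "T": 0.90}  # P(carries friends' bags | Type)

def likelihood(ty, shorts, red_jeep, carries_bags):
    """P(s | Type) for a joint observation s, under the independence assumption.
    Each flag is True/False; an absent attribute contributes (1 - p)."""
    out = 1.0
    for table, seen in ((p_shorts, shorts),
                        (p_red_jeep, red_jeep),
                        (p_carries_bags, carries_bags)):
        out *= table[ty] if seen else (1.0 - table[ty])
    return out

print(likelihood("T", shorts=True, red_jeep=True, carries_bags=True))
```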
Once we have both the prior distribution and the likelihood, we can apply a
standard machine learning algorithm to find the posterior distribution
P(Type|s).
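Putting the pieces together, the "standard algorithm" here is nothing more than Bayes' rule with normalization. A self-contained sketch, again with hypothetical numbers:

```python
# End-to-end sketch: posterior = normalized prior * likelihood.
# All numbers are hypothetical.
prior = {"P": 0.2, "H": 0.2, "T": 0.6}  # e.g. estimated from BBS data

# P(s | Type) for one joint observation s (shorts + red Jeep), made up.
likelihood = {"P": 0.10 * 0.02, "H": 0.40 * 0.10, "T": 0.80 * 0.30}

def posterior(prior, likelihood):
    """Return P(Type | s) by normalizing prior * likelihood over the types."""
    unnorm = {t: prior[t] * likelihood[t] for t in prior}
    z = sum(unnorm.values())
    return {t: v / z for t, v in unnorm.items()}

post = posterior(prior, likelihood)
print(max(post, key=post.get))  # the most probable (MAP) type for this observation
```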