Redian新闻
>
Interview questions about hash function
avatar
Interview questions about hash function# Programming - 葵花宝典
l*e
1
Phone screen question:
You have a billion urls , where each has a huge page, how to detect the
duplicate documents?
I said hashing the document contents, so the interviewer asker do I know
which hash function should I used? I have no clue about what specific
function can hash a large file into a small key that takes relatively less
space.
Anybody give can give me some hint?
avatar
D*a
2
you can use any hash functions, e. g. sum all characters mod 2^32-1

less

【在 l******e 的大作中提到】
: Phone screen question:
: You have a billion urls , where each has a huge page, how to detect the
: duplicate documents?
: I said hashing the document contents, so the interviewer asker do I know
: which hash function should I used? I have no clue about what specific
: function can hash a large file into a small key that takes relatively less
: space.
: Anybody give can give me some hint?

相关阅读
logo
联系我们隐私协议©2024 redian.news
Redian新闻
Redian.news刊载任何文章,不代表同意其说法或描述,仅为提供更多信息,也不构成任何建议。文章信息的合法性及真实性由其作者负责,与Redian.news及其运营公司无关。欢迎投稿,如发现稿件侵权,或作者不愿在本网发表文章,请版权拥有者通知本网处理。