How to find frequently occurring patterns in a text file? # JobHunting - 待字闺中
s*d
Post #1
How would you find the n most frequently occurring patterns in a text file?
What data structures would you use?
Here, a pattern is not a single word but rather a sequence of words. For
instance, "this is a" could be a frequently occurring pattern in the file.
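One way to read the question is as counting word n-grams. Below is a minimal sketch under that assumption: it treats a pattern as a fixed-length run of whitespace-separated words, slides a window over the text, counts each window in a hash map, and returns the n most common. The names `top_patterns` and `pattern_len` are made up for illustration, and lowercasing and letting windows run across line breaks are choices of this sketch, not part of the question.

```python
from collections import Counter, deque


def top_patterns(lines, pattern_len, n):
    """Return the n most common runs of pattern_len consecutive words.

    lines is any iterable of strings (an open file works). Words are split
    on whitespace and lowercased; both are assumptions of this sketch.
    """
    counts = Counter()                   # hash map: word tuple -> occurrences
    window = deque(maxlen=pattern_len)   # the last pattern_len words seen
    for line in lines:
        for word in line.lower().split():
            window.append(word)          # the oldest word drops out when full
            if len(window) == pattern_len:
                counts[tuple(window)] += 1
    # most_common(n) picks the n largest counts without sorting everything
    return counts.most_common(n)


if __name__ == "__main__":
    sample = ["this is a test", "this is a pen"]
    print(top_patterns(sample, pattern_len=3, n=1))
```

The hash map answers "which data structure" for counting, and selecting the top n is a heap-style selection over the counts. Memory grows with the number of distinct patterns, which is what the large-file follow-up is getting at.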
Follow-up questions:
- What if the file is very large (in GBs)?
- What if the file contains text in multiple languages (English, Japanese,
etc.)?
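For the two follow-ups, here is one possible sketch, not the only answer. For a file too large to count exactly in memory, it swaps the exact hash map for the Misra-Gries frequent-items algorithm, which keeps at most `capacity` counters and guarantees that any pattern occurring more than N / (capacity + 1) times out of N survives as a candidate. The surviving counts are underestimates, so a second pass over the file is needed for exact counts. For multiple languages, the tokenizer is made pluggable: whitespace splitting works for English, while Japanese has no spaces between words, so the example falls back to one token per character. That fallback is an assumption of this sketch; a real word segmenter could be passed in instead. The names `ngrams` and `misra_gries` are illustrative.

```python
from collections import deque


def ngrams(lines, pattern_len, tokenize=str.split):
    """Yield every run of pattern_len consecutive tokens, one line at a time.

    tokenize maps a string to a list of tokens. The default splits on
    whitespace; pass tokenize=list to get one token per character.
    """
    window = deque(maxlen=pattern_len)
    for line in lines:
        for token in tokenize(line):
            window.append(token)
            if len(window) == pattern_len:
                yield tuple(window)


def misra_gries(items, capacity):
    """Return candidate frequent items using at most capacity counters."""
    counters = {}
    for item in items:
        if item in counters:
            counters[item] += 1
        elif len(counters) < capacity:
            counters[item] = 1
        else:
            # No free counter: decrement all of them and drop the zeros.
            for key in list(counters):
                counters[key] -= 1
                if counters[key] == 0:
                    del counters[key]
    return counters


if __name__ == "__main__":
    stream = ["a b"] * 10 + ["c d", "e f"]
    print(misra_gries(ngrams(stream, 2), capacity=2))
    print(list(ngrams(["日本語"], 2, tokenize=list)))
```

Because `ngrams` is a generator, the file is read line by line and never loaded whole. Another common route for multi-GB input is to hash patterns into partitions on disk and count each partition exactly, which trades extra I/O for exact results.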