![]() TF-IDF encoding: words are mapped to numerics generated using tf-idf metric.with a large corpus, the vocabulary would be about tens of thousands of tokens, making the one-hot vectors very sparse and inefficient. ![]() Each term would get 1 if it is present in the document 0 otherwise.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |