文件名称:TF-IDF-to-Determine-Word-Relevance
-
所属分类:
- 标签属性:
- 上传时间:2015-04-06
-
文件大小:152.74kb
-
已下载:0次
-
提 供 者:
-
相关连接:无下载说明:别用迅雷下载,失败请重下,重下不扣分!
介绍说明--下载内容来自于网络,使用问题请自行百度
Using TF-IDF to Determine Word Relevance in Document Queries :
In this paper, we examine the results of applying
Term Frequency Inverse Document Frequency
(TF-IDF) to determine what words in a corpus of
documents might be more favorable to use in a
query. As the term implies, TF-IDF calculates
values for each word in a document through an
inverse proportion of the frequency of the word
in a particular document to the percentage of
documents the word appears in. Words with
high TF-IDF numbers imply a strong
relationship with the document they appear in,
suggesting that if that word were to appear in a
query, the document could be of interest to the
user. We provide evidence that this simple
algorithm efficiently categorizes relevant words
that can enhance query retri
In this paper, we examine the results of applying
Term Frequency Inverse Document Frequency
(TF-IDF) to determine what words in a corpus of
documents might be more favorable to use in a
query. As the term implies, TF-IDF calculates
values for each word in a document through an
inverse proportion of the frequency of the word
in a particular document to the percentage of
documents the word appears in. Words with
high TF-IDF numbers imply a strong
relationship with the document they appear in,
suggesting that if that word were to appear in a
query, the document could be of interest to the
user. We provide evidence that this simple
algorithm efficiently categorizes relevant words
that can enhance query retri
(系统自动生成,下载前可以参看下载内容)
下载文件列表
TF-IDF to Determine Word Relevance.pdf
1999-2046 搜珍网 All Rights Reserved.
本站作为网络服务提供者,仅为网络服务对象提供信息存储空间,仅对用户上载内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。
