|
Informing Science + IT Education Conference 2003 Proceedings

| Informing Science + IT Education Conference |
| Pori, Finland June 24-27, 2003 |
| Using Text Analysis to Inform Clients of the Subject of a Document |
| Hebrew University of Jerusalem, Israel |
| Contemporary information databases contain many millions of electronic documents. Locating informa-tion on |
| the Internet today is problematic, due to the enormous number of documents it contains. Several other studies have |
| found that associating documents with a subject or list of topics can improve lo-catability of information on the |
| Internet (Drori, 2000a 2000b 2000c). Effective cataloguing of informa-tion is performed manually, requiring |
| extensive resources. Consequently, most information is currently not catalogued. This paper aims to present a |
| software tool that automatically locates the subject of a document and to show the results of a test performed, |
| using the software tool, TextAnalysis, specially developed for this purpose. The main purpose of this study is to |
| inform clients of the subject of the cor-pus of texts it obtains from search engines as a search results list. |
| Keywords: Frequently occurring words, Web documents classification, Search results list, Identify top-ics of |
ISSN
1535-07-03
Unless otherwise indicated, this paper has undergone blind external review by three or more reviewers.
Types of Papers: A Best Paper, Regular Paper, Short Paper, Informal Paper,
Unrefereed Panel Paper
|