ATTENTION: THIS WEB SITE HAS MOVED.
The pages you are looking at are no longer being maintained.
Please go to http://www.poly.edu/cis/
to visit the new site of the Department of Computer and Information Science
at Polytechnic University.
CS912 - St: Web Protocols / Information Retrieval on the World Wide Web
Prof. Torsten Suel
Fall 2004, Wed. 8:20- 10:35PM
This course covers a variety of topics related to web protocols and information retrieval on the web, including HTTP protocol, proxies and caching, search engine architecture, crawling and web exploration, indexing, classification of web pages, use of hyperlink structure for information retrieval, graph models for the web, data mining, and semi-structured data. The course emphasizes both algorithmic techniques and implementation aspects, and students are required to complete several substantial programming projects.
Textbook:
- Witten, Moffat, Bell: Managing Gigabytes, Morgan Kaufmann 1999.
- Krishnamurthy, Rexford: Web Protocols and Practice, Addison-Wesley 2001.
Course Goals:
- Provide students with basic understanding of web protocols, web performance issues and web search.
- Give an introduction to information retrieval using the example of modern search engines.
- Have students acquire basic system skills (network programming, scripting, various open source tools).
- Provide students with hands-on experience in building search engines, web proxies, and web services.
Prerequisites by Topic: Good programming skills, CS 3414 or CS 603 (algorithms).