Proposed Applications of IV Techniques to the Web


Abstract | Introduction | IV Techniques | Applications | Proposals | References

I'd like to restrict this discussion to the use of Information Visualization techniques to display various kinds of information obtained from the WWW. There is a temptation to get bogged down in Information Retrieval techniques. Information Retrieval is a related and very relevant subject, but it is also a lot bigger than I'd like to get into. Instead, I prefer to focus on the kinds of information we can already obtain from the WWW, and consider ways we can visualize that already-existing data more efficiently. I believe there are four questions we must ask about how we intend to visualize web data.
  1. What kind of data do we want to visualize?
  2. What kind of visualization techniques do we want to use?
  3. How do we want to implement the visualization front end? That is, what platform, software, libraries, etc. do we want to use for drawing the displays?
  4. How do we want to implement the back end? That is, how do we want to collect the information that we will then visualize?

What kind of data to visualize?

I'd like to think about sorting and filtering WWW data based on the following kinds of characteristics:

What visualization techniques to use?

Well-suited for this application: Not well suited for this application:

How to implement the visualization front end?

That is, what platform, software, libraries, etc. do we want to use for drawing the displays?

How do we want to implement the back end?

That is, how do we want to collect the information that we will then visualize? The back end must be powerful enough to sort and sift data quickly. It must also have a fast connection to the internet so it can gather the information quickly.

The Suns here at school are powerful and well-connected, but not everyone has a Sun. What if the researcher is working at home? What if working over a phone line? Then in might be better to offload the data collection to a back-end machine, and on the front end machine perform only the graphics rendering.

MetaCrawler is implemented not as a web browser running on the client machine, but as a CGI script running on the (back-end) server machine. This allows it do perform its searches and collations very quickly, returning only the end results to the user. This takes the most advantage of the server's fast connection to the internet, the server's more powerful sorting features, and the client browser's familiarity (you can use Netscape), acessibility (you can use it at home), and graphics (you can use your own Mac or PC or whatever you have at home.)


Abstract | Introduction | IV Techniques | Applications | Proposals | References


This page maintained by Mark Brautigam (PDA version)
Last updated 1 March 1997