Put Your Vertical File on the Web Inexpensively -- Do It Yourself

By Ellen Fox

Many libraries have collections of newspaper clippings, pamphlets, magazine articles, government brochures and other odds and ends on particular subjects of local interest. If your library has few financial resources but a little extra staff time (or perhaps some community volunteers), you can make your vertical file available and searchable on the web at little cost.

The first step, of course, is determining if there are any copyright issues associated with your local collection. That should certainly be a priority before you reproduce any material. Local newspapers may be willing to grant permission to reprint articles with the appropriate attribution. Review your vertical file and see what steps you can take.

Next, you must convert the articles to HTML format. Many people think scanning is the fastest way to accomplish this, but often it takes more time to scan than to rekey the material. Scanners have a notoriously difficult time accurately scanning and converting newspaper and magazine fonts, especially since font face and size may change from one paragraph to the next. Once scanned, the electronic version must be carefully edited, since no Optical Character Resolution (OCR) software has a 100 percent accuracy rate. Then HTML tags must be added. In the end, it often makes more sense to put volunteers to work at a basic HTML editor to re-enter the text along with the HTML codes.

If you don't need every word of the article indexed, you can scan the article and save it as an image. Then you can use the information in the META tags as your indexing terms.

Next, someone on the cataloging staff (or anyone who has facility with indexing) should sit down with each article and identify keywords that would be helpful to searchers. Put the keywords in the META tags at the start of the article. These META tags will not be visible to users of the web site, but are searchable and will increase the relevancy of search retrievals. For example, a pamphlet describing local bike trails might have a keyword tag like this: <META name = "keywords" content = "bike trails, bicycle trails, commuting by bike, recreation">. For a fuller overview of META tags, visit the web site WDVL.com/Authoring /HTML/Head/Meta/.

After the files are loaded on your web server, add a search engine to your site -- preferably one that will give special relevance or weight to META tags. (See "New BCR Search Engine Provides Users with More Functional Web Site" in the October 1998 issue of Action for Libraries). If you run a Windows NT server with the Internet Information Server as your web server, keyword searching is built in. Just index your new files and make them available. All the words in the files will be searchable, and you've provided assistance to your patrons by adding indexing terms that will help them locate what they need. As you add new files to your site, you will want to run the indexing software frequently to make sure searchers are getting the most up-to-date information.

This solution to putting your vertical file material on the web isn't quite as clean and elegant as some of the proprietary systems allow, but it is doable by anyone who has the time and energy to undertake the project.