|
An Introduction to Taxonomies and Categorization
Introduction
Taxonomies are a way to organize documents or web pages into logical groupings, based on their contents. Ideally, documents discussing the same subject will be grouped together into one of the taxonomy's categories. Taxonomies are often organized into "trees" to make them easier to navigate; the subject-related categories and subcategories form the "branches" of the tree.. Near the "root" of the tree are very broad subject categories, such as "financial news", "computer technology", "history" and "travel". As a user navigates down a particular "branch" of a tree, the subject categories get more and more specific. For example, a user navigating down a "Financial News" branch might then select "Mergers and Acquisitions", and then within there "AOL / Time Warner", and then finally "Key Executives". Probably the best-known example of a taxonomy is the Yahoo Internet portal. Yahoo has logically grouped the millions of web pages they index into convenient categories and subcategories. Taxonomies are also sometimes referred to as "knowledge trees" or "topics", depending on the vendor. Once a taxonomy tree has been created, all the documents in the system are tagged as belonging to one or more specific taxonomy categories. This process is typically referred to as "categorization", "tagging" or "profiling", again depending on the vendor. Users can then browse and search within specific categories. The Value of TaxonomiesTaxonomies are becoming very important to companies as they struggle to organize their ever increasing mountains of electronic data. Early search engines had no problem searching through a few thousand documents that might have been stored in a single large repository; almost any search engine works fine if you only have 1,000 documents! As the number of indexed documents grew, search vendors tried again and again to improve their search engines, some even added Artificial Intelligence to parse user queries and locate pertinent documents. But the average user search is composed of just 1.4 words! These short one and two word queries thwarted most of those advanced algorithms. But more importantly, it has become clear that users sometimes prefer an iterative experience. They enter a one word search, and then look at the results. Based on the results, they may edit their search and try again. The early search engines were much more "one shot" oriented. Taxonomies provide a well understood structure for more modern, targeted searches. Taxonomies provide several key benefits:
When properly implemented, taxonomies speedup employee access to critical data, dramatically increasing their productivity. Ultimately, this is the main reason companies implement the technology. Creating Taxonomies and Categorizing DocumentsOnce customers understand what a taxonomy is, their next question is typically "So where do these taxonomies come from?" That's an excellent question! Different vendors have different methods for creating and maintaining taxonomy trees. Some vendors separate the creation of the trees from the process of categorizing documents, whereas other vendors combine these two processes. There are three general type of taxonomy creation, with some vendors offering tools that span more than one type:
Where Taxonomies Are UsedElectronic taxonomies have been used by library science professionals for several decades. During adoption of the World Wide Web in the mid 1990's, taxonomies were deployed as part of large Internet portals. In the late 1990's large corporations began deploying them as part of their internal corporate portals. This industry has continued to grow and now, collectively, offers a wide range technology and products at various price points. Small to midsized companies are now also able to offer their employees and customers these same benefits. In addition to public Internet and corporate portals, taxonomies are also finding their way into vertical portals, customer and partner extranet sites, and even to power very specialized knowledge worker document repositories. New Idea Engineering and TaxonomiesNew Idea Engineering has partnered with many of the industry's leading technology providers in order to help our customers retrofit existing knowledge systems, or to create completely new ones. For new systems, we can help you sort through the many vendor claims and product demos. For existing systems, we can help you do a technical audit and plan your integration. We can also help with the actual taxonomy creation process. And, of course, we can also do the actual integration for you, if you choose. Even if the search engine currently deployed on an intranet site doesn't offer native support for taxonomies, they can often still be retrofitted in, to create a seamless user experience. Please contact our Sales Team if you would like to discuss any of this in more detail.
|