Notes
Outline
Designing Information Architecture
for Search
Marti Hearst
University of California, Berkeley
www.sims.berkeley.edu/~hearst
NSF CAREER Grant, NSF9984741
Outline
Motivation
Search Interfaces:
Web search vs Site Search
Search UIs: What works; what doesn’t
Methodology
Information Architecture Defined
Faceted Metadata
Integrating Search into IA via Faceted Metadata
Results of Usability Studies
Tools
Conclusions
Contributors to the Research
Dr. Rashmi Sinha
Graduate Students
Ame Elliott
Jennifer English
Kirsten Swearington
Ping Yee
Research funded by
NSF CAREER Grant, NSF9984741
Motivation and Background
Claims
Web Search is OK
Gets people to the right starting points
Web SITE search is NOT ok
The best way to improve site search is
NOT to make new fancy algorithms
Instead …
Slide 6
Recent Study by Vividence Research
Spring 2001, 69 web sites
70% eCommerce
31% Service
21% Content
  2% Community
The most common problems:
53% had poorly organized search results
32% had poor information architecture
32% had slow performance
27% had cluttered home pages
25% had confusing labels
15% invasive registration
13% inconsistent navigation
Vividence findings: effects on users
Poorly organized search results
Frustration and wasted time
Poor information architecture
Confusion
Dead ends
"back and forthing"
Forced to search
Vividence findings: effects on users
Cluttered home pages
Creates disinterest
Wastes time
No contrast: everything has equal weight
Don’t know where to start
Failure to engage
No call to action
Failure to establish navigation
Layout reflects company organization chart
Investor centeredness
Vividence findings: characteristics
Inconsistent Navigation
Primary navigation bar is, in fact, really secondary
Un-scalable designs
Poor transitions between company divisions
"Junk Drawer" navigation bars
Random links
Shoe-horned functions
Heavy need to hit the "back-button"
Vividence Study
Breakdown of most common search problems
41% - of searches encountered no problems
20% - had search problems not named below
14% - of searches were not “advanced” enough
12% - did not organize results well
10% - of searches yielded inaccurate/unrelated results
 9% - were too slow
 8% - of searches had insufficient instructions
 7% - engine was too difficult to locate
 7% - of searches produced too few results
 7% - of searches were too limiting
 3% - of searches produced an error message
 3% - were too difficult to use
Other Relevant Studies
Commercial studies (are not usually scientific, do not supply full details)
CreativeGood.com Holiday 2000 ecommerce report
UIE, and Jared Spool’s talks: http://world.std.com/~uieweb
Scientific studies (often less relevant to real web situations)
Many papers from the CHI proceedings http://www.acm.org/dl/
Papers from Human Factors and the Web http://www.optavia.com/hfweb/
See the extensive bibliography from my textbook chapter (in this package).
The Philosophy
Information architecture should be designed to integrate search throughout
Search results should reflect the information architecture.
This supports an interplay between navigation and search
This supports the most common human search strategies.
The Approach
Assign faceted metadata to content items
Allow users to navigate through the faceted metadata in a flexible manner
Organize search results according to the faceted metadata so navigation looks similar throughout
Give previews of next choices
Allow access to previous choices
Advantages of the Approach
Supports different task types
Highly constrained known-item searches use one interface
Open-ended, browsing tasks use another interface
Both types of interface use the same underlying structure
Can easily switch from one interface type to the other midstream
Advantages of the Approach
Honors many of the most important usability design goals
User control
Provides context for results
Reduces short term memory load
Allows easy reversal of actions
Provides consistent view
Advantages of the Approach
Allows different people to add content without breaking things
Can make use of standard technology
Web Search vs. Site Search
Web Search is Working!
Survey finds high user satisfaction
Study by npd group
http://www.searchenginewatch.com/reports/npd.html
Why is Web Search Working?
Web Search is Successful at Finding Good Starting Points (home pages)
Evidence:
Search engines using
Link analysis
Page popularity
Interwoven categories
These all find dominant home pages
Slide 21
Slide 22
Organizing Search Results:
What works, What Doesn’t
There is a lot of prior work on this
Cha-Cha (Chen et al. 1999)
Scatter-Gather clustering (Cutting et al. 93, Hearst et al. 1996)
Becoming more prevalent in web search too.
Teoma
Vivisimo
Northern Light
Putting Results into Clusters
Drilldown – what does it mean?
Vivisimo – same idea
Slide 27
Yahoo lists category matches
Web Search Results Grouping
Drill down one category
Cannot mix and match categories
Not clear if it is useful or not
Can help differentiate different meanings of the same word.
But …what about site search?
If Web search engines are providing source selection …
… what happens when the user gets to the site?
Following Hyperlinks
Works great when it is clear where to go next
Frustrating when the desired directions are undetectable or unavailable
An Analogy
Analogy
Hypertext:
A fixed number of choices of where to go next;
A glance at the map tells you where you are;
But may not go where you want to go.
To get from Topeka to Santa Fe, may have to go through Frostbite Falls
Site Search:
Can go anywhere;
But may get stuck, disoriented, in a crevasse!
Goal: An All-Tertrain Vehicle
The best of both techniques
A vehicle that magically lays down track to suggest choices of where you want to go next based on what you’ve done so far and what you are trying to do
The tracks follow the lay of the land and go everywhere, but cross over the crevasses
The tracks allow you to back up easily
Organizing Search Results
What works; what doesn’t
What works, what doesn’t
There is negative evidence for
Clustering
Fancy visualizations
There is positive evidence for
Grouping into meaningful, consistent categories
Relevance feedback
Depends how you do it
Showing similar items
Kohonen Feature Maps on Text
(from Chen et al., JASIS 49(7))
Study of Kohonen Feature Maps
H. Chen, A. Houston, R. Sewell, and B. Schatz, JASIS 49(7)
Comparison: Kohonen Map and Yahoo
Task:
“Window shop” for interesting home page
Repeat with other interface
Results:
Starting with map could repeat in Yahoo (8/11)
Starting with Yahoo unable to repeat in map (2/14)
Study (cont.)
Participants liked:
Correspondence of region size to # documents
Overview (but also wanted zoom)
Ease of jumping from one topic to another
Multiple routes to topics
Use of category and subcategory labels
Study (cont.)
Participants wanted:
hierarchical organization
other ordering of concepts (alphabetical)
integration of browsing and search
corresponce of color to meaning
more meaningful labels
labels at same level of abstraction
fit more labels in the given space
combined keyword and category search
multiple category assignment (sports+entertain)
Visualization of Clusters
Huge 2D maps may be inappropriate focus for information retrieval
Can’t see what documents are about
Documents forced into one position in semantic space
Space is difficult to use for IR purposes
Hard to view titles
Perhaps more suited for pattern discovery
problem: often only one view on the space
Summary: Clustering
(Based on other studies as well)
Advantages:
Get an overview of main themes
Domain independent
Disadvantages:
Many of the ways documents could group together are not shown
Not always easy to understand what they mean
Different levels of granularity
Probably best for scientists only
Take heart – there is good evidence for organizing via categories!
The DynaCat System
Decide on important question types in an advance
What are the adverse effects of drug D?
What is the prognosis for treatment T?
Make use of MeSH categories
Retain only those types of categories known to be useful for this type of query.
DynaCat
DynaCat Study
Design
Three queries
24 cancer patients
Compared three interfaces
ranked list, clusters, categories
Results
Participants strongly preferred categories
Participants found more answers using categories
Participants took same amount of time with all three interfaces
Cha-Cha (intranet search)
Cha-Cha (intranet search)
How People Search
The Standard Model
Assumptions:
Maximizing precision and recall simultaneously
The information need remains static
The value is in the resulting document set
“Berry-Picking” as an Information Seeking Strategy (Bates 90)
Berry-picking model
Interesting information is scattered like berries among bushes
The user learns as they progress, thus
The query is continually shifting
A sketch of  a searcher… “moving through many actions towards a general goal of satisfactory completion of research related to an information need.” (after Bates 89)
Search Tactics and Strategies
Marcia J. Bates, Information Search Tactics, Journal of the American
Society for Information Science, 30, 4, 1979
Marcia J. Bates, Where should the person stop and the information
search interfaces start?, Information Processing & Management, 26, 5,
1990
Marcia J. Bates, The Berry-Picking Search: User Interface Design, User
Interface Design, Harold Thimbleby, Addison-Wesley, 1990
Marcia J. Bates, The design of browsing and berrypicking techniques
for the on-line search interface, Online Review, 1989, 13, 5,
407—431.
Vicki L. O'Day and Robin Jeffries, Orienteering in an information
landscape: how information seekers get from here to there, Proceedings of ACM INTERCHI '93, April, Amsterdam, 1993
Gary Marchionini, Information Seeking in Electronic Environments, Cambridge University Press, 1995.
Tactics vs. Strategies
Tactic: short term goals and maneuvers
operators, actions
Strategy: overall planning
link a sequence of operators together to achieve some end
An Important Strategy
Do a simple, general search
Gets results in the generally correct area
Look around in the local space of those results
If that space looks wrong, start over
Akin to Shneiderman’s overview + details
Our approach supports this strategy
Integrate navigation with search
Term Tactics
Move around a thesaurus
Look at category labels
Look at related terms
Look at parent terms
Look at child terms
In older literature, refers to navigating the thesaurus itself, as opposed to the items themselves.
Source-level Tactics
“Bibble”:
 look for a pre-defined result set
 e.g., a good link page on web
Survey:
look ahead, review available options
e.g., don’t simply use the first term or first source that comes to mind
Cut:
eliminate large proportion of search domain
e.g., search on rarest term first
Source-level Tactics (cont.)
Stretch
use source in unintended way
e.g., use patents to find addresses
Scaffold
take an indirect route to goal
e.g., when looking for references to obscure poet, look up contemporaries
Monitoring Strategies
Check
compare original goal with current state
Weigh
make a cost/benefit analysis of current or anticipated actions
Pattern
recognize common strategies
Correct Errors
Record
keep track of (incomplete) paths
Additional Considerations
(Bates 79)
Need a Sort tactic
When to stop?
How to judge when enough information has been gathered?
How to decide when to give up an unsuccesful search?
When to stop searching in one source and move to another?
Information Architecture
A Taxonomy of WebSites
 A View of Website Design
A View of Information Architecture
Content Items +
Information Structure +
Navigation Structure +
Layout
Content Items
The information items that the site is designed to show the user.
Individual content items can be considered leaves in a tree, or base-level items.
Aggregates of individual (base-level) items can be considered to be content items.
This definition is especially relevant for catalog-style sites, for example:
Image collection
Product selling
Collection of articles on some topic (medical, legal)
Collection of information about some entity (IRS, Park Service)
Information Structure
Independent of the website.
A set of descriptors which are used to characterize the content of a website.
Consists primarly of a category structure and a set of textual labels.
The categories can have flat, hierarchical, faceted or network structure.
The textual labels include alternative ways of expressing the same concepts (synonyms).
Navigation Structure
Defined in terms of the website.
Site level:
The paths connecting content items throughout the site.
Page level:
The link from one page to others.
Example from Walmart.com
Content
Related Items
Often are content items
Related to the target by some shared information structure
The particular related items that are shown are revealed through the navigation structure
The Information Structure
Consists of a set of descriptors for the content items
Can’t really see it directly, since it is independent of web site description
Can see parts of it in the navigation structure
A View of Information Architecture
A View of Information Architecture
Navigation structure links items or groups of items.
Navigation Structure Differs from Information Structure
Example:
Part of the info structure is the product hierarchy.
Some products are assigned more than one spot in the hierarchy (e.g., sports and games), thus forming a tree structure
Navigation structure shows a progressive disclosure of the hierarchical structure only.
Navigation Structure Differs from Information Structure
Example:
Main navigation structure is the product hierarchy.
However, “lateral” links are shown from product leaf nodes to other nodes
(e.g., from a tent to a flashlight and a sleeping bag)
Navigation Structure Differs from Information Structure
The differences can be much more profound
Examples:
Show only main product categories at top levels
After a search, show links according to brands of items, but only those brands that make sense for the items retrieved by the search.
“Breadcrumbs”
A navigation technique for showing either history or contextualizing hierarchy via hyperlinks.
Two main types:
Hierarchy without history:
Search results at walmart.com
History across facets (without hierarchy):
Epicurious path recording.
An Important IA Trend
Generating web pages from databases
Implications:
Web sites can adapt to user actions
Web sites can be instrumented
“An essential feature of a design environment is to give authors the possibility of evaluating the current network against the final adaptive system.”
Petrelli, Baggio, & Pezzulo, Adaptive Hypertext Design Environments: Putting Principles into Practice, AH 2000
Faceted Metadata
Metadata: data about data
Facets: orthogonal categories
Faceted Metadata: Biomedical 
MeSH (Medical Subject Headings)
www.nlm.nih.org/mesh
Mesh Facets (one level expanded)
Using Mesh Facets
Some stats:
>18,000 labels
avg depth: 4.5, max depth 9
~8 labels/article on average
How to go from the information structure to the navigation structure?
Using faceted metadata incorrectly
Yahoo uses faceted metadata poorly in both their search results and in their top-level directory
They combine region + other hierarchical facets in awkward ways
Yahoo’s use of facets
Yahoo’s use of facets
Yahoo’s use of facets
Yahoo’s use of facets
However, Yahoo does use some metadata well
Yahoo restaurant guide combines:
Region
Topic (restaurants)
Related Information
Other attributes (cuisines)
Other topics related in place and time (movies)
Slide 90
Combining Information Types
Region
State
City
A & E
Film
Theatre
Music
Restaurants
California
Eclectic
Indian
French
Other Possible Combinations
Region + A&E
City + Restaurant + Movies
City + Weather
City + Education: Schools
Restaurants + Schools
…
Bookstore preview combinations
topic + related topics
topic + publications by same author
topic + books of same type but related topic
Problems with Metadata Usage
Standard approaches
Paths are hand-edited, predefined
Not well-integrated with search
Not tailored to task as it develops
Not personalized
Not dynamic
Questions we are trying to answer
How many facets are allowable?
Should facets be mixed and matched?
How much is too much?
Should hierarchies be progressively revealed, tabbed, some combination?
How should free-text search be integrated?
Recipe Collection Examples
Slide 97
Slide 98
Slide 99
Slide 100
Slide 101
Slide 102
Epicurious Metadata Usage
Advantages
Creates combinations of metadata on the fly
Different metadata choices show the same information in different ways
Previews show how many recipes will result
Easy to back up
Supports several task types
``Help me find a summer pasta,'' (ingredient type with event type),
``How can I use an avocado in a salad?'' (ingredient type with dish type),
``How can I bake sea-bass'' (preparation type and ingredient type)
Metadata usage in Epicurious
Metadata usage in Epicurious
Metadata usage in Epicurious
Metadata usage in Epicurious
Metadata usage in Epicurious
Recipe Information Architecture
Information design
Recipes have five types of metadata categories
Cuisine, Preparation, Ingredients, Dish, Occasion
Each category has one level of subcategories
Recipe Information Architecture
Navigation design
Home page:
show top level of all categories
Other pages:
A link on an attribute ANDS that attribute to the current query;  results are shown according to a category that is not yet part of the query
A change-view link does not change the query, but does change which category’s metadata organizes the results
Metadata Usage in Epicurious
Can choose category types in any order
But categories never more than one level deep
And can never use more than one instance of a category
Even though items may be assigned more than one of each category type
Items (recipes) are dead-ends
Don’t link to “more like this”
Not fully integrated with search
Epicurious Basic Search
Lacks integration with metadata
Slide 113
Information previews
Use the metadata to show where to go next
More flexible than canned hyperlinks
Less complex than full search
Help users see and return to what happened previously
Reduces mental work
Recognition over recall
Suggest alternatives
The Importance of Information Previews
Jared Spool’s studies (www.uie.com)
More clicks are ok if
The “scent” of the target does not weaken
If users feel they are going towards, rather than away, from their target.
Problem with Metadata Previews as Currently Used
Hand edited, predefined
Not tailored to task as it develops
Not personalized
Often not systematically integrated with search, or within the information architecture in general
Putting it Together
Desiderata for Objects in
Information-Seeking Workspaces
Structured
Fractal
Queriable
Navigable
Historical
Similarity Engine Compatible
Contextualized
Other
Search Usability Design Goals
Strive for Consistency
Provide Shortcuts
Offer Informative Feedback
Design for Closure
Provide Simple Error Handling
Permit Easy Reversal of Actions
Support User Control
Reduce Short-term Memory Load
Analogy: Chess
Chess is characterized by a few simple rules that disguise an infinitely complex game
Another intriguing characteristic:  the three-part structure
Openings: many strategies, new ones all the time, many books on this
Endgame: well-defined, well-understood
Middlegame: nebulous, hard to describe
Our thought: search is similar and the middlegame is critically underserved.
Chess-based view of Info Architecture
The Opening:
Usually exposes top-level hierarchy or top-level facets (or both)
Usually also has a search component
This is also the place to expose the main tasks that can be accomplished on the site
The Opening
The Opening
The Opening
The Opening
The Opening
Chess-based view of Info Architecture
The Endgame:
Has become rather well-established in shopping sites
Penultimate page: shows a list of items
Leaf node:
Shows one content item in detail
Lateral links
To similar items (same facet)
To other items that go with it (other facets)
The Endgame – Penultimate Pages
The Endgame – Penultimate Pages
The Endgame – Leaf Nodes
Chess-based view of Info Architecture
The Middlegame:
Hardest to describe/understand
The “berry-picking” part of supporting search
Issues:
How to progressively expose hierarchies?
How to show multiple facet choices?
How to integrate with search results?
How to show history / retain context?
Sophisticated Middlegames
Sophisticated Middlegames
Sophisticated Middlegames
Sophisticated Middlegames
Sophisticated Middlegames
Online Grocery Shopping Examples
In each case, note
Chess analogy
What is the opening?
What is the endgame?
How is the middlegame handled?
How are search results integrated?
How is hierarchical drill-down revealed?
Are multiple facets allowed?
Grocery shopping example
Grocery shopping example
Grocery shopping example
Grocery shopping example
Grocery shopping example
Grocery shopping example
Grocery shopping example
Grocery shopping example
Summary: Grocery Shopping Examples
A good opening seems to make a big difference
Familiar metadata helps make the task easier
Middlegame hierarchy exposure
One uses cascading menus
Two use webpage-based drilldown
Two use metadata to organize search results
But don’t use metadata creatively
Could organize by recipe, etc.
Medical Text Example
Allow user to select metadata in any order
At each step, show different types of relevant metadata,
based on prior steps and personal history,
include # of documents
Previews restricted to only those metadata types that might be helpful
Ecommerce Examples
E-commerce sites are farther ahead than information collection sites
However, their problem is usually easier
Single facet often works fine
Categories are familiar to users
Collections are often much smaller
How to move this to large sites containing more abstract information?
Image collections?
Text collections
Current Search Approach
Asthma > Steroids
Asthma > Steroids > Admin & Dosage
Asthma > Steroids > Budesonide > Huang
Other paths: back up and go forward
Advantages of the Methodology
Supports different types of information seeking tasks
Uses interface idioms known to be usable for general users
Flexible content entry and update
Allows for non-experts to add new content independently
Makes use of standard DBMS technology
Advantages of the Methodology
Systematically integrates search:
search results reflect the structure of the info architecture
search results retain the context of previous interactions
search results preview next choices
Gives user control
Over order of metadata use
Over when to navigate vs. when to search
Allows integration with advanced methods
Collaborative filtering, predicting users preferences
Advantages
Users have a feeling of control
Users can predict what will happen
Not true of statistical ranking or clustering
Adding new items to the system changes the behavior in understandable ways
Users have flexibility
In ordering of operations
In combining of operations
Usability Study: epicurious
Epicurious Usability Study
9 participants so far
Independent Variables:
1) Epicurious Interface (Basic vs. Enhanced vs. Browse)
2) Task type (known-item search vs. browsing for inspiration)
3) Degree of constraint of query
4) Number of results required (1 vs. many)
Dependent Variables:
 1) Time to find satisfactory recipe(s)
 2) Navigation path (backtracking, starting over, revising queries)
 3) Satisfaction with results of search
4) Satisfaction with individual system features (e.g. breadcrumbs, query previews, refine by hyperlinks)
 5) Likelihood of using each interface in the future.
Epicurious Usability Study
Participants were asked to:
Do 3 pre-specified searches in advance
In the lab:
Specify a cooking scenario of interest to them
Search for 3 recipes for this recipe
Search for each recipe using each of the interfaces
Complete several structured tasks
Along the way, answer questions about
 Getting closer or farther away from goal
Satisfaction with search results
Satisfaction with the interace
Usability Study:
Preliminary Results, Preference Data
Usability Study:
Preliminary Results, Preference Data
Usability Study
Preliminary Results: Feature Preference
Usability Study
Preliminary Results: Quantitative
Usability Study
Preliminary Results:
Constraint-based Preferences
Observed patterns of use of epicurious metadata browse interface
choosefacet
refine
refine
back
scan focus
choosefacet
refine
back refine
scan focus
choosefacet
refine
refine
scan focus
choosefacet
refine
refine
back refine
scan focus
choosefacet
refine
refine
searchword
choosefacet
searchword
scan searchword
back refine
scan focus
choosefacet
refine
back back refine
refine
refine
choosefacet
refine
refine
searchword
scan
Usability Study Results: Summary
People liked the browsing-style metadata-based search and found it helpful
People sometimes preferred the metadata search when the task was more constrained
But zero results are frustrating
This can be alleviated with query previews
People dis-prefer the standard simple search
More study needed!
Application to Image Search
Image Search: What is the task?
Illustrate my slides?
“Find a crevasse”
Keyword match works pretty well
Find inspiration for an architectural design?
Needs richer search support
Faceted Metadata for Image Collection
Faceted Metadata for Image Collection
SPIRO Query Form (Original)
SPIRO query on Subject: church
Pilot Study
Architecture task:
Emphasize images over text
Use hypertext-style interface as a reasonable baseline for comparison
Find out how much choice is too much
Find out whether explicit metadata is better than implicit more-like-this
Evaluation Methodology
Solicit feedback from architects to determine if faceted metadata is helpful and how to present it
Informal evaluation of paper prototype
Informal study of a crude live version
1 hour one-on-one with 9 architects /grad students, 2 tasks (audio recorded) and a survey
Results of a pilot study with Archictects: Metadata is Helpful
Very positive feedback about the general approach
All 9 participants named the metadata in the search results area as their favorite aspect of Flamenco
Metadata was successful at giving hints about where to go next
Perceived as useful “These are places I can go from here.”
Results: More Metadata Please
Participants asked for more metadata
Although there were complaints about the contents of the metadata, users still wanted more
Longer lists of options (more hints)
Users wanted more control to make very specific searches
Half the participants requested the ability to control order of results with metadata
Juxtapose visible images 2 different ways:
Overview (one image from each project) vs. like together ( all images of a project next to each other)
Different than ranking for text retrieval (precision, recall), but ordering does matter
Results: Complaints
The UI was not successful at clarifying searching within results vs. starting a new search
Only 2 of the 9 participants understood the distinction without discussion – but they want to do both
The 1/3 of the participants who couldn’t find a treasure hunt image felt that Flamenco was slow
Corroborates findings that perceived system speed is about finding what you want (Spool ‘00)
New Developments
A new, sophisticated implementation
Richer, hierarchical, cleaned up metadata
Usability Study contrasting four versions:
Single search form
Multiple facet search form
Yahoo-style directory-based
Faceted interface with query previews
Results TBA
Slide 178
Slide 179
Slide 180
Tools
Our system (all open-source)
Mysql (has a text search component)
Python 2.2
Python-mysql
Webware (python application server)
Earlier attempt
Cold fusion – not flexible enough, not enough of a programming language
A new term: “Parametric Search”
From an XML glossary
"A search request submitted to a search or database engine delivered with consideration for the metadata of the underlying dataset.”
www.sla.org/chapter/ctor/courier/v37/v37n1.pdf
Commercial Tools
This list is NOT comprehensive
These are NOT recommendations
General Search
Inktomi Search/site  (formerly Infoseek ultra)
Specializing in Online Catalogs
Dieselpoint
Requisite
Saqqara
Question-answering
Askjeeves
Primus (formerly Answerlogic)
“Parametric” Search
A survey of sites using parametric search:
http://www.amp.com/search/default.asp (see product family search)
http://ebiz.zilog.com/
http://www.sears.com (Dieselpoint)
http://dieselpoint.com/flashlink.htm (for Dieselpoint 2.0 demo)
http://www.findmro.com (Requisite's BugsEye)
http://www.cypress.com (Saqqara's one step)
http://infineon-tech.sacosnet.de/search/index.htm
http://www.idt.com/tools/parametric.html
http://www.ti.com/sc/docs/psheets/parms/uarts.htm#parms
http://www.gensemi.com/search/productsearch.htm
http://www.usa.samsungsemi.com/search/
http://www.gearfinder.com
http://www.mysimon.com/category/index.jhtml?c=babydiaperingbathing
“Parametric” Search Usage
Goal is to focus on product group for comparison shopping.
Common Procedure
Begin with a list of product "families" or groups.
User selects a category, and is prompted to
 1) select a sub-category from a list of hyperlinks or
2) select search parameters using a form
If the number of results is too big, the system may prompt the user to refine the search further.
When an acceptable number of results is returned, the user sees a list of products which can be:
1) sorted by various criteria
2) selected for display in a comparison table
3) viewed individually with more detail.
“Parametric” Search as used on these Sites
Observations:
Only one facet (appropriate for products?)
No query previews
Breadcrumbs rare
Many allow sorting by attribute to facilitate comparison
“Others like this” simply moves up the hierarchy
Summary and Conclusions
Summary
Web site search needs improvement
Users want more organized results
Our approach: integrate navigation with search
Metadata is being mixed and matched in interesting ways, but there are no guidelines on what works
We are investigating how to design websites containing large sets of items
Preliminary results indicate that metadata organization is useful in some situations
Depends on the type of search need
Advantages of the Methodology
Supports different types of information seeking tasks
Uses interface idioms known to be usable for general users
Flexible content entry and update
Systematically integrates navigation & search
Gives user control
Allows integration with advanced methods
Summary
Our research goals
Systematically determine what works, with the following emphases:
Task-centric
Integrate metadata with search
Dynamic previews
Easily retrace steps
Develop recommendations that reflect both the task structure and the richness of the information structure
In future: integrate with more sophisticated displays
Some Unanswered Questions
How best show combinations of facets that consist of large hierarchies?
How to use faceted metadata to expand (as opposed to refine)?
How to integrate with relevance feedback (more like this)?
How to incorporate user preferences and past behavior?
How to combine facets to reflect tasks?
Thank you!
bailando.sims.berkeley.edu/flamenco.html