Broad and Ambiguous Search Queries

A typical approach for processing search queries is to retrieve a set of matching documents and then rank them with a relevance scoring function. This simple approach generally works well for unambiguous, specific search queries.

But sometimes this approach breaks down. When a search query is broad (e.g., “shirts”), it isn’t clear how to decide which matching results are the most relevant ones. Even worse, when the query is ambiguous (e.g., “mixers”), it isn’t even clear how to determine the matching results, let alone rank them.

Recognizing When Search Results Need Diversification

Can a search engine automatically determine when a search query is broad or ambiguous intent? No approach is perfect, but here are some useful signals:

  • Number of results. Specific queries tend to have small result sets. Conversely, broad and ambiguous queries tend to have large result sets. But a large result set may simply reflect an aggressive matching strategy. A more nuanced approach is to count the results with high relevance scores. If this number is high, then the query is probably broad or ambiguous.

Broad Queries vs. Ambiguous Queries

All of these signals are ways to identify broad and ambiguous search queries. But these two classes of queries have important differences.

Broad queries are unambiguous but underspecified. For example, the broad query “shirts” includes shirts for men, women, and children; t-shirts, polo shirts, and dress shirts; shirts of all colors and materials; etc. In contrast, “mixers” is ambiguous because it could denote kitchen appliances, sound equipment, or several kinds of industrial machines. All shirts fall into the same general class, but the different kinds of mixers fall into distinct classes.

Some signals that can help distinguish broad queries from ambiguous ones:

  • Modality of distribution. The results for a broad query center around a single mode that represents the “average” result. In contrast, an ambiguous query returns a mixture of results with two or more modes. There are various statistical tests to measure the modality of a distribution.

Search User Interface Implications

All of the discussion so far has been about recognizing broad and ambiguous queries. But what should a search engine do differently if it does recognize such a query?

Disambiguate Ambiguous Queries

If a query is ambiguous, the search engine cannot reliably determine the searcher’s intent. The best way to resolve this ambiguity is through a clarification dialogue. The search engine should present the searcher with unambiguous queries that represent the most probable possible interpretations, with examples to communicate the distinct alternatives. In our “mixer” example, the suggested queries might include “kitchen mixer” and “audio mixer”.

Refine Broad Queries

If a query is broad, then the search engine should suggest refinements that guide the search towards more specific queries. These typically include category suggestions, such as refining from shirts to t-shirts, dress shirts, etc. They may also include faceted refinements that suggest useful attributes to narrow the result set.

It’s important to remember that disambiguation comes before refinement. If a query is ambiguous, the search engine’s first priority is to disambiguate it. Then, if the disambiguated query is still broad, the search engine should help the searcher refine it.

Summary

Many search queries only require the traditional approach of ranking a set of matching results. But some queries require a more complex approach, either because they are broad or ambiguous. It’s important for a search engine to detect such queries, as well as to distinguish broad queries from ambiguous ones. Fortunately, there are a variety of signals that search engines can use to do so. Doing so allows the search engine to help the searcher disambiguate or refine the query as appropriate.

High-Class Consultant.

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store