Modeling Queries as Bags of Documents

Daniel Tunkelang
1 min readDec 2, 2024

--

Last week, I had the honor of presenting “Modeling Queries as Bags of Documents” at Search Solutions 2024 with Aritra Mandal.

We introduced the bag-of-documents model as a way to align query and document representations — specifically addressing the gap between the broad variability of query intents and the inherent specificity of individual documents or products. We described how to compute bag-of-documents representations of frequent queries by aggregating document vectors from their clicks and then using those query vectors as training data to build a sentence transformer model for infrequent queries. We then showed how the bag-of-documents model is useful to recognize query similarity and compute query specificity, both of which are helpful for improving quality, experience, and analytics for search applications.

Here are the slides. Enjoy!

Sign up to discover human stories that deepen your understanding of the world.

Membership

Read member-only stories

Support writers you read most

Earn money for your writing

Listen to audio narrations

Read offline with the Medium app

--

--

Daniel Tunkelang
Daniel Tunkelang

Responses (3)

Write a response