Result Overlap in Web Search Engines

A Comparison of Source Distribution and Result Overlap in Web Search Engines

This study investigated the overlap of four search engines (Google, Bing, DuckDuckGo, and Metager), in Germany and the US.

RAT was used to collect more than 200,000 results and to compute the overlap between the search engines.

Abstract

When it comes to search engines, users generally prefer Google. Our study aims to find the differences between the results found in Google compared to other search engines. We compared the top 10 results from Google, Bing, DuckDuckGo, and Metager, using 3,537 queries generated from Google Trends from Germany and the US. Google displays more unique domains in the top results than its competitors. Wikipedia and news websites are the most popular sources overall. With some top sources dominating search results, the distribution of domains is also consistent across all search engines. The overlap between Google and Bing is always under 32%, while Metager has a higher overlap with Bing than DuckDuckGo, going up to 78%. This study shows that the use of another search engine, especially in addition to Google, provides a wider variety in sources and might lead the user to find new perspectives.

RAT functionality used

Scrapers: Google (DE), Bing (DE), DuckDuckGo (DE), Metager (DE), Google (US), Bing (US), DuckDuckGo (US), Metager (US)

Data analysis: Overlap analysis

Publications and presentations

Conference paper

Yagci, N.; Sünkler, S.; Häußler, H.; Lewandowski, D. (2022). A Comparison of Source Distribution and Result Overlap in Web Search Engines. Proceedings of the Association for Information Science and Technology, 59(1), 346-357. https://doi.org/10.1002/pra2.758

Poster

Poster at the RAT Community Meeting 2023: https://searchstudies.org/wp-content/uploads/2023/11/poster-overlap.pdf

Video

Want to learn more about RAT and the Search Studies research group? Sign up for our Newsletter!