Description
Online search engines are an essential tool for seeking information, but results returned from these search engines can contain undesirable forms of bias with respect to protected attributes such as gender or race. These biases can exist due to the word embeddings used by search engines, the design of re-ranking algorithms, the development of retrieval algorithms, or a variety of other reasons. Classical information retrieval (IR) methods, such as query recommendation or query expansion, were designed to produce the most relevant results. However, if such biases are present in the system, then these methods will also deliver biased results.
IR systems/recommender systems also play a major role in social media algorithms, where platforms have pivoted away from friend-follow timelines to for you timelines containing algorithmically-selected content. If these algorithms are biased (towards, say, maximizing screen time to show ads, maximizing user interaction to likes, comments), then they may push end users towards clickbait or non-mainstream trending topics.
This book presents an overview of modern IR and discusses the work done to mitigate biases in IR systems. It also examines methods for debiasing word embeddings and re-ranking search results to address group fairness, and presents a query reformulation method that analyzes bias in search results and delivers balanced results to the end user.
Awareness of how information retrieval systems work, ways to mitigate bias in search results, and the tradeoffs between accuracy and bias metrics in search results will help readers understand real-world search engines.
Chapter 1 Introduction.- Chapter 2 Unfairness in Information Retrieval.- Chapter 3 Measuring Unfairness.- Chapter 4 Debiasing Word Embeddings.- Chapter 5 Fair Information Retrieval Methods.- Chapter 6 Concluding Thoughts.
Harshit Mishra is a Ph.D. student in the Department of Electrical Engineering and Computer Science at Syracuse University. He holds a Master of Science degree in Computer Science from Syracuse University. His research interests include natural language processing, algorithmic fairness, network science, and AI for social good.
Sucheta Soundarajan is an Associate Professor in the Department of Electrical Engineering and Computer Science at Syracuse University. She received her Ph.D. in Computer Science from Cornell University. Her research interests include the theory and applications of network science, algorithmic fairness, and AI in government.



