skip to main content
10.1145/3539618.3591840acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
short-paper
Open access

Facebook Content Search: Efficient and Effective Adapting Search on A Large Scale

Published: 18 July 2023 Publication History

Abstract

Facebook content search is a critical channel that enables people to discover the best content to deepen their engagement with friends and family, creators, and communities. Building a highly personalized search engine to serve billions of daily active users to find the best results from a large scale of candidates is a challenging task. The search engine must take multiple dimensions into consideration, including different content types, different query intents, and user social graph, etc. In this paper, we discuss the challenges of Facebook content search in depth, and then describe our novel approach to efficiently handling a massive number of documents with advanced query understanding, retrieval, and machine learning techniques. The proposed system has been fully verified and applied to the production system of Facebook Search, which serves billions of users.

Supplemental Material

MP4 File
In this video, we discuss the challenges of Facebook content search, and then describes our novel approach to efficiently handling a massive number of documents with advanced query understanding, retrieval, and machine learning techniques.

References

[1]
Ricardo Baeza-Yates, Liliana Calderón-Benavides, and Cristina González-Caro. The intention behind web queries. In String Processing and Information Retrieval: 13th International Conference, SPIRE 2006, Glasgow, UK, October 11-13, 2006. Pro- ceedings 13, pages 98--109. Springer, 2006.
[2]
Andrei Broder. A taxonomy of web search. In ACM Sigir forum, volume 36, pages 3--10. ACM New York, NY, USA, 2002.
[3]
Michael Curtiss, Iain Becker, Tudor Bosman, Sergey Doroshenko, Lucian Grijincu, Tom Jackson, Sandhya Kunnatur, Soren Lassen, Philip Pronin, Sriram Sankar, et al. Unicorn: A system for searching the social graph. Proceedings of the VLDB Endowment, 6(11):1150--1161, 2013.
[4]
Junwu Du, Zhimin Zhang, Jun Yan, Yan Cui, and Zheng Chen. Using search session context for named entity recognition in query. In Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval, pages 765--766, 2010.
[5]
Georges Dupret and Ciya Liao. A model to estimate intrinsic document relevance from the clickthrough logs of a web search engine. In Proceedings of the third ACM international conference on Web search and data mining, pages 181--190, 2010.
[6]
Kuzman Ganchev, Keith Hall, Ryan McDonald, and Slav Petrov. Using search-logs to improve query tagging. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pages 238--242, 2012.
[7]
Nadav Golbandi Golbandi, Liran Katzir Katzir, Yehuda Koren Koren, and Ronny Lempel Lempel. Expediting search trend detection via prediction of query counts. In Proceedings of the sixth ACM international conference on Web search and data mining, pages 295--304, 2013.
[8]
Jiafeng Guo, Gu Xu, Xueqi Cheng, and Hang Li. Named entity recognition in query. In Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, pages 267--274, 2009.
[9]
Yunzhong He, Cong Zhang, Ruoyan Kong, Chaitanya Kulkarni, Qing Liu, Ashish Gandhe, Amit Nithianandan, and Arul Prakash. Hiercat: Hierarchical query categorization from weakly supervised data at facebook marketplace. In Proceedings of the ACM Web Conference 2023 (WWW '23 Companion), 2023.
[10]
Steven CH Hoi and Michael R Lyu. A multimodal and multilevel ranking scheme for large-scale video retrieval. IEEE transactions on Multimedia, 10(4):607--619, 2008.
[11]
Jui-Ting Huang, Ashish Sharma, Shuying Sun, Li Xia, David Zhang, Philip Pronin, Janani Padmanabhan, Giuseppe Ottaviano, and Linjun Yang. Embedding-based retrieval in facebook search. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pages 2553--2561, 2020.
[12]
Ryan Kiros, Ruslan Salakhutdinov, and Richard S Zemel. Unifying visual- semantic embeddings with multimodal neural language models. arXiv preprint arXiv:1411.2539, 2014.
[13]
Jon Kleinberg. Bursty and hierarchical structure in streams. In Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 91--101, 2002.
[14]
Guillaume Lample and Alexis Conneau. Cross-lingual language model pretraining. arXiv preprint arXiv:1901.07291, 2019.
[15]
Ying Li, Zijian Zheng, and Honghua Dai. Kdd cup-2005 report: Facing a great challenge. ACM SIGKDD Explorations Newsletter, 7(2):91--99, 2005.
[16]
Zhen Liao. pirank: A probabilistic intent based ranking framework for facebook search, 2022.
[17]
Yiqun Liu, Xiaohui Xie, Chao Wang, Jian-Yun Nie, Min Zhang, and Shaoping Ma. Time-aware click model. ACM Transactions on Information Systems (TOIS), 35(3):1--24, 2016.
[18]
Mehdi Manshadi and Xiao Li. Semantic tagging of web search queries. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, pages 861--869, 2009.
[19]
Tao Mei, Yong Rui, Shipeng Li, and Qi Tian. Multimedia search reranking: A literature survey. ACM Computing Surveys (CSUR), 46(3):1--38, 2014.
[20]
Adam Paszke, Sam Gross, Soumith Chintala, Gregory Chanan, Edward Yang, Zachary DeVito, Zeming Lin, Alban Desmaison, Luca Antiga, and Adam Lerer. Automatic differentiation in pytorch. 2017.
[21]
Stephen Robertson, Hugo Zaragoza, et al. The probabilistic relevance framework: Bm25 and beyond. Foundations and Trends® in Information Retrieval, 3(4):333--389, 2009.
[22]
Daniel E Rose and Danny Levinson. Understanding user goals in web search. In Proceedings of the 13th international conference on World Wide Web, pages 13--19, 2004.
[23]
Shikui Wei, Yao Zhao, Zhenfeng Zhu, and Nan Liu. Multimodal fusion for video search reranking. IEEE Transactions on Knowledge and Data Engineering, 22(8):1191--1199, 2009.
[24]
Jun Yu, Xiaokang Yang, Fei Gao, and Dacheng Tao. Deep multimodal distance metric learning using click constraints for image ranking. IEEE transactions on cybernetics, 47(12):4014--4024, 2016.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGIR '23: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval
July 2023
3567 pages
ISBN:9781450394086
DOI:10.1145/3539618
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 18 July 2023

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. information retrieval
  2. query understanding
  3. social media

Qualifiers

  • Short-paper

Conference

SIGIR '23
Sponsor:

Acceptance Rates

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 635
    Total Downloads
  • Downloads (Last 12 months)515
  • Downloads (Last 6 weeks)66
Reflects downloads up to 01 Nov 2024

Other Metrics

Citations

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media