skip to main content
10.1145/1963405.1963488acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

Layered label propagation: a multiresolution coordinate-free ordering for compressing social networks

Published: 28 March 2011 Publication History

Abstract

We continue the line of research on graph compression started with WebGraph, but we move our focus to the compression of social networks in a proper sense (e.g., LiveJournal): the approaches that have been used for a long time to compress web graphs rely on a specific ordering of the nodes (lexicographical URL ordering) whose extension to general social networks is not trivial. In this paper, we propose a solution that mixes clusterings and orders, and devise a new algorithm, called Layered Label Propagation, that builds on previous work on scalable clustering and can be used to reorder very large graphs (billions of nodes). Our implementation uses task decomposition to perform aggressively on multi-core architecture, making it possible to reorder graphs of more than 600 millions nodes in a few hours.
Experiments performed on a wide array of web graphs and social networks show that combining the order produced by the proposed algorithm with the WebGraph compression framework provides a major increase in compression with respect to all currently known techniques, both on web graphs and on social networks. These improvements make it possible to analyse in main memory significantly larger graphs.

References

[1]
Alberto Apostolico and Guido Drovandi. Graph compression by BFS. Algorithms, 2(3):1031--1044, 2009.
[2]
Michael J. Barber and John W. Clark. Detecting network communities by propagating labels under constraints. Phys. Rev. E, 80(2):026129, Aug 2009.
[3]
Krishna Bharat, Andrei Broder, Monika Henzinger, Puneet Kumar, and Suresh Venkatasubramanian. The Connectivity Server: fast access to linkage information on the Web. Computer Networks and ISDN Systems, 30(1--7):469--477, 1998.
[4]
Paolo Boldi, Massimo Santini, and Sebastiano Vigna. Permuting web graphs. In WAW '09: Proceedings of the 6th International Workshop on Algorithms and Models for the Web-Graph, pages 116--126, Berlin, Heidelberg, 2009. Springer-Verlag.
[5]
Paolo Boldi, Massimo Santini, and Sebastiano Vigna. Permuting web and social graphs. Internet Math., 6(3):257--283, 2010.
[6]
Paolo Boldi and Sebastiano Vigna. The WebGraph framework I: Compression techniques. In Proc. of the Thirteenth International World Wide Web Conference, pages 595--601. ACM Press, 2004.
[7]
Flavio Chierichetti, Ravi Kumar, Silvio Lattanzi, Michael Mitzenmacher, Alessandro Panconesi, and Prabhakar Raghavan. On compressing social networks. In KDD '09: Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 219--228, New York, NY, USA, 2009. ACM.
[8]
Santo Fortunato. Community detection in graphs. Physics Report, 486:75--174, February 2010.
[9]
Santo Fortunato and Marc Barthelemy. Resolution limit in community detection. Proceedings of the National Academy of Science, 104:36--41, January 2007.
[10]
Anna C. Gilbert and Kirill Levchenko. Compressing network graphs. In Proceedings of the LinkKDD workshop at the 10th ACM Conference on KDD, August 2004.
[11]
Ankur Gupta, Wing-Kai Hon, Rahul Shah, and Jeffrey Scott Vitter. Compressed data structures: Dictionaries and data-aware measures. Theoret. Comput. Sci., 387(3):313--331, 2007.
[12]
Guy Jacobson. Space-efficient static trees and graphs. In 30th Annual Symposium on Foundations of Computer Science, pages 549--554, Research Triangle Park, North Carolina, 1989. IEEE.
[13]
David Knoke and Song Yang. Social Network Analysis. Sage Publications, Inc, second edition edition, 2008.
[14]
Donald E. Knuth. The Art of Computer Programming, Volume 4, Fascicle 2: Generating All Tuples and Permutations (Art of Computer Programming). Addison-Wesley Professional, 2005.
[15]
Hossein Maserrat and Jian Pei. Neighbor query friendly compression of social networks. In Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 533--542. ACM, 2010.
[16]
Marina Meil\va. Comparing clusterings: an axiomatic view. In ICML '05: Proceedings of the 22nd international conference on Machine learning, pages 577--584, New York, NY, USA, 2005. ACM.
[17]
Mark E. J. Newman and Michelle Girvan. Finding and evaluating community structure in networks. Phys. Rev. E, 69(2):026113, Feb 2004.
[18]
Usha N. Raghavan, Réka Albert, and Soundar Kumara. Near linear time algorithm to detect community structures in large-scale networks. Physical Review E (Statistical, Nonlinear, and Soft Matter Physics), 76(3), 2007.
[19]
Keith H. Randall, Raymie Stata, Janet L. Wiener, and Rajiv G. Wickremesinghe. The Link Database: Fast access to graphs of the web. In Proceedings of the Data Compression Conference, pages 122--131, Washington, DC, USA, 2002. IEEE Computer Society.
[20]
Peter Ronhovde and Zohar Nussinov. Local resolution-limit-free Potts model for community detection. Phys. Rev. E, 81(4):046114, Apr 2010.
[21]
Ilya Safro and Boris Temkin. Multiscale approach for the network compression-friendly ordering. Journal of Discrete Algorithms, 2010.
[22]
Gergely Tibély and János Kertész. On the equivalence of the label propagation method of community detection and a Potts model approach. Physica A Statistical Mechanics and its Applications, 387:4982--4984, August 2008.
[23]
Sebastiano Vigna. Stanford matrix considered harmful. In Andreas Frommer, Michael W. Mahoney, and Daniel B. Szyld, editors, Web Information Retrieval and Linear Algebra Algorithms, number 07071 in Dagstuhl Seminar Proceedings. Internationales Begegnungs- und Forschungszentrum für Informatik (IBFI), Schloss Dagstuhl, Germany, 2007.
[24]
Stanley Wasserman, Katherine Faust, and Dawn Iacobucci. Social Network Analysis : Methods and Applications (Structural Analysis in the Social Sciences). Cambridge University Press, 1994.

Cited By

View all
  • (2024)SeraphProceedings of the 22nd USENIX Conference on File and Storage Technologies10.5555/3650697.3650719(373-387)Online publication date: 27-Feb-2024
  • (2024)Floating-point histograms for exploratory analysis of large scale real-world data setsIntelligent Data Analysis10.3233/IDA-230638(1-48)Online publication date: 25-Jan-2024
  • (2024)Enabling Window-Based Monotonic Graph Analytics with Reusable Transitional Results for Pattern-Consistent QueriesProceedings of the VLDB Endowment10.14778/3681954.368197917:11(3003-3016)Online publication date: 30-Aug-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
WWW '11: Proceedings of the 20th international conference on World wide web
March 2011
840 pages
ISBN:9781450306324
DOI:10.1145/1963405
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 28 March 2011

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. graph clustering
  2. graph compression
  3. social networks
  4. web graphs

Qualifiers

  • Research-article

Conference

WWW '11
WWW '11: 20th International World Wide Web Conference
March 28 - April 1, 2011
Hyderabad, India

Acceptance Rates

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)49
  • Downloads (Last 6 weeks)2
Reflects downloads up to 04 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2024)SeraphProceedings of the 22nd USENIX Conference on File and Storage Technologies10.5555/3650697.3650719(373-387)Online publication date: 27-Feb-2024
  • (2024)Floating-point histograms for exploratory analysis of large scale real-world data setsIntelligent Data Analysis10.3233/IDA-230638(1-48)Online publication date: 25-Jan-2024
  • (2024)Enabling Window-Based Monotonic Graph Analytics with Reusable Transitional Results for Pattern-Consistent QueriesProceedings of the VLDB Endowment10.14778/3681954.368197917:11(3003-3016)Online publication date: 30-Aug-2024
  • (2024)FSM: A Fine-Grained Splitting and Merging Framework for Dual-Balanced Graph PartitionProceedings of the VLDB Endowment10.14778/3665844.366586417:9(2378-2391)Online publication date: 1-May-2024
  • (2024)Improving Graph Compression for Efficient Resource-Constrained Graph AnalyticsProceedings of the VLDB Endowment10.14778/3665844.366585217:9(2212-2226)Online publication date: 1-May-2024
  • (2024)FlowWalker: A Memory-Efficient and High-Performance GPU-Based Dynamic Graph Random Walk FrameworkProceedings of the VLDB Endowment10.14778/3659437.365943817:8(1788-1801)Online publication date: 1-Apr-2024
  • (2024)RAGraph: A Region-Aware Framework for Geo-Distributed Graph ProcessingProceedings of the VLDB Endowment10.14778/3632093.363209417:3(264-277)Online publication date: 20-Jan-2024
  • (2024)Algorithm 1043: Faster Randomized SVD with Dynamic ShiftsACM Transactions on Mathematical Software10.1145/366062950:2(1-27)Online publication date: 23-Apr-2024
  • (2024)Play like a Vertex: A Stackelberg Game Approach for Streaming Graph PartitioningProceedings of the ACM on Management of Data10.1145/36549652:3(1-27)Online publication date: 30-May-2024
  • (2024)Graph Summarization: Compactness Meets EfficiencyProceedings of the ACM on Management of Data10.1145/36549432:3(1-26)Online publication date: 30-May-2024
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media