Network Structure Analysis > Small-scale datasets
This page contains links to some network data sets that have been compiled over the years. All of these are free
for scientific use to the best of our knowledge, meaning that the original authors have already made the data freely
available. If you make use of any of these data, please cite the original sources.
- Zachary's karate club
Social network of friendships between 34 members of a karate club at a US university in the 1970s.
Please cite W. W. Zachary, An information flow model for conflict and fission in small groups,
Journal of Anthropological Research 33, 452-473 (1977).
- Les Miserables
Coappearance network of characters in the novel Les Miserables. Please cite D. E. Knuth, The Stanford GraphBase: A
Platform for Combinatorial Computing, Addison-Wesley, Reading, MA (1993).
- Word adjacencies
Adjacency network of common adjectives and nouns in the novel David Copperfield by Charles Dickens.
Please cite M. E. J. Newman, Phys. Rev. E 74, 036104 (2006).
- American College football
Network of American football games between Division IA colleges during the regular season of Fall 2000.
Please cite M. Girvan and M. E. J. Newman, Proc. Natl. Acad. Sci. USA 99, 7821-7826 (2002).
- Neural network
A directed, weighted network representing the neural network of C. elegans. Data compiled by D. Watts
and S. Strogatz and made available on the web here. Please cite D. J. Watts and S. H. Strogatz, Nature
393, 440-442 (1998). Original experimental data taken from J. G. White, E. Southgate, J. N. Thomson,
and S. Brenner, Phil. Trans. R. Soc. London 314, 1-340 (1986).
- Power grid
An undirected, unweighted network representing the topology of the Western States Power Grid of the
United States. Data compiled by D. Watts and S. Strogatz and made available on the web here. Please
cite D. J. Watts and S. H. Strogatz, Nature 393, 440-442 (1998).
- Condensed matter collaborations 1999
Weighted network of coauthorships between scientists posting preprints on the Condensed Matter E-Print
Archive between Jan 1, 1995 and December 31, 1999. Please cite M. E. J. Newman, The structure of scientific
collaboration networks, Proc. Natl. Acad. Sci. USA 98, 404-409 (2001).
- Condensed matter collaborations 2003
Updated network of coauthorships between scientists posting preprints on the Condensed Matter E-Print
Archive. This version includes all preprints posted between Jan 1, 1995 and June 30, 2003. The largest
component of this network, which contains 27519 scientists, has been used by several authors as a test-bed
for community-finding algorithms for large networks; see for example J. Duch and A. Arenas, Phys. Rev. E 72,
027104 (2005). These data can be cited as M. E. J. Newman, Proc. Natl. Acad. Sci. USA 98, 404-409 (2001).
- Condensed matter collaborations 2005
Updated network of coauthorships between scientists posting preprints on the Condensed Matter E-Print Archive.
This version includes all preprints posted between Jan 1, 1995 and March 31, 2005. Please cite M. E. J. Newman,
Proc. Natl. Acad. Sci. USA 98, 404-409 (2001).
- Astrophysics collaborations
Weighted network of coauthorships between scientists posting preprints on the Astrophysics E-Print Archive
between Jan 1, 1995 and December 31, 1999. Please cite M. E. J. Newman, Proc. Natl. Acad. Sci. USA 98, 404-409
(2001).
- High-energy theory collaborations
Weighted network of coauthorships between scientists posting preprints on the High-Energy Theory E-Print
Archive between Jan 1, 1995 and December 31, 1999. Please cite M. E. J. Newman, Proc. Natl. Acad. Sci. USA 98,
404-409 (2001).
- Coauthorships in network science
Coauthorship network of scientists working on network theory and experiment, as compiled by M. Newman in May 2006.
A figure depicting the largest component of this network can be found here. These data can be cited as M. E. J.
Newman, Phys. Rev. E 74, 036104 (2006).
- Synthesized Data
Synthesized networks of 1000 and 5000 nodes, generated with different mutation rates of the overlapping ratio.
- Dolphin social network
An undirected social network of frequent associations between 62 dolphins in a community living off
Doubtful Sound, New Zealand. Please cite D. Lusseau, K. Schneider, O. J. Boisseau, P. Haase, E. Slooten,
and S. M. Dawson, Behavioral Ecology and Sociobiology 54, 396-405 (2003). Link to the Source
- Political blogs
A directed network of hyperlinks between weblogs on US politics, recorded in 2005 by Adamic and Glance.
Please cite L. A. Adamic and N. Glance, "The political blogosphere and the 2004 US Election", in
Proceedings of the WWW-2005 Workshop on the Weblogging Ecosystem (2005). Link to the Source
- Books about US politics
A network of books about US politics published around the time of the 2004 presidential election and
sold by the online bookseller Amazon.com. Edges between books represent frequent copurchasing of books
by the same buyers. The network was compiled by V. Krebs and is unpublished, but can be found on Krebs'
web site. Link to the Source
- Internet
A symmetrized snapshot of the structure of the Internet at the level of autonomous systems, reconstructed from
BGP tables posted by the University of Oregon Route Views Project. This snapshot was created by Mark Newman from
data for July 22, 2006 and has not been previously published. Link to the Source
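
Several entries above mention the "largest component" of a network or a "symmetrized" snapshot. The sketch below illustrates both operations on a plain edge list, using only the Python standard library. The function names, the decision to drop self-loops, and the toy edge list are assumptions made for illustration. They are not taken from any of the data sets, and this is not necessarily how the original compilers processed their data.

```python
from collections import defaultdict, deque


def symmetrize(directed_edges):
    """Return the set of undirected edges underlying a directed edge list."""
    undirected = set()
    for u, v in directed_edges:
        if u != v:  # dropping self-loops is an assumption, not stated in the source
            undirected.add((min(u, v), max(u, v)))
    return undirected


def largest_component(edges):
    """Return the node set of the largest connected component."""
    adjacency = defaultdict(set)
    for u, v in edges:
        adjacency[u].add(v)
        adjacency[v].add(u)

    seen = set()
    best = set()
    for start in adjacency:
        if start in seen:
            continue
        # Breadth-first search collects every node reachable from `start`.
        component = {start}
        queue = deque([start])
        while queue:
            node = queue.popleft()
            for neighbor in adjacency[node]:
                if neighbor not in component:
                    component.add(neighbor)
                    queue.append(neighbor)
        seen |= component
        if len(component) > len(best):
            best = component
    return best


# Toy directed edge list, made up for illustration only.
toy_edges = [(1, 2), (2, 1), (2, 3), (4, 5)]
undirected = symmetrize(toy_edges)
giant = largest_component(undirected)
```

For the larger collaboration networks, a graph library would normally replace these hand-written helpers. The version here only makes the two terms concrete.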