Network Structure Analysis
-
Dataset
- Pajek Dataset
Collection of complex network datasets which are obtained through Pajek [(Slovene word for Spider) is a program,
for Windows, for analysis and visualization of large networks].
- Large-scale network data
Large Network Dataset Collection of various categories such as the social networks like Facebook, Twitter
, communication networks like Enron, citation networks like Arxiv High Energy Physics paper citation network,
collaboration networks like collaboration network of Arxiv High Energy Physics Theory, Amazon network, Internet
p2p networks and other online communities like Flickr and Reddit.
- Small-scale network data
Small Network Dataset Collection of various networks such as Zachary's karate club, Word adjacencies, American
College football, Power grid, High-energy theory collaborations, Coauthorships in network science and etc.
-
Tools
- Identifying communities
Interactive GUI to help you identify communities in a graph.
- Breaking communities
Interactive GUI to help you identify and break selected/all communities in a graph. The selected community and the community structure in the graph before and after breaking the selected community will be available to view.
- Source Codes
The network structure analysis algorithms are presented here. A brief description and link to download the
source code are provided for your reference. The algorithms include detection of overlapping communities,
adaptively finding overlapping community structure, community vulnerability assessment and approximations and
maximizing the modularity.
s
Information Diffusion and Social Influence
-
Dataset
- Social Media Dataset
Social Computing Data Repository hosts data from a collection of many different social media sites,
most of which have blogging capacity. Some of the prominent social media sites included in this
repository are BlogCatalog, Twitter, MyBlogLog, Digg, StumbleUpon, del.icio.us, MySpace, LiveJournal,
The Unofficial Apple Weblog (TUAW), Reddit, etc. The repository contains various facets of blog data
including blog site metadata like, user defined tags, predefined categories, blog site description;
blog post level metadata like, user defined tags, date and time of posting; blog posts; blog post mood
(which is defined as the blogger's emotions when (s)he wrote the blog post); blogger name; blog post
comments; and blogger social network.
- Interdependent Networks
Collection of interdependent network (overlapping users) data among various social networks, such as
Foursquare and Twitter and collaboration networks such as Co-author networks in the area of
Condensed Matter(CM), High-Energy Theory(Het), and Network Science (NetS).
-
Tools
- Source codes
The Information diffusion and social influence algorithms are presented here. A brief description
and link to download the source code are provided for your reference. The algorithms include viral advertising
in online social networks (OSN) [Under construction]
Smart Grids
-
Dataset
- Power Systems Test Case Archive
The above is the link to the UW Power System Test Case Archive which provides access to power system data (test cases). To use the archive,
select the test case that interests you, then select the appropriate parts of it. If you have selected a text
file, its contents will appear on your browser. At this point, use the File Save As option to download it.
Other file types will invite you to save them when you select them.
- Bus test cases for Power Grid
The is the link for datasets compiled by Daniel Bienstock and Abhinav Verma, in their paper "The N - k Problem
in Power Grids: New Models, Formulations and Numerical Experiments". It includes the 58- and 118-bus
test cases with some variations on the capacities; as well as the 49-node \square grid" example and
three larger networks created using the replication technique described at the start of Section 2.4 of the above cited
paper: a 300-node, 409-arc network, a 600-node, 990-arc network, and a 619-node, 1368-arc network.
Additional artificial networks were created to test specific conditions. Please cite the source if you plan to
use the dataset.
- Western States Power Grid
An undirected, unweighted network representing the topology of the Western States Power Grid of the United States.
Data compiled by D. Watts and S. Strogatz and made available on the web here. Please cite D. J. Watts and
S. H. Strogatz, Nature 393, 440-442 (1998).
Other Useful tools