CAIA DATASET ------------ NAME: CAIA-DATA-WIKI-150816A.txt URL: http://caia.swin.edu.au/mapping/data/CAIA-DATA-WIKI-150816A.tar.bz2 SHORT DESCRIPTION: The dataset contains unique IPv4 addresses collected from Wikipedia's edit logs for 3-month time periods over 4 years, starting at the beginning of 2011 and ending at the end of 2014. LONG DESCRIPTION: Each of the files in this dataset contains the unique IPv4 addresses observed on Wikipedia edit logs during a 3-month period. Note that only edit entries from anonymous users have IP addresses attached. In July 2012 we downloaded the -stub-meta-history.xml files for a large number of Wikis and extracted the IPv4 addresses from these. Since August 2012 we downloaded the incremental changes for a number of Wikis on a daily basis and extracted the IPv4 addresses from these. Wikis include the en de fr es it nl ja pl pt ru ar id ca cs da eo fa ko lt hu no ro sk sr fi sv vi tr uk zh ms bg et el simple eu gl he hr nn sl sh th versions of wikipedia.org, commons.wikimedia.org, en version of wikitravel.org, en de es fr versions of wiktionary.org and wikibooks.org. CITATION: If you use this data, please reference our original paper using the datasets: S. Zander, L. L. H. Andrew, G. Armitage, "Capturing Ghosts: Predicting the Used IPv4 Space by Inferring Unobserved Addresses", in Internet Measurement Conference (IMC), November 2014. FILE NAMING: period_XX_wiki_unique_ips.txt.bz2 TIME PERIODS: Period Start_date End_date 00 20110101 20110331 01 20110401 20110630 02 20110701 20110930 03 20111001 20111231 04 20120101 20120331 05 20120401 20120630 06 20120701 20120930 07 20121001 20121231 08 20130101 20130331 09 20130401 20130630 10 20130701 20130930 11 20131001 20131231 12 20140101 20140331 13 20140401 20140630 14 20140701 20140930 15 20141001 20141231 FILES: MD5 (period_00_wiki_unique_ips.txt.bz2) = d5f7963a10782c20bed269b9ddbf2cb8 MD5 (period_01_wiki_unique_ips.txt.bz2) = 15d8eb23496e8725f29d04d5b9bad671 MD5 (period_02_wiki_unique_ips.txt.bz2) = c638c8cc68044b382bb710b4f3830625 MD5 (period_03_wiki_unique_ips.txt.bz2) = 1ab52fa1400e9018be34c0cdf9e9be14 MD5 (period_04_wiki_unique_ips.txt.bz2) = 26aa68da2102307c6ddc9067391cad42 MD5 (period_05_wiki_unique_ips.txt.bz2) = 687b087ee4d1dfde9dbc2fb0e19219bb MD5 (period_06_wiki_unique_ips.txt.bz2) = dc70c790e75fe4bc311fd3e037a2ccd6 MD5 (period_07_wiki_unique_ips.txt.bz2) = 2abebfe89993d32888575b02c4fe4aa1 MD5 (period_08_wiki_unique_ips.txt.bz2) = c5dc150d2174b578884e0e9978baaa5f MD5 (period_09_wiki_unique_ips.txt.bz2) = 126bb4a6f625ffdcc89d585d9a485613 MD5 (period_10_wiki_unique_ips.txt.bz2) = 79fe9b17a5b45600f81ed433087cd9cd MD5 (period_11_wiki_unique_ips.txt.bz2) = eb3ac5313962fb4ad98d97f2f50c7cd7 MD5 (period_12_wiki_unique_ips.txt.bz2) = 46f169f717db95e0446fcf42f29b16d6 MD5 (period_13_wiki_unique_ips.txt.bz2) = 4fc5aeb48850c972e2e5682d25964774 MD5 (period_14_wiki_unique_ips.txt.bz2) = 84b529244e0837a0732e25583b6d91af MD5 (period_15_wiki_unique_ips.txt.bz2) = ac890ddaf4a3e4af674320c920ec3673