Passive and Active Internet Measurements
This page lists datasets and tools that can aid Internet measurement research, especially work focused on security. Feel free to email me other links or datasets that you think would be useful.
Passive measurement datasets
- Twitter: Random tweets API (and relevant operators), Archive repo of tweets, Twitter Trends.
- Website reputation: URLVoid and URLhaus.
- Web and virus databases: VirusTotal, Google Safe Browsing, and IBM X-Force.
- Top websites:
- Network measurements:
- Device search engines: Shodan, Censys, ZoomEye, Project Sonar (Rapid7), Thingful.
- Certificates:
- Domains, Registrars, and DNS:
- Firefox Telemetry: Project, Data publishing.
- Stratosphere Lab datasets (IoT, Wi-Fi, malware, botnet pcaps).
- Phishing: PhishTank (phishing domains) and per-country reports from SecureList.
- Adblock lists: EasyList.
- Web censorship: Citizen Lab (censored websites), Open Observatory of Network Interference (OONI) (data and explore).
- Content measurements:
- App reviews: Android
- Miscellaneous: Internetwache.
Active measurement tools
Research artifacts
- A Catalog of Research Artifacts for Computer Science: FindResearch.