
A challenging aspect of cybersecurity data science (CSDS) is the lack of labeled datasets that record incidents and attacks. Such data is necessary for understanding attack vectors and for training and validating predictive models.
A number of cybersecurity research datasets are now available and should be of interest to researchers and practitioners:
• CSE-CIC-IDS2018 on AWS: https://www.unb.ca/cic/datasets/ids-2018.html
• Honeynet Project: http://honeynet.org/challenges
• LANL CSR Red Teaming: https://csr.lanl.gov/data/cyber1/
• CTU-13 (CTU University): https://mcfp.weebly.com/the-ctu-13-dataset-a-labeled-dataset-with-botnet-normal-and-background-traffic.html
• SecRepo.com: http://www.secrepo.com/
• VizSec: http://vizsec.org/data/
• Data.gov Cyber Data Sets: https://catalog.data.gov/dataset?tags=cybersecurity
• Malware Traffic Analysis: http://malware-traffic-analysis.net/
• MIT Lincoln Laboratory IDS Data Sets: https://www.ll.mit.edu/r-d/datasets
• Center for Applied Internet Data Analysis (CAIDA) Data Sets: http://www.caida.org/data/overview/
• Protected Repository for the Defense of Infrastructure Against Cyber Threats (PREDICT): https://www.dhs.gov/publication/dhsstpia-006-protected-repository-defense-infrastructure-against-cyber-threats
• NSA Cyber Defense Exercise Data Set: https://www.iad.gov/iad/programs/cyber-defense-exercise/index.cfm
Are there new datasets you feel should be added to the list? Let me know by sending a message and I will add them!