Cybersecurity Research Datasets

November 18, 2019

Best practices, Methods, Research

A persistent challenge in cybersecurity data science (CSDS) is the shortage of labeled datasets that record incidents and attacks. Such data is necessary for understanding attack vectors and for training and validating predictive models.

A number of cybersecurity research datasets are now available and should be of interest to researchers and practitioners:
• CSE-CIC-IDS2018 on AWS: https://www.unb.ca/cic/datasets/ids-2018.html
• Honeynet Project: http://honeynet.org/challenges
• LANL CSR Red Teaming: https://csr.lanl.gov/data/cyber1/
• CTU-13 (Czech Technical University): https://mcfp.weebly.com/the-ctu-13-dataset-a-labeled-dataset-with-botnet-normal-and-background-traffic.html
• SecRepo.com: http://www.secrepo.com/
• VizSec: http://vizsec.org/data/
• Data.gov Cyber Data Sets: https://catalog.data.gov/dataset?tags=cybersecurity
• Malware Traffic Analysis: http://malware-traffic-analysis.net/
• MIT Lincoln Laboratory IDS Data Sets: https://www.ll.mit.edu/r-d/datasets
• Center for Applied Internet Data Analysis (CAIDA) Data Sets: http://www.caida.org/data/overview/
• Protected Repository for the Defense of Infrastructure Against Cyber Threats (PREDICT): https://www.dhs.gov/publication/dhsstpia-006-protected-repository-defense-infrastructure-against-cyber-threats
• NSA Cyber Defense Exercise Data Set: https://www.iad.gov/iad/programs/cyber-defense-exercise/index.cfm
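
As a rough illustration of why labels matter for training and validating predictive models, here is a minimal sketch of a stratified train/validation split over labeled records. It is not tied to any dataset listed above: the records are synthetic, and the field names (`bytes`, `label`), the label values (`benign`, `attack`), and the function name `stratified_split` are all assumptions made up for this example. Real datasets use their own schemas and would need their own parsing step first.

```python
import random
from collections import defaultdict


def stratified_split(records, label_key="label", validation_fraction=0.25, seed=0):
    """Split labeled records into train and validation lists.

    Each label is split separately, so a rare class (for example, attacks)
    appears in both sets in roughly the same proportion as in the input.
    """
    # Group records by their label value.
    by_label = defaultdict(list)
    for record in records:
        by_label[record[label_key]].append(record)

    rng = random.Random(seed)  # fixed seed keeps the split reproducible
    train, validation = [], []
    for label in sorted(by_label):
        group = list(by_label[label])
        rng.shuffle(group)
        n_validation = round(len(group) * validation_fraction)
        validation.extend(group[:n_validation])
        train.extend(group[n_validation:])
    return train, validation


# Synthetic, made-up records: 80 benign and 20 attack (hypothetical schema).
records = (
    [{"bytes": 500 + i, "label": "benign"} for i in range(80)]
    + [{"bytes": 90000 + i, "label": "attack"} for i in range(20)]
)

train, validation = stratified_split(records)
print(len(train), len(validation))
```

Splitting per label rather than over the whole list is one common design choice when attacks are much rarer than normal traffic; a plain random split could leave the validation set with very few attack records.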

Are there other datasets you feel should be added to the list? Message me and I will add them!

 


About SARK7

Scott Allen Mongeau (@SARK7), an INFORMS Certified Analytics Professional (CAP), is a researcher, lecturer, and consulting data scientist. Scott has over 30 years of project-focused experience in data analytics across a range of industries, including IT, biotech, pharma, materials, insurance, law enforcement, financial services, and start-ups. Scott is a part-time lecturer and PhD (ABD) researcher at Nyenrode Business University on the topic of data science. He holds a Global Executive MBA (OneMBA) and a Master's in Financial Management from Erasmus Rotterdam School of Management (RSM). He has a Certificate in Finance from University of California at Berkeley Extension, an MA in Communication from the University of Texas at Austin, and a Graduate Degree (GD) in Applied Information Systems Management from the Royal Melbourne Institute of Technology (RMIT). He holds a BPhil from Miami University of Ohio. Having lived and worked in a number of countries, Scott is a dual American and Dutch citizen.

He may be contacted at: webmaster@sark7.com
LinkedIn: https://www.linkedin.com/in/smongeau/
Twitter: @sark7
Blog: sctr7.com
Web: www.sark7.com

All posts are copyright © 2020 SARK7. All external materials utilized imply no ownership rights and are presented purely for educational purposes.


