This project involves applying graph theory to the analysis of large volumes of DNS data and related datasets, such as the Whois and registration transactions.
Aim
The aim of the project is to evaluate how useful graph theory is for analysing large volumes of registry data, particularly with a view to further enhancing the safety and stability of .nl, the DNS and the internet in general.
Expected results
The project is expected to yield the following results:
A model of the types of object that SIDN manages in its role as .nl registry (e.g. registrar, registrant, name server and domain name transaction) and the interrelationships amongst those objects
A instantiation of the model with Apache Spark, based on operational data
Applications/usage cases and visualisations of the model, e.g. for detecting botnets, phishing scams or malicious domain name registrations
A study report
Profile
We are looking for a highly motivated MSc student with a passion for internet technology, mathematics, data analysis and their visualisation. You are energised by investigating exactly how mechanisms and systems work and you are able to devise and evaluate solutions. The ideal candidate is a self-starter with an enquiring mind, capable of developing proofs of concept.
Interested?
If you are interested in this project, e-mail labs@sidn.nl explaining why you think you are the right person to carry it out. Questions can be mailed to the same address.
About SIDN Labs
SIDN manages the Dutch national internet domain, .nl. We are an innovative domain name registry with our own DNS anycast network and more DNSSEC-secured domain names than any other top-level domain in the world (2.4 million of the 5.5 million .nl domain names). SIDN Labs is our R&D arm, which develops and evaluates new technologies and systems with a view to further enhancing the stability and security of .nl, the DNS and the wider internet. On our ENTRADA big data platform, we retain more than a year's DNS data, including about 65 billion unique DNS queries, of which we receive 150,000,000 a day. More information on www.sidnlabs.nl.