Nebula: A Network Agnostic DHT Crawler
Dennis Trautwein
Dennis presents Nebula, ProbeLab's network-agnostic DHT crawler, and demonstrates it live against several supported networks including IPFS/Amino, the Ethereum consensus and execution layers, Filecoin, Polkadot/Kusama/Rococo/Westend, and the newly added Celestia mainnet. He explains the crawl algorithm — connecting to bootstrap peers, generating keys that fall into each Kademlia bucket, and recursively asking peers for ever-closer nodes until the reachable set is exhausted — and walks through the CLI flags, dry-run mode, and the four newline-delimited JSON output files that capture crawl metadata, per-peer visits with protocols/agent versions/errors, and optional neighbor data for full topology reconstruction. He shows how the monitor subcommand reconnects to discovered peers with exponential backoff to track uptime and churn, feeding studies on routing-table health, replication factor, and the IPFS DHT incident earlier in the year. The talk also describes how Nebula was recently refactored to abstract over different DHT implementations so it can map disc-v5 ENR node IDs to libp2p peer IDs and extract Ethereum-specific fields like fork digest into a free-form properties map, and ends with a roadmap pitch covering IPNS, GossipSub, and bandwidth measurements.