IPFS DHT #

One of the core components of IPFS is the Distributed Hash Table (DHT), which enables nodes to locate and retrieve content from other nodes on the network. The IPFS DHT is a distributed key-value store that maps content addresses (aka CIDs) to the nodes that are currently storing that content. It works by dividing the content address space into small, hash-based “buckets” that are distributed across the network. When a node wants to find a piece of content, it queries the DHT with the content’s CID, and the DHT returns the nodes that are currently storing that content, in the form of Provider Records. This allows content to be located and retrieved in a decentralized, efficient, and fault-tolerant manner. The IPFS DHT is a key component of the IPFS network, and is used by many IPFS implementations, tools, as well as a variety of decentralized applications and systems built on top of IPFS.

Health #

As a distributed system, IPFS relies on the coordinated participation of multiple nodes and servers to function correctly. Monitoring the availability and long term stability of DHT servers over time can give insight into the health of the network. High churn in the network can make content harder to locate and lead to longer retrieval times. Measuring DHT server availability and expected lifetimes can help assess the health and overall efficiency of the network.

Availability #

The Nebula crawler attempts to connect to DHT Server peers in the IPFS DHT periodically. When a new DHT Server peer is discovered, the crawler records the start of a session of availability and extends the session length with every successful connection attempt. However, a failed connection terminates the session, and a later successful attempt starts a new session. Peers can have multiple sessions of availability during each measurement period.

In the following, a peer is classified as “online” if it was available for at least 80% of the measurement period. If a peer was available between 40% and 80% of the period, it is considered “mostly online,” while “mostly offline” indicates availability between 10% and 40% of the time. Any peer that was available for less than 10% of the period is classified as “offline.”

IPFS DHT #

Health #

Availability #

DHT Server Availability #

DHT Server Availability, classified over time #

DHT Server Availability, classified by region #

Churn #

Capabilities #

Transports #

Performance #

Lookup Performance #

Median DHT Lookup Performance over time #

DHT Lookup Performance Distribution #

DHT Lookup Performance Distribution, by region #

Publish Performance #

Median DHT Publish Performance over time #

DHT Publish Performance Distribution #

DHT Publish Performance Distribution, by region #

Participation in the DHT #

Client vs Server Node Estimate #

DHT Server Software #

Most Frequent DHT Server Agents #

Active Kubo Versions #

Kubo Version Distribution #

Recent Kubo Versions Over Time #

DHT Key Density Monitoring #

Keyspace population distribution #

Keyspace density distribution #