The iovation Site
HOME  |  ABOUT  |  CONTACT  |  RSS  |  VISIT IOVATION

Not All IP Addresses Are Created Equally

October 8th, 2009 by Bart Lonchar

IP Addresses have long been used in device fingerprinting solutions, but their utility has been hit-and-miss due to differences in how groups of IP addresses are managed. As a result, solutions relying on the IP address for device identification generally experience high false positive rates; this is especially true in cases where the same IP address has been issued to different end users over time.

Based on data iovation has collected from performing over two billion device identification requests, we’ve developed techniques to more accurately assess the relevance of an IP address in identifying and re-recognizing a device. This allows us to use IP address as a factor, when appropriate, and ignore it when not.

One of the keys to successfully utilizing IP addresses in device fingerprinting is to understand how different service providers manage their IP addresses. Some service providers go to great lengths to assign the same IP address to the same user over time, even when DHCP (Dynamic Host Configuration Protocol) is used for obtaining an IP address. Other providers make use of a smaller pool of IP addresses, requiring them to reissue the same IP addresses to different users over time. Mobile service providers present the most extreme example of this type.

To better understand the issue, I decided to take a closer look at some of our data. Over a recent 30-day window, I collected data from device identification requests in which we could definitively say that the correct device was identified via its fingerprint. (By limiting the study to these requests, the correlation of IP addresses to devices can be done with confidence because the device identifier is a statistical truth value.)

Analysis of this data (presented below) shows which IP addresses are associated with multiple end-user devices, ultimately allowing for a better understanding of different service providers’ policies with respect to reusing IP addresses. This information, in turn, allows us to determine how effective different IP addresses will be for unique device identification.

Metrics Computed
For each service provider, the following metrics were computed:

  • Number of IP addresses (IPA)
  • Number of IP address and device combinations (IPD)
  • The ratio of IPD to IPA

Many service providers have an IPD to IPA ratio very close to 1, suggesting a policy that attempts to assign a user with the same IP address over time. On the other hand, some service providers have an IPD to IPA ratio over 100, suggesting a policy that liberally reuses IP addresses among users. Of course, there are service providers everywhere in-between.

Examples

  1. On the low end of the scale (where a single IP address tends to correlate directly to a single device) is H3G Italy. During the study period, 20,509 IP addresses managed by this service provider were encountered, with 22,545 device and IP address combinations, giving them an IPD to IPA ratio of 1.09.
  2. On the high end of the scale (where a single IP address tends to be associated with multiple devices) is danger.com. From this service provider we encountered 54 unique IP addresses covering 4,967 device and IP address combinations, resulting in an IPD to IPA ratio of 91.9.

Results
On aggregate, I grouped the values of IPD to IPA ratios into ranges and each range was analyzed using frequency distributions. Based on a device fingerprinting system’s optimal performance goals and tolerance for false positives, the service provider’s IPD to IPA ratio can be used to determine the role of the IP address in device identification.

Ratio of IPD to IPA # Service Providers # IP Addresses Under Management
Under 1.2 17,594 16,407,051
1.2 to 1.5 5,685 1,698,703
1.5 to 2 3,082 406,879
2 to 3 2,635 245,609
3 to 5 1,691 32,993
5 to 10 1,084 21,561
10 to 25 519 14,718
25 to 50 126 5,136
50 to 100 48 1,227
Over 100 25 766

SPbyClass

IPbyClass

Tags: , , , ,

Leave a Reply