SlideShare a Scribd company logo
IP Geolocation
Demystified
Understanding
IP Geolocation Technology
Let's start with..
What is an
IP Address?
An Internet Protocol address (IP address) is a numerical label assigned
to each device connected to a computer network. This numerical label
is used to identify these devices, allowing for direct communication.
The public internet operates with the same principles. When a device
connects to the internet it utilises a globally unique IP address to
ensure both inbound and outbound communication is delivered
correctly.
In this context, the IP address acts in a similar way to a postal address
used to deliver conventional mail. However, unlike the postal address,
an IP address does not have an intrinsic location and does not expose
any geographical properties. This is why you cannot determine the
location of a device by its IP address alone.
IP Geolocation is an essential technology that overcomes this
limitation to help organisations identify the location of their customers
based on their IP addresses.
Organisations such as online service operators, financial institutions,
search engines, ad agencies and any business offering an online
shopping/e-commerce experience are able to provide their customers
with the best products and services available in their region with IP
geolocation technology.
This IP Geolocation service is also crucial for preventing online fraud,
managing digital rights, and serving targeted marketing material and
pricing.
So..
What is an
IP Geolocation?
If you wonder where your online customers are coming from or wish to
customise your clients’ online experience based on their location, you are
likely familiar with various commercial IP Geolocation services, ranging
from free to highly-priced to enterprise-only. Most of these providers
declare superior accuracy, although show little transparency on the
methodology, and present scarce evidence to support their claimed
accuracy.
In general, validation of the accuracy of an IP Geolocation service is
challenging and requires a large pool of ground-truth data (i.e. vast
numbers of IP addresses from known locations). This data is generally
collected from all active ISPs/AS’ and is required to be random, spread over
various geographical regions. In reality, such data is generally not available,
in which case any claimed IP Geolocation accuracy without full
transparency is questionable.
For in-depth understanding, check our blog post:
How accurate can IP Geolocation get?
But..
How accurate is
IP Geolocation?
WHAT IS THE
ULTIMATE DATA
SOURCE?
For IP geolocation technology
26 Million
Link: https://www.iana.org/
Due to the global uniqueness requirement of IP
Addresses across both protocols, the global IP
address space allocation is heavily regulated.
51 Million
Let's understand..
How IP addresses are distributed.
The IPv4 protocol uses 32-bit addresses which makes
the maximum theoretical address space limited to
4,294,967,296 (2^32) IP addresses. IPv6, the next-
generation protocol, utilises 128-bit addresses which
makes the pool considerably larger, but still limited.
IANA – ‘The internet Assigned Numbers Authority is a function of
ICANN, a nonprofit private American corporation that oversees
global IP address allocation, autonomous system number
allocation, root zone management in the Domain Name System,
media types, and other internet Protocol-related symbols and
internet numbers.’ (source: Wikipedia).
AFRINIC for Africa Region
APNIC for Asia/Pacific Region
ARIN for Canada, USA, and some Caribbean Islands
LACNIC for Latin America and some Caribbean Islands
RIPE NCC for Europe, the Middle East, and Central
Asia
IANA is responsible for the allocation of large IP address
space blocks to the Regional Internet Registries (RIRs):
RIRs, in turn, delegate a portion of their allocated address
space to Local Internet Registries (LIRs), e.g. APNIC
delegates to the Japan Network Information Center
(JPNIC). All registries both regional and local allocate their
remaining available address space to organisations seeking
to utilise it on the public internet.
More about..
IANA Internet Assigned Numbers Authority
ARIN
RIPE NCC
AFRINIC APNIC
LACNIC
Illustration of IP
address distribution
Business entities (or autonomous networks) that are assigned IP
address space for their own use are called Autonomous Systems (AS).
They must first register as an AS, receiving a globally-unique
Autonomous System Number (ASN) which can then be used to identify
them.
The Internet Service Provider (ISP) is the most typical example of an
AS operator, but it is not the only one. Virtually, any organisation
seeking to use their own IP addresses on the internet qualifies as an
AS. It is a common occurrence that AS entities liberally use their
allocated IP space in any manner they wish, and more importantly, in
any geographical location they like. They can allocate it to any AS
entity/network within the same enterprise regardless of global
location or even sublease it to a completely unrelated, geographically
remote entity. Despite existing regulations, there is no way to restrict
allocated IP address space geographically.
Let's talk about..
Autonomous
Systems (AS)
Therefore, the only ultimately accurate IP Geolocation data is that
which is made available by AS operators, who are the only ones who
confidently know how and where their IP addresses are utilised. AS,
however, are not obliged to share their internal data with any other
entity, except for law enforcement agencies within the determined
jurisdiction boundaries.
Existing commercial IP Geolocation service providers do not have
access to AS internal data. Some of these service providers claim they
have integrated services with ISPs or receive data directly from ISPs.
Considering there is more than 80,000 registered ASs, of which more
than 60,000 are active at any one time (active ASNs ranked list), it is
largely impractical to form commercial relationships with all.
Receiving the data from a small number of local ISPs may improve
regional geolocation accuracy to a minor extent but is not sufficient on
a global scale.
The Ultimate Data
Source?
WHERE DO IP GEOLOCATION
SERVICE PROVIDERS GET
THEIR DATA?
Assuming that the existing IP Geolocation services do not have access to the
Autonomous Systems’ internal data, they cannot be confident regarding the
actual geographical location of the routable IP addresses.
So, where are they getting their geolocation data from?
1 2 3 4 5
WhoIs Data
WhoIs database is
nourished by Regional
and Local internet
Registry organisations
(RIR/LIR) that are
obligated to keep their
registration records
public.
BGP Data
The Border Gateway
Protocol (BGP) is a global
internet address routing
directory.
Field evidence
There are many
additional data sources
that can be utilised for IP
geolocation which qualify
as field evidence data.
Eg: data received from
user using GPS-enabled
device.
Scientific data
These are scientifically
derived data from
calculations such as time-
delay to distance
conversions and others.
Reverse DNS
The method is based on
DNS records (textual
name of the public
internet addresses).
IP Geolocation Data Sources
WhoIs Data
WhoIs is by far the most common source of geolocation data. WhoIs
database is nourished by Regional and Local internet Registry organisations
(RIR/LIR) that are obligated to keep their registration records public.
This information discloses all IP addresses registered for each entity they
belong to, including independent organisations or ISPs. IP Geolocation
service vendors can obtain this registry data using RIR websites and APIs or
can request bulk access to the data.
Example site:
What is WhoIs Data?
WhoIs Data
This data is usually updated on a daily basis and include a set of
registration data. This registration data contains the IP address block
records and which organisations they are registered under.
It may additionally contain a street address or the network location
coordinates, although none of the geographical properties is
mandatory.
Furthermore, these records are maintained by the registered party
and are not validated by any external body. This means the accuracy
of the data is questionable even when it is made available.
Screenshot of example WhoIs data from ARIN's website.
What data is available?
WhoIs Data
There are around 10 million records in the global WhoIs database for IPv4
alone, some of which can serve as a very accurate IP Geolocation source.
For example, a small internet Cafe with a static IP address (or a small range
of addresses) used on-premises and recorded into the RIR database inclusive
of its physical address. This scenario exposes accurate geolocation
information with a precision up to a street address. In most cases, when an
organisation reports incorrect or outdated information, or outsources the
registered address blocks to another party, the records will not reveal the IP
usage location.
Therefore, IP Geolocation based on WhoIs database only is largely inaccurate
as whole.
How accurate is WhoIs Data?
BGP Data
The Border Gateway Protocol (BGP) is a global internet address
routing directory. This is a standardised exterior gateway protocol
to exchange routing information amongst active Autonomous
Systems (AS) on the internet. BGP involves the announcement of
preferred pathways and direction of internet address blocks
(prefixes).
When an AS entity wishes to use an IP address range on the public
internet, it has to ‘announce’ it to the closest peers. In simple
words, it sends the announcement that means: “I’m responsible for
that range (prefix), so whoever wishes to communicate with a
device in that range, direct the communication through me”.
This announcement eventually propagates across all other peers
worldwide to inform them on how to send traffic to that IP address
range if required.
For instance, if I need to send a packet to destination ‘A’, but I only
know host ‘C’ and can forward traffic to it. The packet will still reach
the desired destination ‘A’ if ‘C’ knows ‘A’ either directly or via other
intermediate peers. In a nutshell, this is how global internet
connectivity works.
What is BGP Data?
A
B
C
BGP Data
Now, how this can be helpful for IP Geolocation? Firstly, unlike the
WhoIs data which shows the organisation registered against a
particular IP address block, BGP data can reveal who is actually
using it. This is not always the same enterprise entity as we
discussed above.
If, for example, we witness a block registered with ARIN for an
American company with a US street address, but is being used by
AS registered with RIPE in Turkey, this suggests that the IP block is
likely being used in Turkey, which improves geolocation. Secondly,
the BGP data can also reveal what addresses are not used at all, an
unannounced space, with which a geolocation process should not
even be attempted.
How is BGP Data used for IP
geolocation?
BGP Data
The IP address is not a physical object in a physical location. It is simply a
numerical label that can be allocated and unallocated from individual
devices or networks. There is no way we can geolocate a label that is not in
use (allocated). Therefore, when your IP Geolocation service provider
states it can geolocate 100% of the address space, please interpret this
with caution as it can only geolocate the announced (routable) space at
most. The routable space for IPv4 can be monitored on the IPv4 Address
Space Report.
Some other usages of BGP data rely on the assumption that IP addresses
belonging to certain prefixes are meant to share geographical proximity.
This, however, does not always hold true. Prefixes tend to aggregate along
the way and may include a cluster of several smaller prefixes that originate
from different regions.
Coverage of IP geolocation service?
eCommerce originated data sources/feeds, such as billing/shipping address of
the customer when combined with an IP address used for the transaction;
IoT devices with known locations and IP addresses and device pools, either
publicly available or proprietary, for example, the RIPE ATLAS project; and
voluntarily or commercially obtained geolocation data feeds such as Self-
published IP Geolocation Data.
There are many additional data sources that can be utilised for IP geolocation
which qualify as field evidence data. The best example is the data received directly
from users or submitted using GPS-enabled devices, such as mobile phones or
tablets. This data can reveal the alleged geographical coordinates of a device using
a public IP address and can serve as empirical evidence or ground-truth data for
that particular IP address at that particular moment in time.
Other sources include:
Field evidence data
What is Field Evidence Data?
Field evidence data
The data is always limited, as it is impractical for one entity
to access all internet-connected devices around the world.
This method identifies IP location at a specific point in time
only, and is prone to errors. Not everything can be trusted
as pure and reliable evidence. Device misconfiguration or
faults and network redirections such as VPN or PROXYs
along the way are some of many data inaccuracy scenarios
that can occur during the data collection process.
There are 2 important principles associated with the field
evidence IP Geolocation data:
Limitation of Field Evidence Data.
Scientific data
Over the years, many attempts have been made to introduce an
additional active measurement approach to IP Geolocation solutions.
Most of these approaches come from the research on time-delay to
distance conversions, such as triangulation, down to the closest point
of presence (POP) of network interfaces (routers).
However, global network traffic interfaces (public routers) are
complex, with the assumption that time-delay between two
consecutive interfaces is proportional to the physical distance
between them is incorrect.
What is Scientific Data?
Scientific data
Some large ISPs make their internal subnets hidden. Therefore, many
intermediate nodes are not publicly visible and cannot be accounted
for.
Practical network considerations are based on ‘least cost’ routing,
which is different from a common academic assumption of the shortest
one.
Due to Quality of Service (QoS) considerations, some network
interfaces can also be programmed to artificially delay non-productive
traffic.
Therefore, the relation between time-delay and distance is inconsistent and
cannot lay the foundation for overarching principles. To date, none of the
methods based on time-delay triangulation theory has been introduced
into the service and is unlikely to emerge for global commercial
implementation.
Limitations of Scientific Data.
The Domain Name System (DNS) is the phonebook of the internet. Usually, DNS is
used to translate a domain name to an IP address, so the browsers can load
Internet resources. However, it can also work in reverse order, you can query DNS
about what domain name record is attached to an IP address.
This textual record associated with an IP address is not mandatory. It is hardly of
any utility when the address is not involved in publishing internet services or
consumable material. However, some ISPs may use this textual tagging opportunity
to mark their IP addresses for some internal purposes.
Some of the DNS entries can be potentially used to reveal geographical properties.
For example, if the target address or the last router along the way is listed on
DNS as an entry: p1-0-0.sanjose1.br2.bbnplanet.net, it suggests that the IP
address is likely located in San Jose, California. This method shows an add-on
benefit for locating areas with interpretable DNS names.
Reverse DNS data
What is Reverse DNS data?
The only known commercially utilised scientific approach
has been introduced by Digital Envoy, Inc, protected by
US patent (6,757,740) granted in 2004. The method is
based on DNS records (textual name of the public
internet addresses) and crawling (tracert) to the closest
router in an attempt to identify the city and country of
the host.
Reverse DNS data
Many interfaces do not have an assigned DNS name;
The misnaming of an interface results in incorrect location;
City names can often be repetitive across different countries or
territories, i.e. San Jose City can also be found in both Costa Rica
and in California, US;
The lack of universally accepted rules and naming regulations
means records require manual processing, which is time-
consuming and prone to errors.
Unfortunately, the reverse DNS-based approach suffers from several
limitations:
1.
2.
3.
4.
Limitation of Reverse DNS data?
THE ART OF GUESSING
The IP Geolocation service providers can obtain their data from multiple
sources, although none can serve as an ultimate and undoubtable source of
truth. When data is mutually supportive, i.e. multiple sources indicate the very
same location for an IP address, it is a no brainer. Often, however, the data
received is very controversial, and this is where the tricky part lies.
We frequently hear people say that IP Geolocation is part science, part art.
Well, here is the art part. The art of guessing! Let’s try to see what your
average IP Geolocation service provider is dealing with.
Imagine we’ve got the field evidence, such as a user-submitted data sample,
suggesting that the IP address X.X.X.5 was used today somewhere in
Manhattan, in the centre of New York City, NY, US.
The WhoIs data for that address reveals that the block X.X.X.0 - X.X.X.255
(where the above-mentioned address belongs to) is registered for a business
‘Y’ located in Ontario, California, US.
The BGP data suggests that that address has been announced by an AS
entity ‘D’, registered as operated from Austin, Texas, US. And the prefix
size was /22 (1024 hosts).
So, where is the actual location? Can one say that X.X.X.0 - X.X.X.255
block is located in NY?
Or maybe even entire /22 prefix is in NY too?
Maybe the X.X.X.5 is the only one in NY and others are not even close?
Or maybe the sample data we’ve got is wrong and the actual location for all
is in Ontario, California or even Texas?
1.
2.
3.
The final conclusion depends on which data source can be trusted the most.
Considering there are limited tools to prioritise data sources, the existing IP
Geolocation service providers often end up guessing.
Their motto: Any guess is a good guess!
Challenges with..
IP Geolocation
26 Million
51 Million
Further analysis
If we happen to obtain more evidence data points from nearby address entries,
it would likely improve our confidence, but only if the data support one of the
leading guess options.
However, if the data is controversial, it can make geolocation estimation
extremely challenging. What if we have further evidence from address
X.X.X.128 from Toronto, Canada, dated just a couple of days before? Would
this address have moved from Canada to the US recently or just a part of the
block or are we facing an error somewhere?
This is another complex issue – data granulation. IP addresses are usually
deployed in blocks. The larger blocks are better for global routing. If blocks
are too small, the world’s routing table substantially expands and the routers
can eventually face memory overflow errors. Therefore, IP Geolocation
services can logically assume that some consecutive sequences of IP addresses
are likely to share reasonable geographical proximity.
Challenges with data sources
26 Million
51 Million
Further analysis
DNS entries are not always available or can be wrong.
Traceroute does not always reveal all the hosts in the delivery path, as some are
simply do not respond to ICMP requests.
Perfect host correlation is not always possible, as network routers often use several IP
address ports for the same router device. They may appear different on a traceroute
but in reality are the same, which may also lead to an error.
However, defining the actual block IP address boundaries can be tricky and often involves
a series of educated guesses which may require intervention from the human operators.
For example, one can find similarities in the reverse DNS entries for the block member
addresses that possibly suggest the same network. Also, IP addresses can be tracerouted
while looking for correlations between the host IP addresses that participate in the packet
delivery.
Whichever way is chosen, it is commonly prone to errors.
Complexity of reducing errors
SO HOW DO IP
GEOLOCATION SERVICE
PROVIDERS OPERATE?
Let's understand how various IP geolocation service providers work.
Entry Level
Advanced Level
Comparison
The entry-level IP geolocation providers are likely to use fewer data
sources, largely using WhoIs data only, which limits their decision scope
to much fewer options. This makes the process easier and maybe
faster, but as a trade-off, it is much less accurate.
The more advanced IP Geolocation providers presumably work hard to
organise and improve their results by delegating many of the final
decisions to a human personal, in addition to some low-level automated
process. Unfortunately, manual work does not guarantee better results,
as humans are also prone to errors, and definitely makes the process
slower. As a result, we often see commercial IP geolocation databases
updated on a monthly basis only, or weekly as the best.
26 Million
The IP address space is a very dynamic area. Millions of IP addresses changing hands or are
reallocated continuously every hour. Therefore monthly or weekly updates are certainly not
suitable for most IP geolocation applications.
In summary, none of the currently existing methods is sufficiently accurate. Even though a
combination of methods allows for more precise estimation of IP location, this does not solve
the problem of accuracy on a global scale.
Moreover, the lack of a fully automated and deterministic methodology prevents existing IP
geolocation databases from being updated frequently enough to cope with the highly dynamic
nature of the internet IP address space.
Conclusion
To find out how BigDataCloud's IP Geolocation service differs from existing providers,
check out our detailed blog post: The Next Generation IP Geolocation Service.
Contact us
Reach out if you have any
questions or clarifications
support@bigdatacloud.com
Email Address
www.bigdatacloud.com
Website
For more content related to
IP geolocation, visit our
website.
BigDataCloud Pty Ltd is a highly innovative start-up
company founded in 2018 and operated
internationally from our headquarters in Adelaide,
South Australia. After years of previous experience
in e-commerce, fraud protection and targeted
international marketing, the BigDataCloud founders
identified an immense lack of high quality, fast and
affordable APIs within this and other technical
industries.
For more info, visit: www.bigdatacloud.com

More Related Content

What's hot

IPV6 ADDRESS
IPV6 ADDRESSIPV6 ADDRESS
IPV6 ADDRESS
Jothi Lakshmi
 
Wireless Communication Technologies (WIFI ,Bluetooth , NFC )
Wireless Communication Technologies (WIFI ,Bluetooth , NFC )Wireless Communication Technologies (WIFI ,Bluetooth , NFC )
Wireless Communication Technologies (WIFI ,Bluetooth , NFC )
layth jasim
 
MPLS - Multiprotocol Label Switching
MPLS - Multiprotocol Label SwitchingMPLS - Multiprotocol Label Switching
MPLS - Multiprotocol Label Switching
Peter R. Egli
 
IPv4 VS IPv6
IPv4 VS IPv6IPv4 VS IPv6
IPv4 VS IPv6
Humayra Khanum
 
GTP Overview
GTP OverviewGTP Overview
GTP Overview
aliirfan04
 
Hot standby router protocol (hsrp) using
Hot standby router protocol (hsrp) usingHot standby router protocol (hsrp) using
Hot standby router protocol (hsrp) using
ShubhiGupta94
 
Internet as the network infrastructure
Internet as the  network infrastructureInternet as the  network infrastructure
Internet as the network infrastructure
mohanapragashkec
 
Report on 4g Wireless Communication
Report on 4g Wireless CommunicationReport on 4g Wireless Communication
Report on 4g Wireless Communication
Shubham Roy
 
WiFi 7 Training, Improved Latency, Introduction to 802.11be
WiFi 7 Training, Improved Latency, Introduction to 802.11beWiFi 7 Training, Improved Latency, Introduction to 802.11be
WiFi 7 Training, Improved Latency, Introduction to 802.11be
Bryan Len
 
Ip address
Ip address Ip address
Ip address
Yash Bansal
 
Introduction to Wi-Fi Direct
Introduction to Wi-Fi DirectIntroduction to Wi-Fi Direct
Introduction to Wi-Fi DirectWei-Tsung Su
 
Network Design
Network DesignNetwork Design
Network Designcornimar
 
Worldwide attacks on SS7 network
Worldwide attacks on SS7 networkWorldwide attacks on SS7 network
Worldwide attacks on SS7 network
Alexandre De Oliveira
 
CommScope RUCKUS ICX Switching Configuration
CommScope RUCKUS ICX Switching ConfigurationCommScope RUCKUS ICX Switching Configuration
CommScope RUCKUS ICX Switching Configuration
Carla Nadin
 
Juniper MPLS Tutorial by Soricelli
Juniper MPLS Tutorial by SoricelliJuniper MPLS Tutorial by Soricelli
Juniper MPLS Tutorial by SoricelliFebrian ‎
 
Nat pat
Nat patNat pat
Software Defined Networking - Huawei, June 2017
Software Defined Networking - Huawei, June 2017Software Defined Networking - Huawei, June 2017
Software Defined Networking - Huawei, June 2017
Novosco
 
Lorawan: What you need to know
Lorawan: What you need to knowLorawan: What you need to know
Lorawan: What you need to know
Paul Coomans
 
The_SIMbox_fraud_survey.pdf
The_SIMbox_fraud_survey.pdfThe_SIMbox_fraud_survey.pdf
The_SIMbox_fraud_survey.pdf
manthanpandit1
 

What's hot (20)

IPV6 ADDRESS
IPV6 ADDRESSIPV6 ADDRESS
IPV6 ADDRESS
 
Wireless Communication Technologies (WIFI ,Bluetooth , NFC )
Wireless Communication Technologies (WIFI ,Bluetooth , NFC )Wireless Communication Technologies (WIFI ,Bluetooth , NFC )
Wireless Communication Technologies (WIFI ,Bluetooth , NFC )
 
Btcl
BtclBtcl
Btcl
 
MPLS - Multiprotocol Label Switching
MPLS - Multiprotocol Label SwitchingMPLS - Multiprotocol Label Switching
MPLS - Multiprotocol Label Switching
 
IPv4 VS IPv6
IPv4 VS IPv6IPv4 VS IPv6
IPv4 VS IPv6
 
GTP Overview
GTP OverviewGTP Overview
GTP Overview
 
Hot standby router protocol (hsrp) using
Hot standby router protocol (hsrp) usingHot standby router protocol (hsrp) using
Hot standby router protocol (hsrp) using
 
Internet as the network infrastructure
Internet as the  network infrastructureInternet as the  network infrastructure
Internet as the network infrastructure
 
Report on 4g Wireless Communication
Report on 4g Wireless CommunicationReport on 4g Wireless Communication
Report on 4g Wireless Communication
 
WiFi 7 Training, Improved Latency, Introduction to 802.11be
WiFi 7 Training, Improved Latency, Introduction to 802.11beWiFi 7 Training, Improved Latency, Introduction to 802.11be
WiFi 7 Training, Improved Latency, Introduction to 802.11be
 
Ip address
Ip address Ip address
Ip address
 
Introduction to Wi-Fi Direct
Introduction to Wi-Fi DirectIntroduction to Wi-Fi Direct
Introduction to Wi-Fi Direct
 
Network Design
Network DesignNetwork Design
Network Design
 
Worldwide attacks on SS7 network
Worldwide attacks on SS7 networkWorldwide attacks on SS7 network
Worldwide attacks on SS7 network
 
CommScope RUCKUS ICX Switching Configuration
CommScope RUCKUS ICX Switching ConfigurationCommScope RUCKUS ICX Switching Configuration
CommScope RUCKUS ICX Switching Configuration
 
Juniper MPLS Tutorial by Soricelli
Juniper MPLS Tutorial by SoricelliJuniper MPLS Tutorial by Soricelli
Juniper MPLS Tutorial by Soricelli
 
Nat pat
Nat patNat pat
Nat pat
 
Software Defined Networking - Huawei, June 2017
Software Defined Networking - Huawei, June 2017Software Defined Networking - Huawei, June 2017
Software Defined Networking - Huawei, June 2017
 
Lorawan: What you need to know
Lorawan: What you need to knowLorawan: What you need to know
Lorawan: What you need to know
 
The_SIMbox_fraud_survey.pdf
The_SIMbox_fraud_survey.pdfThe_SIMbox_fraud_survey.pdf
The_SIMbox_fraud_survey.pdf
 

Similar to IP Geolocation Demystified

The Connection Between Geolocation and IPv4 Addresses | How to Get IPv4 | IPv...
The Connection Between Geolocation and IPv4 Addresses | How to Get IPv4 | IPv...The Connection Between Geolocation and IPv4 Addresses | How to Get IPv4 | IPv...
The Connection Between Geolocation and IPv4 Addresses | How to Get IPv4 | IPv...
IPv4Mall
 
Unlocking the Power of Data: ipinfo.info's Revolutionary Approach to IP Addre...
Unlocking the Power of Data: ipinfo.info's Revolutionary Approach to IP Addre...Unlocking the Power of Data: ipinfo.info's Revolutionary Approach to IP Addre...
Unlocking the Power of Data: ipinfo.info's Revolutionary Approach to IP Addre...
johnmillerus7
 
A Comprehensive Guide Unlocking the Power of IP Geolocation API 2024.pdf
A Comprehensive Guide Unlocking the Power of IP Geolocation API 2024.pdfA Comprehensive Guide Unlocking the Power of IP Geolocation API 2024.pdf
A Comprehensive Guide Unlocking the Power of IP Geolocation API 2024.pdf
mirza1naeem
 
Computer network lab note 2
Computer network lab note 2Computer network lab note 2
Computer network lab note 2
MDHASNAIN23
 
Finding An IP Address on Your Network | IP Address Buyer | How to Sell IP Add...
Finding An IP Address on Your Network | IP Address Buyer | How to Sell IP Add...Finding An IP Address on Your Network | IP Address Buyer | How to Sell IP Add...
Finding An IP Address on Your Network | IP Address Buyer | How to Sell IP Add...
IPv4Mall
 
IRJET- Personal Assistant for Visually Impaired People in Malls
IRJET-  	  Personal Assistant for Visually Impaired People in MallsIRJET-  	  Personal Assistant for Visually Impaired People in Malls
IRJET- Personal Assistant for Visually Impaired People in Malls
IRJET Journal
 
ISP AND ITS ARCHITECTURE.
ISP AND ITS ARCHITECTURE.ISP AND ITS ARCHITECTURE.
ISP AND ITS ARCHITECTURE.
university of Gujrat, pakistan
 
Tracing An IP Address or Domain Name by Raghu Khimani
Tracing An IP Address or Domain Name by Raghu KhimaniTracing An IP Address or Domain Name by Raghu Khimani
Tracing An IP Address or Domain Name by Raghu Khimani
Dr Raghu Khimani
 
Why IP Address Management is Necessary | How to Get IPv4 Address
Why IP Address Management is Necessary | How to Get IPv4 AddressWhy IP Address Management is Necessary | How to Get IPv4 Address
Why IP Address Management is Necessary | How to Get IPv4 Address
IPv4Mall
 
Mobile IP
Mobile IP Mobile IP
Mobile IP
Er. Rahul Jain
 
Nat cisco
Nat ciscoNat cisco
Nat cisco
moonmanik
 
Network addresses.pptx
Network addresses.pptxNetwork addresses.pptx
Network addresses.pptx
CherryLim21
 
1+1=3 Combining IP Intelligence and Mobile Network Location for Authentication
1+1=3 Combining IP Intelligence and Mobile Network Location for Authentication1+1=3 Combining IP Intelligence and Mobile Network Location for Authentication
1+1=3 Combining IP Intelligence and Mobile Network Location for AuthenticationLocaid Technologies
 
Computer Networks Slide.pptx
Computer Networks Slide.pptxComputer Networks Slide.pptx
Computer Networks Slide.pptx
emsResulzade1
 
Leantegra Real-Time Location System
Leantegra Real-Time Location SystemLeantegra Real-Time Location System
Leantegra Real-Time Location System
Julia Danby
 
How to know my ip address with iplogger.org
How to know my ip address with iplogger.orgHow to know my ip address with iplogger.org
How to know my ip address with iplogger.org
iploggers
 
The Registration and Allocation of IPv4 Addresses | Buy IPv4 Address Block | ...
The Registration and Allocation of IPv4 Addresses | Buy IPv4 Address Block | ...The Registration and Allocation of IPv4 Addresses | Buy IPv4 Address Block | ...
The Registration and Allocation of IPv4 Addresses | Buy IPv4 Address Block | ...
IPv4Mall
 
Wp passpoint wi-fi
Wp passpoint wi-fiWp passpoint wi-fi
Wp passpoint wi-fi
Samuel K. Itotia
 

Similar to IP Geolocation Demystified (20)

The Connection Between Geolocation and IPv4 Addresses | How to Get IPv4 | IPv...
The Connection Between Geolocation and IPv4 Addresses | How to Get IPv4 | IPv...The Connection Between Geolocation and IPv4 Addresses | How to Get IPv4 | IPv...
The Connection Between Geolocation and IPv4 Addresses | How to Get IPv4 | IPv...
 
Unlocking the Power of Data: ipinfo.info's Revolutionary Approach to IP Addre...
Unlocking the Power of Data: ipinfo.info's Revolutionary Approach to IP Addre...Unlocking the Power of Data: ipinfo.info's Revolutionary Approach to IP Addre...
Unlocking the Power of Data: ipinfo.info's Revolutionary Approach to IP Addre...
 
A Comprehensive Guide Unlocking the Power of IP Geolocation API 2024.pdf
A Comprehensive Guide Unlocking the Power of IP Geolocation API 2024.pdfA Comprehensive Guide Unlocking the Power of IP Geolocation API 2024.pdf
A Comprehensive Guide Unlocking the Power of IP Geolocation API 2024.pdf
 
Computer network lab note 2
Computer network lab note 2Computer network lab note 2
Computer network lab note 2
 
Finding An IP Address on Your Network | IP Address Buyer | How to Sell IP Add...
Finding An IP Address on Your Network | IP Address Buyer | How to Sell IP Add...Finding An IP Address on Your Network | IP Address Buyer | How to Sell IP Add...
Finding An IP Address on Your Network | IP Address Buyer | How to Sell IP Add...
 
IRJET- Personal Assistant for Visually Impaired People in Malls
IRJET-  	  Personal Assistant for Visually Impaired People in MallsIRJET-  	  Personal Assistant for Visually Impaired People in Malls
IRJET- Personal Assistant for Visually Impaired People in Malls
 
ISP AND ITS ARCHITECTURE.
ISP AND ITS ARCHITECTURE.ISP AND ITS ARCHITECTURE.
ISP AND ITS ARCHITECTURE.
 
Tracing An IP Address or Domain Name by Raghu Khimani
Tracing An IP Address or Domain Name by Raghu KhimaniTracing An IP Address or Domain Name by Raghu Khimani
Tracing An IP Address or Domain Name by Raghu Khimani
 
Why IP Address Management is Necessary | How to Get IPv4 Address
Why IP Address Management is Necessary | How to Get IPv4 AddressWhy IP Address Management is Necessary | How to Get IPv4 Address
Why IP Address Management is Necessary | How to Get IPv4 Address
 
Mobile IP
Mobile IP Mobile IP
Mobile IP
 
Ipadd mngt
Ipadd mngtIpadd mngt
Ipadd mngt
 
Nat cisco
Nat ciscoNat cisco
Nat cisco
 
Network addresses.pptx
Network addresses.pptxNetwork addresses.pptx
Network addresses.pptx
 
1+1=3 Combining IP Intelligence and Mobile Network Location for Authentication
1+1=3 Combining IP Intelligence and Mobile Network Location for Authentication1+1=3 Combining IP Intelligence and Mobile Network Location for Authentication
1+1=3 Combining IP Intelligence and Mobile Network Location for Authentication
 
CCCNP ROUTE v6_ch06
CCCNP ROUTE v6_ch06CCCNP ROUTE v6_ch06
CCCNP ROUTE v6_ch06
 
Computer Networks Slide.pptx
Computer Networks Slide.pptxComputer Networks Slide.pptx
Computer Networks Slide.pptx
 
Leantegra Real-Time Location System
Leantegra Real-Time Location SystemLeantegra Real-Time Location System
Leantegra Real-Time Location System
 
How to know my ip address with iplogger.org
How to know my ip address with iplogger.orgHow to know my ip address with iplogger.org
How to know my ip address with iplogger.org
 
The Registration and Allocation of IPv4 Addresses | Buy IPv4 Address Block | ...
The Registration and Allocation of IPv4 Addresses | Buy IPv4 Address Block | ...The Registration and Allocation of IPv4 Addresses | Buy IPv4 Address Block | ...
The Registration and Allocation of IPv4 Addresses | Buy IPv4 Address Block | ...
 
Wp passpoint wi-fi
Wp passpoint wi-fiWp passpoint wi-fi
Wp passpoint wi-fi
 

Recently uploaded

FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
Elena Simperl
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
Product School
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
Alison B. Lowndes
 
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Jeffrey Haguewood
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
ThousandEyes
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Product School
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
DanBrown980551
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
Elena Simperl
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
Paul Groth
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Product School
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
Dorra BARTAGUIZ
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
Product School
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
Frank van Harmelen
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 

Recently uploaded (20)

FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdfFIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
FIDO Alliance Osaka Seminar: Passkeys at Amazon.pdf
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
 
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
 
Assuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyesAssuring Contact Center Experiences for Your Customers With ThousandEyes
Assuring Contact Center Experiences for Your Customers With ThousandEyes
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...
 
When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...When stars align: studies in data quality, knowledge graphs, and machine lear...
When stars align: studies in data quality, knowledge graphs, and machine lear...
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*Neuro-symbolic is not enough, we need neuro-*semantic*
Neuro-symbolic is not enough, we need neuro-*semantic*
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 

IP Geolocation Demystified

  • 2. Let's start with.. What is an IP Address? An Internet Protocol address (IP address) is a numerical label assigned to each device connected to a computer network. This numerical label is used to identify these devices, allowing for direct communication. The public internet operates with the same principles. When a device connects to the internet it utilises a globally unique IP address to ensure both inbound and outbound communication is delivered correctly. In this context, the IP address acts in a similar way to a postal address used to deliver conventional mail. However, unlike the postal address, an IP address does not have an intrinsic location and does not expose any geographical properties. This is why you cannot determine the location of a device by its IP address alone.
  • 3. IP Geolocation is an essential technology that overcomes this limitation to help organisations identify the location of their customers based on their IP addresses. Organisations such as online service operators, financial institutions, search engines, ad agencies and any business offering an online shopping/e-commerce experience are able to provide their customers with the best products and services available in their region with IP geolocation technology. This IP Geolocation service is also crucial for preventing online fraud, managing digital rights, and serving targeted marketing material and pricing. So.. What is an IP Geolocation?
  • 4. If you wonder where your online customers are coming from or wish to customise your clients’ online experience based on their location, you are likely familiar with various commercial IP Geolocation services, ranging from free to highly-priced to enterprise-only. Most of these providers declare superior accuracy, although show little transparency on the methodology, and present scarce evidence to support their claimed accuracy. In general, validation of the accuracy of an IP Geolocation service is challenging and requires a large pool of ground-truth data (i.e. vast numbers of IP addresses from known locations). This data is generally collected from all active ISPs/AS’ and is required to be random, spread over various geographical regions. In reality, such data is generally not available, in which case any claimed IP Geolocation accuracy without full transparency is questionable. For in-depth understanding, check our blog post: How accurate can IP Geolocation get? But.. How accurate is IP Geolocation?
  • 5. WHAT IS THE ULTIMATE DATA SOURCE? For IP geolocation technology
  • 6. 26 Million Link: https://www.iana.org/ Due to the global uniqueness requirement of IP Addresses across both protocols, the global IP address space allocation is heavily regulated. 51 Million Let's understand.. How IP addresses are distributed. The IPv4 protocol uses 32-bit addresses which makes the maximum theoretical address space limited to 4,294,967,296 (2^32) IP addresses. IPv6, the next- generation protocol, utilises 128-bit addresses which makes the pool considerably larger, but still limited. IANA – ‘The internet Assigned Numbers Authority is a function of ICANN, a nonprofit private American corporation that oversees global IP address allocation, autonomous system number allocation, root zone management in the Domain Name System, media types, and other internet Protocol-related symbols and internet numbers.’ (source: Wikipedia).
  • 7. AFRINIC for Africa Region APNIC for Asia/Pacific Region ARIN for Canada, USA, and some Caribbean Islands LACNIC for Latin America and some Caribbean Islands RIPE NCC for Europe, the Middle East, and Central Asia IANA is responsible for the allocation of large IP address space blocks to the Regional Internet Registries (RIRs): RIRs, in turn, delegate a portion of their allocated address space to Local Internet Registries (LIRs), e.g. APNIC delegates to the Japan Network Information Center (JPNIC). All registries both regional and local allocate their remaining available address space to organisations seeking to utilise it on the public internet. More about.. IANA Internet Assigned Numbers Authority ARIN RIPE NCC AFRINIC APNIC LACNIC
  • 9. Business entities (or autonomous networks) that are assigned IP address space for their own use are called Autonomous Systems (AS). They must first register as an AS, receiving a globally-unique Autonomous System Number (ASN) which can then be used to identify them. The Internet Service Provider (ISP) is the most typical example of an AS operator, but it is not the only one. Virtually, any organisation seeking to use their own IP addresses on the internet qualifies as an AS. It is a common occurrence that AS entities liberally use their allocated IP space in any manner they wish, and more importantly, in any geographical location they like. They can allocate it to any AS entity/network within the same enterprise regardless of global location or even sublease it to a completely unrelated, geographically remote entity. Despite existing regulations, there is no way to restrict allocated IP address space geographically. Let's talk about.. Autonomous Systems (AS)
  • 10. Therefore, the only ultimately accurate IP Geolocation data is that which is made available by AS operators, who are the only ones who confidently know how and where their IP addresses are utilised. AS, however, are not obliged to share their internal data with any other entity, except for law enforcement agencies within the determined jurisdiction boundaries. Existing commercial IP Geolocation service providers do not have access to AS internal data. Some of these service providers claim they have integrated services with ISPs or receive data directly from ISPs. Considering there is more than 80,000 registered ASs, of which more than 60,000 are active at any one time (active ASNs ranked list), it is largely impractical to form commercial relationships with all. Receiving the data from a small number of local ISPs may improve regional geolocation accuracy to a minor extent but is not sufficient on a global scale. The Ultimate Data Source?
  • 11. WHERE DO IP GEOLOCATION SERVICE PROVIDERS GET THEIR DATA? Assuming that the existing IP Geolocation services do not have access to the Autonomous Systems’ internal data, they cannot be confident regarding the actual geographical location of the routable IP addresses. So, where are they getting their geolocation data from?
  • 12. 1 2 3 4 5 WhoIs Data WhoIs database is nourished by Regional and Local internet Registry organisations (RIR/LIR) that are obligated to keep their registration records public. BGP Data The Border Gateway Protocol (BGP) is a global internet address routing directory. Field evidence There are many additional data sources that can be utilised for IP geolocation which qualify as field evidence data. Eg: data received from user using GPS-enabled device. Scientific data These are scientifically derived data from calculations such as time- delay to distance conversions and others. Reverse DNS The method is based on DNS records (textual name of the public internet addresses). IP Geolocation Data Sources
  • 13. WhoIs Data WhoIs is by far the most common source of geolocation data. WhoIs database is nourished by Regional and Local internet Registry organisations (RIR/LIR) that are obligated to keep their registration records public. This information discloses all IP addresses registered for each entity they belong to, including independent organisations or ISPs. IP Geolocation service vendors can obtain this registry data using RIR websites and APIs or can request bulk access to the data. Example site: What is WhoIs Data?
  • 14. WhoIs Data This data is usually updated on a daily basis and include a set of registration data. This registration data contains the IP address block records and which organisations they are registered under. It may additionally contain a street address or the network location coordinates, although none of the geographical properties is mandatory. Furthermore, these records are maintained by the registered party and are not validated by any external body. This means the accuracy of the data is questionable even when it is made available. Screenshot of example WhoIs data from ARIN's website. What data is available?
  • 15. WhoIs Data There are around 10 million records in the global WhoIs database for IPv4 alone, some of which can serve as a very accurate IP Geolocation source. For example, a small internet Cafe with a static IP address (or a small range of addresses) used on-premises and recorded into the RIR database inclusive of its physical address. This scenario exposes accurate geolocation information with a precision up to a street address. In most cases, when an organisation reports incorrect or outdated information, or outsources the registered address blocks to another party, the records will not reveal the IP usage location. Therefore, IP Geolocation based on WhoIs database only is largely inaccurate as whole. How accurate is WhoIs Data?
  • 16. BGP Data The Border Gateway Protocol (BGP) is a global internet address routing directory. This is a standardised exterior gateway protocol to exchange routing information amongst active Autonomous Systems (AS) on the internet. BGP involves the announcement of preferred pathways and direction of internet address blocks (prefixes). When an AS entity wishes to use an IP address range on the public internet, it has to ‘announce’ it to the closest peers. In simple words, it sends the announcement that means: “I’m responsible for that range (prefix), so whoever wishes to communicate with a device in that range, direct the communication through me”. This announcement eventually propagates across all other peers worldwide to inform them on how to send traffic to that IP address range if required. For instance, if I need to send a packet to destination ‘A’, but I only know host ‘C’ and can forward traffic to it. The packet will still reach the desired destination ‘A’ if ‘C’ knows ‘A’ either directly or via other intermediate peers. In a nutshell, this is how global internet connectivity works. What is BGP Data? A B C
  • 17. BGP Data Now, how this can be helpful for IP Geolocation? Firstly, unlike the WhoIs data which shows the organisation registered against a particular IP address block, BGP data can reveal who is actually using it. This is not always the same enterprise entity as we discussed above. If, for example, we witness a block registered with ARIN for an American company with a US street address, but is being used by AS registered with RIPE in Turkey, this suggests that the IP block is likely being used in Turkey, which improves geolocation. Secondly, the BGP data can also reveal what addresses are not used at all, an unannounced space, with which a geolocation process should not even be attempted. How is BGP Data used for IP geolocation?
  • 18. BGP Data The IP address is not a physical object in a physical location. It is simply a numerical label that can be allocated and unallocated from individual devices or networks. There is no way we can geolocate a label that is not in use (allocated). Therefore, when your IP Geolocation service provider states it can geolocate 100% of the address space, please interpret this with caution as it can only geolocate the announced (routable) space at most. The routable space for IPv4 can be monitored on the IPv4 Address Space Report. Some other usages of BGP data rely on the assumption that IP addresses belonging to certain prefixes are meant to share geographical proximity. This, however, does not always hold true. Prefixes tend to aggregate along the way and may include a cluster of several smaller prefixes that originate from different regions. Coverage of IP geolocation service?
  • 19. eCommerce originated data sources/feeds, such as billing/shipping address of the customer when combined with an IP address used for the transaction; IoT devices with known locations and IP addresses and device pools, either publicly available or proprietary, for example, the RIPE ATLAS project; and voluntarily or commercially obtained geolocation data feeds such as Self- published IP Geolocation Data. There are many additional data sources that can be utilised for IP geolocation which qualify as field evidence data. The best example is the data received directly from users or submitted using GPS-enabled devices, such as mobile phones or tablets. This data can reveal the alleged geographical coordinates of a device using a public IP address and can serve as empirical evidence or ground-truth data for that particular IP address at that particular moment in time. Other sources include: Field evidence data What is Field Evidence Data?
  • 20. Field evidence data The data is always limited, as it is impractical for one entity to access all internet-connected devices around the world. This method identifies IP location at a specific point in time only, and is prone to errors. Not everything can be trusted as pure and reliable evidence. Device misconfiguration or faults and network redirections such as VPN or PROXYs along the way are some of many data inaccuracy scenarios that can occur during the data collection process. There are 2 important principles associated with the field evidence IP Geolocation data: Limitation of Field Evidence Data.
  • 21. Scientific data Over the years, many attempts have been made to introduce an additional active measurement approach to IP Geolocation solutions. Most of these approaches come from the research on time-delay to distance conversions, such as triangulation, down to the closest point of presence (POP) of network interfaces (routers). However, global network traffic interfaces (public routers) are complex, with the assumption that time-delay between two consecutive interfaces is proportional to the physical distance between them is incorrect. What is Scientific Data?
  • 22. Scientific data Some large ISPs make their internal subnets hidden. Therefore, many intermediate nodes are not publicly visible and cannot be accounted for. Practical network considerations are based on ‘least cost’ routing, which is different from a common academic assumption of the shortest one. Due to Quality of Service (QoS) considerations, some network interfaces can also be programmed to artificially delay non-productive traffic. Therefore, the relation between time-delay and distance is inconsistent and cannot lay the foundation for overarching principles. To date, none of the methods based on time-delay triangulation theory has been introduced into the service and is unlikely to emerge for global commercial implementation. Limitations of Scientific Data.
  • 23. The Domain Name System (DNS) is the phonebook of the internet. Usually, DNS is used to translate a domain name to an IP address, so the browsers can load Internet resources. However, it can also work in reverse order, you can query DNS about what domain name record is attached to an IP address. This textual record associated with an IP address is not mandatory. It is hardly of any utility when the address is not involved in publishing internet services or consumable material. However, some ISPs may use this textual tagging opportunity to mark their IP addresses for some internal purposes. Some of the DNS entries can be potentially used to reveal geographical properties. For example, if the target address or the last router along the way is listed on DNS as an entry: p1-0-0.sanjose1.br2.bbnplanet.net, it suggests that the IP address is likely located in San Jose, California. This method shows an add-on benefit for locating areas with interpretable DNS names. Reverse DNS data What is Reverse DNS data? The only known commercially utilised scientific approach has been introduced by Digital Envoy, Inc, protected by US patent (6,757,740) granted in 2004. The method is based on DNS records (textual name of the public internet addresses) and crawling (tracert) to the closest router in an attempt to identify the city and country of the host.
  • 24. Reverse DNS data Many interfaces do not have an assigned DNS name; The misnaming of an interface results in incorrect location; City names can often be repetitive across different countries or territories, i.e. San Jose City can also be found in both Costa Rica and in California, US; The lack of universally accepted rules and naming regulations means records require manual processing, which is time- consuming and prone to errors. Unfortunately, the reverse DNS-based approach suffers from several limitations: 1. 2. 3. 4. Limitation of Reverse DNS data?
  • 25. THE ART OF GUESSING The IP Geolocation service providers can obtain their data from multiple sources, although none can serve as an ultimate and undoubtable source of truth. When data is mutually supportive, i.e. multiple sources indicate the very same location for an IP address, it is a no brainer. Often, however, the data received is very controversial, and this is where the tricky part lies. We frequently hear people say that IP Geolocation is part science, part art. Well, here is the art part. The art of guessing! Let’s try to see what your average IP Geolocation service provider is dealing with.
  • 26. Imagine we’ve got the field evidence, such as a user-submitted data sample, suggesting that the IP address X.X.X.5 was used today somewhere in Manhattan, in the centre of New York City, NY, US. The WhoIs data for that address reveals that the block X.X.X.0 - X.X.X.255 (where the above-mentioned address belongs to) is registered for a business ‘Y’ located in Ontario, California, US. The BGP data suggests that that address has been announced by an AS entity ‘D’, registered as operated from Austin, Texas, US. And the prefix size was /22 (1024 hosts). So, where is the actual location? Can one say that X.X.X.0 - X.X.X.255 block is located in NY? Or maybe even entire /22 prefix is in NY too? Maybe the X.X.X.5 is the only one in NY and others are not even close? Or maybe the sample data we’ve got is wrong and the actual location for all is in Ontario, California or even Texas? 1. 2. 3. The final conclusion depends on which data source can be trusted the most. Considering there are limited tools to prioritise data sources, the existing IP Geolocation service providers often end up guessing. Their motto: Any guess is a good guess! Challenges with.. IP Geolocation
  • 27. 26 Million 51 Million Further analysis If we happen to obtain more evidence data points from nearby address entries, it would likely improve our confidence, but only if the data support one of the leading guess options. However, if the data is controversial, it can make geolocation estimation extremely challenging. What if we have further evidence from address X.X.X.128 from Toronto, Canada, dated just a couple of days before? Would this address have moved from Canada to the US recently or just a part of the block or are we facing an error somewhere? This is another complex issue – data granulation. IP addresses are usually deployed in blocks. The larger blocks are better for global routing. If blocks are too small, the world’s routing table substantially expands and the routers can eventually face memory overflow errors. Therefore, IP Geolocation services can logically assume that some consecutive sequences of IP addresses are likely to share reasonable geographical proximity. Challenges with data sources
  • 28. 26 Million 51 Million Further analysis DNS entries are not always available or can be wrong. Traceroute does not always reveal all the hosts in the delivery path, as some are simply do not respond to ICMP requests. Perfect host correlation is not always possible, as network routers often use several IP address ports for the same router device. They may appear different on a traceroute but in reality are the same, which may also lead to an error. However, defining the actual block IP address boundaries can be tricky and often involves a series of educated guesses which may require intervention from the human operators. For example, one can find similarities in the reverse DNS entries for the block member addresses that possibly suggest the same network. Also, IP addresses can be tracerouted while looking for correlations between the host IP addresses that participate in the packet delivery. Whichever way is chosen, it is commonly prone to errors. Complexity of reducing errors
  • 29. SO HOW DO IP GEOLOCATION SERVICE PROVIDERS OPERATE? Let's understand how various IP geolocation service providers work.
  • 30. Entry Level Advanced Level Comparison The entry-level IP geolocation providers are likely to use fewer data sources, largely using WhoIs data only, which limits their decision scope to much fewer options. This makes the process easier and maybe faster, but as a trade-off, it is much less accurate. The more advanced IP Geolocation providers presumably work hard to organise and improve their results by delegating many of the final decisions to a human personal, in addition to some low-level automated process. Unfortunately, manual work does not guarantee better results, as humans are also prone to errors, and definitely makes the process slower. As a result, we often see commercial IP geolocation databases updated on a monthly basis only, or weekly as the best.
  • 31. 26 Million The IP address space is a very dynamic area. Millions of IP addresses changing hands or are reallocated continuously every hour. Therefore monthly or weekly updates are certainly not suitable for most IP geolocation applications. In summary, none of the currently existing methods is sufficiently accurate. Even though a combination of methods allows for more precise estimation of IP location, this does not solve the problem of accuracy on a global scale. Moreover, the lack of a fully automated and deterministic methodology prevents existing IP geolocation databases from being updated frequently enough to cope with the highly dynamic nature of the internet IP address space. Conclusion To find out how BigDataCloud's IP Geolocation service differs from existing providers, check out our detailed blog post: The Next Generation IP Geolocation Service.
  • 32. Contact us Reach out if you have any questions or clarifications support@bigdatacloud.com Email Address www.bigdatacloud.com Website
  • 33. For more content related to IP geolocation, visit our website. BigDataCloud Pty Ltd is a highly innovative start-up company founded in 2018 and operated internationally from our headquarters in Adelaide, South Australia. After years of previous experience in e-commerce, fraud protection and targeted international marketing, the BigDataCloud founders identified an immense lack of high quality, fast and affordable APIs within this and other technical industries. For more info, visit: www.bigdatacloud.com