Using Graph Databases in Real-Time to Solve Resource Authorization at Telenor - GraphConnect London 2013

Using Graph Databases in Real-Time to
Solve Resource Authorization at Telenor

Graph Connect London – 19 Nov 2013
by Sebastian Verheughe

Telenor Norway
Subsidiary of the Telenor Group
~1 billion GBP in mobile revenues 2012

Sebastian Verheughe
Lead Developer for Neo4j solution
Coding Architect

Disclaimer
The presentation is not identical to the implementation
due to security reasons but shows how we have
modeled and solved the problem in general.
However, all presented data (numbers & charts) are
real, unfiltered and extracted from the production logs

A very

aspect of
our business

Telenor Norway Middleware Services
Channel

Channel

used by 42 channels
calls 35 sub-systems
10,000 code classes
500 requests/second
20,000 orders/day

Backend

Backend

Channel

Channel

MOBILE

MW

Channel

Channel

Providing business logic
and data for all channels
in the mobile value chain

BUSINESS

LOGIC
& DATA

Backend

Handles users with
access to X00,000
resources

Backend

Backend

Backend

Our Problem
20 minutes to calculate all accessible resources
1500 lines of SQL to implement the authorization logic
“solved” by caching data going stale
and the solution did not scale…

Why a Graph Database?
Access

Parent Company

User: R. Branson

Which resources
does the user
have access to?

Part of Company

Sales

Finance Production

HR

Sub

The questions we wanted answered
required traversal of tree structures.

Tablet

Subscription
Owner

Sub

Tablet

Uses
Subscription

Phone

Tailored Read Model
The Model makes read queries
as simple and efficient as possible.
First find your questions
then model your graph

graph model

=

relational model

High Level Architecture
Clients

Classic MW
Services

other
sources

tx log
RDBMS

check
access

Resource
Authorization

Message Queue

Neo4j

Conditional Rules
ACCESS is given with the following include parameters:
access to subsidiaries and access to content
Only find children of PARENT COMPANY
given access to subsidiaries is allowed
User

Only look at PART OF COMPANY
given access to content is allowed

Only look at SUBSCRIPTION OWNER
given access to content is allowed

Different Access Needs
Access Subsidiaries & Content

Super Admin

Access Content

Umbrella Admin
Access S&C

Admin

Graph Algorithm
Prerequisite: The user node
1. Follow all ACCESS relationships and
read the access parameters on the relationship
2. Follow all PARENT COMPANY relationships given
access to subsidiaries is allowed

3. Follow all PART OF COMPANY relationships given
access to content is allowed
4. Follow all SUBSCRIPTION OWNER relationships given
access to content is allowed

Solution Value
1. Performance optimized from minutes to seconds.
2. Simplicity of writing and understanding business rules
for the query traversal.

3. Scalability by performance allowing us to onboard
more corporate customers (project business case)

Autonomous Service
with it’s own life-cycle and data repository.

Authorization Complexity
• Not a collection of isolated customer trees *
• Not all users of a customer have equal access
• Not a fixed schema, form or size for all
customers
• Real-time updated with customer & product
data

The data form a highly connected living graph
* Covered later in Technical Details

How we Started with Neo4j
1. Searched the internet for articles about graph
database and different solutions.
2. Downloaded and quickly prototyped the solution we
liked that matched our requirements (Neo4j).
3. Workshop with Neo4j and our project developers to
quickly gain competence and ensure design QA.

4. Solution QA with Neo4j before production and help
with performance issues / tuning.

Lessons Learned
• Choose a solution/technology that fits your problem
• New way of thinking – build competence in org.
• Profile your java code to make it really fast
• Don’t put everything into the graph (functional creep)
• Need to know how traversal works (e.g. shortest
path)
• Benchmark the graph to evaluate your traversal
speed

Alternative In-Memory RDBMS
Option 1: Use existing database
- Performance issues due to shared data / suboptimal
structure
- Complexity since SQL not designed for traversal

Option 2: Separate database
+ Might reach same performance as graph db
+ Familiar technology
- Complexity since SQL not designed for traversal

Decided to go with our instinct

Graph Database

Different Graph Structures
get all accessible subscriptions
2 000

1 000

1 700 ms

Company X: 147 000

750 ms

Company Y: 52 000

1300 ms

Company Z: 95 000

Data from test – repeated prod sampling gave ~2.4 sec for 215,000 subscriptions

Different Graph Structures
check access to single subscription
2 000

1 000

1 ms

Company X: 147 000

1 ms

Company Y: 52 000

1 ms

Company Z: 95 000

Production Performance
retrieve all accessible resources
RDBMS Disk

RDBMS Mem Cached

Graph In-Heap

Company X

12 min

18 sec

< 2 sec

Company Y

22 min

58 sec

< 2 sec

Company Z

3 min

15 sec

< 2 sec

Check single resource access

1 ms
No operational problems in production

Production Details
Graph Size
heap)

27 million nodes (pre-warmed in
~1x properties, ~2x relationships

Traffic Volume

~1000 req/min during biz hours
~ 40K daily real-time updates

Performance
ms

Avg: 1 ms, 99% < 4 ms, 99.9% < 9

JVM
warmed)

Sun 6, 20 GB Heap (~15 GB pre-

Production Has Access Query

Time (ms)

Time (ms)

Production All Queries

Time (ms)
Garbage collection

Implementing the Algorithm
Lets look at the Neo4j Traversal Framework
Iterable<Node> getAccessibleResources(…) {

Evaluator myEvaluator = …
Expander myExpander = …
return Traversal.description()
.evaluator(myEvaluator)

.expander(myExpander)
.traverse(startNode).nodes();
}

Evaluator is a simple filter, e.g. for Node
type
class MyEvaluator implements Evaluator {
public Evaluation evaluate(Path path) {
if <I am interested in this node>
return Evaluation.INCLUDE_AND_CONTINUE;
else
return Evaluation.EXCLUDE_AND_CONTINUE;
}
}

The custom Expander contains business
rules!
class ResAuthExpander implements PathExpander<PathExpander> {
…
public … expand(Path path, BranchState<…> state) {
if (path.lastRelationship rel == ACCESS)
accToSub = rel.getProperty(ACCESS_TO_SUBSIDIARIES);
accToCont = rel.getProperty(ACCESS_TO_CONTENT);
state.set( getExpander(accToSub, accToCont) );
}
return state.get().expand(…)
}

Single expander class to control business

Generates the valid relationships to traverse.
public getExpander(boolean accToSub, boolean accToCont) {
PathExpander exp = StandardExpander.DEFAULT.add(ACCESS,…);
if (accToSub)
exp.add(PARENT_COMPANY,…)
if (accToCont)
exp.add(PART_OF_COMPANY,…).add(SUBSCRIPTION_OWNER,…);
return exp;
}
}

U-Turn Strategy
4.
Access

User

Does the user
have access to
subscription X?

3.

5.

Up to find path quickly
Down to check access

6.
2.
7.
1.

8.
X

Subscription

Reversing the traversal increases performance from n/2 to
2d where n and d are tree size and depth (we went from 1s to

The Zigzag Problem
What if we also have reversed access to the subscription
payer?

User

Op

IT

E
d

Jo

Subscriptions

Solvable by adding state to the traversal (or check path)

The Many-to-Many Problem
The nodes Op & IT may be connected through many
subscriptions

Does the user
have access to
department Op?

Op

IT

Access

User

Subscription

Traversal becomes time consuming (e.g. M2M market)
However, we only needed to implement the rule for direct access to
sub.

Deployment View
• Two equal instances of Neo4j embedded in Tomcat
• Access through Java API due to need for custom logic
• Using Neo4j 1.8 without HA (did not like ZooKeeper)

Resource
Authorization

Neo4j

tx log
RDBMS

Message Queue

Resource
Authorization

Neo4j

Dual Model Cost
There are some drawbacks with dual models
also
• Not possible to simply join the ACL with resource
tables in the relational database - queries needed
redesign
• The complexity added by code and infrastructure
necessary to manage an additional model.
• Not ordinary competence (in Norway at least)

Unexplored Areas
Combining Access Control List & Graph
• Best of both worlds (simple logic, fast lookup)
Algorithm
–
–
–
–

Find all affected users when the graph is updated
Invalidate users access control list
Calculate all accessible resources for each user
Store result in users access control list

Could then skip the U-turn and many-to-many problem.

Was is worth it?

Yes!
The user experience is important in
Telenor

Web References
• Telenor Norway
• The Project - How NOSQL Paid off for
Telenor
• JavaWorld - Graphs for Security

Using Graph Databases in Real-Time to Solve Resource Authorization at Telenor - GraphConnect London 2013

More Related Content

What's hot

Viewers also liked

Similar to Using Graph Databases in Real-Time to Solve Resource Authorization at Telenor - GraphConnect London 2013

Recently uploaded

Using Graph Databases in Real-Time to Solve Resource Authorization at Telenor - GraphConnect London 2013

Editor's Notes