songhou@creditease.cn
1
1.
2.
3.
4.
5.
2
1.
•
P2P
•
•
3
1.
2.
3.
4.
5.
4
2.
1.
2.
3.
4.
5
2.1
6
2.2
•
•
•
7
2.3
•
•
-
-
•
-
-
-
8
2.4
•
•
• -> ->
• -> ->
-
-
-
•
9
1.
2.
3.
4.
5.
10
3.
1.
2.
3.
4.
11
3.1
12
risk model
data data
3.2
13
Query
Engine
crawler
controller
REST
Client
DB
HDFS
File
KG
web front
REST
Client
realtime
source
realtime/batch
extractionrealtime
inserts
commands web
trace
batch
processing
Web
crawlers
logging
.
.
.
.
.
.
.
.
.
.
.
.
Kafka
spark
streaming
web extraction
config
online knowledge
processing
offline complex
reasoning
entity retrieve
graph traverse
full text search
KG
repository
batch logging
partners
Query
Engine
data
integration
3.3
14
3.4
15
Spark / MR
Yarn
Hadoop
1.
2.
3.
4.
5.
16
4.
1.
2.
17
4.1
•
- 3000
- 30
-
•
-
-
-
-
18
4.1
•
NLP
•
• Albus
•
19
4.2
•
-
-
-
-
•
-
-
-
20
4.2
•
•
• Pentaho
dashboard
21
1.
2.
3.
4.
5.
22
5.
•
•
•
•
23
•
• jianmindong@creditease.cn
24
Thanks
Q & A
25
songhou@creditease.cn
http://housong.github.io