Security has always been a fundamental requirement for enterprise adoption. For example, in a company, billing, data science, and regional marketing teams may all have the required access privileges to view customer data, while sensitive data like credit card numbers should be accessible only to the finance team. Previously, Apache Hive™ with Apache Ranger™ policies was used to manage such scenarios. In this talk, we shows that Apache Spark™ SQL is aware of the existing Apache Ranger policies defined for Apache Hive. In other words, for SQL users, access to databases, tables, rows and columns are controlled in a fine-grained manner, irrespective of whether the data is analyzed using Apache Spark SQL or Hive. If a policy is updated, both Apache Spark and Apache Hive users get their result consistently. In addition, all fine-grained access via Apache Spark SQL can be monitored and searched through a centralized interface via Apache Ranger.
Human Factors of XR: Using Human Factors to Design XR Systems
Security Updates: More Seamless Access Controls with Apache Spark and Apache Ranger
1. SECURITY UPDATES:
More Seamless Access Controls with
Apache Spark and Apache Ranger
Dongjoon Hyun @ Hortonworks Spark Team
Jason Dere @ Hortonworks Hive Team
June 2017