Data Engineering Roles

Data Engineering Roles
Adam Doyle
12/1/2021

Data Store Engineer
• Store Data, Retrieve Data, Optimize Data
• SQL (all flavors)
• Data Warehouse
• Data Lake

ETL Engineer
• Retrieve data from remote sources and move the data into a data
store.
• Data Enrichment
• Tool-based ETL products
• Programmatic ETL development

Stream Engineer
• Retrieve data from streaming data sources
• Handle multi-source and late-arriving data
• Stream data sources (Kafka, RabbitMQ)
• Programmatic processing (Spark)

Data Quality Engineer
• Profile and check for outliers
• Handle data quality issues
• Data Quality Tools (Informatica DQ, Great Expectations)
• Data Analysis/Profiling (SQL)
• Programmatic Adjustments

Visualization Engineer
• Develop internal data models within data visualization tools
• Create dashboards
• Data Visualization tools (Tableau, Power BI)
• Data analysis (SQL)

Deployment Engineer
• Deploys processes to production
• DevOps, CI/CD (Ansible/Terraform)
• Source Control (Git)
• Data deployment (Liquibase)

Operations Engineer
• Monitor data applications
• Troubleshooting production issues
• Data Analysis (SQL)
• Root Cause Analysis (Splunk)

Production Engineer
• Ensure that application code is ready to go to production
• Test Harness (SoapUI)
• Programming languages
• Understanding of Machine Learning processes

Cluster Engineer
• Work with clustered hardware and software to ensure deployment
and scalability.
• Cluster software (Hadoop, Kubernetes)
• Log Monitoring (Splunk)

Cloud Engineer
• Implement solutions in the cloud with both cloud-native technology
and conversions of on-premise solutions
• Cloud Platforms (Azure, AWS, GCP)
• Infrastructure as Code (Terraform)

Machine Learning Engineer
• Adapt Machine Learning Models to be deployed in production with
an emphasis on performance and scalability
• Machine Learning Platforms (Spark)
• Programming language (Python, Scala)
• Performance tuning

Feature Engineer
• Create informational features to be used in data science models – at
scale, at speed
• Extract information form data
• Aggredate data into information
• Apply business rules
• Data analysis (SQL)
• Programming language (Python, Scala)

Data Engineering Roles

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Data Engineering Roles

Similar to Data Engineering Roles (20)

More from Adam Doyle

More from Adam Doyle (20)

Recently uploaded

Recently uploaded (20)

Data Engineering Roles