closed

Senior Data Engineer

Full-time

Commitment type

Europe

Location

$4000 - $5500/month

Salary
Job Description

Adeva is a global talent network that enables work without boundaries by connecting tech professionals with top companies worldwide.

This role is for a highly skilled Data Engineer specializing in building, optimizing, and managing data pipelines for data ingestion, transformation, and publication using Azure Databricks and related Azure services. The ideal candidate should have strong experience with Databricks, Spark (SQL, PySpark), Databricks Workflows, and Azure data services like Azure Data Lake, with a focus on scalability and performance.

Responsibilities

  • Data Pipeline Expertise: Extensive experience in designing and implementing scalable ETL/ELT data pipelines in Azure Databricks, transforming raw data into usable datasets for analysis.
  • Azure Databricks Proficiency: Strong knowledge of Spark (SQL, PySpark) for data transformation and processing within Databricks, along with experience building workflows and automation using Databricks Workflows.
  • Azure Data Services: Hands-on experience with Azure services like Azure Data Lake, Azure Blob Storage, and Azure Synapse for data storage, processing, and publication.
  • Data Governance & Security: Familiarity with managing data governance and security using Databricks Unity Catalog, ensuring data is appropriately organized, secured, and accessible to authorized users.
  • Optimization & Performance Tuning: Proven experience in optimizing data pipelines for performance, cost-efficiency, and scalability, including partitioning, caching, and tuning Spark jobs.
  • Cloud Architecture & Automation: Strong understanding of Azure cloud architecture, including best practices for infrastructure-as-code, automation, and monitoring in data environments.
     

Requirements

The Data Engineer will focus on building and maintaining data pipelines that support ingestion, transformation, and publishing of data for integrations and downstream analytics, ensuring the performance, scalability, and reliability of data workflows.

Data Pipeline Development:

  • Design and implement end-to-end data pipelines in Azure Databricks, handling ingestion from various data sources, performing complex transformations, and publishing data to Azure Data Lake or other storage services.
  • Write efficient and standardized Spark SQL and PySpark code for data transformations, ensuring data integrity and accuracy across the pipeline.
  • Automate pipeline orchestration using Databricks Workflows or integration with external tools (e.g., Apache Airflow, Azure Data Factory).

Data Ingestion & Transformation:

  • Build scalable data ingestion processes to handle structured, semi-structured, and unstructured data from various sources (APIs, databases, file systems).
  • Implement data transformation logic using Spark, ensuring data is cleaned, transformed, and enriched according to business requirements.
  • Leverage Databricks features such as Delta Lake to manage and track changes to data, enabling better versioning and performance for incremental data loads.

Data Publishing & Integration:

  • Publish clean, transformed data to Azure Data Lake or other cloud storage solutions for consumption by analytics and reporting tools.
  • Define and document best practices for managing and maintaining robust, scalable data pipelines.

Data Governance & Security:

  • Implement and maintain data governance policies using Unity Catalog, ensuring proper organization, access control, and metadata management across data assets.
  • Ensure data security best practices, such as encryption at rest and in transit, and role-based access control (RBAC) within Azure Databricks and Azure services.

Performance Tuning & Optimization:

  • Optimize Spark jobs for performance by tuning configurations, partitioning data, and caching intermediate results to minimize processing time and resource consumption.
  • Continuously monitor and improve pipeline performance, addressing bottlenecks and optimizing for cost efficiency in Azure.

Automation & Monitoring:

  • Automate data pipeline deployment and management using tools like Terraform, ensuring consistency across environments.
  • Set up monitoring and alerting mechanisms for pipelines using Databricks built-in features and Azure Monitor to detect and resolve issues proactively.

This role requires a deep understanding of data engineering within cloud environments, with an emphasis on Azure and Databricks, to build highly reliable, scalable, and efficient data pipelines that support data-driven decision-making.
 

About Adeva

Adeva is an exclusive tech professionals' network that connects some of the best tech professionals in the world with leading enterprise organizations and startups. 

Our network is distributed all over the world, with engineers in more than 35 countries. Our company culture builds connections, careers, and employee growth. We are creating a workplace from the future that values flexibility, autonomy, and transparency. If that sounds like something you’d like to be part of, we’d love to hear from you.

Required skills
  • PySpark
  • Databricks
  • ETL
  • Data Pipelines
  • Dbt
Apply for This Job

Or, continue with

Share this job

Not what you’re looking for? Check out these similar roles.

Senior FullStack Engineer

Hiring Now

$7000 - $8500/month

United States

Full-time

  • Azure
  • Express.js

View Details

Unix/Linux Shell Script Developer (German Language Mandatory)

Hiring Now

$5500 - $7000/month

Europe

Full-time

  • UNIX
  • Linux
  • Shell
  • PERL
  • PL/SQL

View Details

Senior Engineering Manager

Hiring Now

$5500 - $7000/month

South America

Full-time

View Details

Tech Lead (Senior Staff/Principal Engineer)

Hiring Now

$5500 - $7000/month

South America

Full-time

View Details

Staff Full-Stack Engineer

Hiring Now

$5500 - $7000/month

South America

Full-time

  • Golang

View Details

Staff Backend Engineer

Hiring Now

$5500 - $7000/month

South America

Full-time

View Details

Senior FE Engineer

Hiring Now

$4000 - $5500/month

South America

Full-time

  • Vue.js
  • Mocha

View Details

Senior Backend Engineer

Hiring Now

$4000 - $5500/month

South America

Full-time

  • Go
  • Node.js
  • RESTful Services
  • gRPC
  • GraphQL
  • MongoDB
  • PostgreSQL
  • . . .

View Details

Senior Full Stack Developer

Hiring Now

$4000 - $5500/month

Europe

Full-time

  • .NET
  • C#
  • Entity Framework

View Details

Senior Python Engineer

Hiring Now

$4000 - $5500/month

South America, Europe

Full-time

  • Java
  • jQuery
  • Monolithic Architecture
  • GCP
  • MySQL

View Details

FullStack Engineer Django-JS

Hiring Now

$4000 - $5500/month

Macedonia, Serbia

Full-time

  • HTML
  • CSS

View Details

Software Engineer FE (React)

Hiring Now

$4000 - $5500/month

South America

Full-time

  • Redux
  • GitHub Actions
  • Jest
  • AWS CloudFront

View Details

Software Engineer (Python, React, Data)

Hiring Now

$4000 - $5500/month

South America

Full-time

  • ReactJS
  • JavaScript
  • Python
  • Pandas
  • Flask
  • Django
  • SQL
  • GitHub
  • AWS
  • Kubernetes
  • . . .

View Details

Senior Backend Developer (Laravel)

Hiring Now

$4000 - $5500/month

Europe

Full-time

  • PHP
  • Laravel

View Details

Senior Nvidia HPC Engineer

Hiring Now

$8500+/month

Finland, Iceland, Indonesia, Malaysia, Norway, United Arab Emirates, United States

Full-time

  • High-Performance Computing Cluster (HPCC)

View Details

Senior Scala Engineer

Hiring Now

$4000 - $5500/month

Europe

Full-time

  • Scala
  • Cats

View Details