C
Posted 2 days ago
Data Engineer
CapgeminiData Engineer
Requirements
5+ years Data Engineering experience, 3+ years Databricks experience, Python, PySpark, SQL, Azure Data Factory, Azure Data Lake, Azure Databricks, CI/CD, English fluency
Skills
DatabricksPySparkAzure
About the role
Responsibilities
- Design and implement robust, scalable data pipelines using ADF Pipelines and Databricks
- Develop maintainable ETL pipelines using PySpark following established coding guidelines
- Perform data model development, versioning, and optimize ETL performance
- Work extensively with Azure cloud infrastructure including Data Factory, Data Lake, and Databricks
- Develop and monitor schema and data migrations
- Transform SQL code into Databricks workflows written in Python
Requirements
- 5+ years of experience in Data Engineering
- 3+ years of hands-on experience working with Databricks
- Proficiency in Python, PySpark, and SQL
- Strong experience with Azure stack: Azure Data Factory, Azure Data Lake, and Azure Databricks
- Experience with DevOps practices, CI/CD (Azure DevOps, Git), and testing
- Fluency in English
- Experience working in Agile/Scrum environments
Preferred Qualifications
- Fluency in German
- Experience with PowerBI
- Knowledge of Object-oriented programming
- Experience with PowerShell or pytest
- Familiarity with Azure Cosmos DB
About the Company
Capgemini is an AI-powered global business and technology transformation partner, delivering tangible business value. We imagine the future of organizations and make it real with AI, technology, and people. With a strong heritage of nearly 60 years, we are a responsible and diverse group of 420,000 team members in more than 50 countries.
ScoutJobs Agent
Get matches like this delivered daily
Sign up free — we'll pull jobs that fit your CV from across the web and rank them for you.
Get started — it's freeData Engineer
Capgemini · Cairo
