Data Engineer at Capgemini - ScoutJobs - The AI-curated global job board
Skip to content
C
Posted 2 days ago

Data Engineer

CapgeminiData Engineer

Requirements

5+ years data engineering experience, Expert-level SQL, Python proficiency, Clean room technology experience, Identity resolution knowledge, PII governance knowledge

Skills

SQLPythonSnowflakeAWSBigQuery

About the role

Responsibilities

  • Design and maintain high-throughput ingestion pipelines for transaction signals, behavioral events, and third-party identity graphs.
  • Implement identity resolution logic at scale, including deterministic matching and probabilistic graph construction.
  • Build and maintain data clean room connectors and privacy-preserving data exchange pipelines (e.g., AWS Clean Rooms, Google ADH).
  • Design medallion-architecture data models optimized for cohort-level attribution.
  • Build automated QC and reconciliation frameworks for deduplication and compliance validation.
  • Implement PII governance controls, including redacted ID egress and consent signal propagation.
  • Integrate LLM-based APIs for AI-powered signal enrichment and compliance pre-screening.

Requirements

  • 5+ years of data engineering experience.
  • Expert-level SQL proficiency across major cloud data warehouses (Snowflake, BigQuery, Redshift, or Synapse).
  • Strong proficiency in Python for pipeline development and automation.
  • Hands-on experience with clean room technology (AWS Clean Rooms, LiveRamp DCR, Google ADH, or equivalent).
  • Deep understanding of identity resolution concepts and device graph assembly.
  • Strong knowledge of PII governance and financial services regulatory requirements (GLBA, Fair Lending, etc.).
  • Experience integrating with DSPs, CDPs, or marketing activation platforms.

Preferred Qualifications

  • Experience with graph database technologies such as Neo4j, Amazon Neptune, or TigerGraph.
  • Familiarity with LiveRamp Embedded Identity, UID2 token handling, or walled garden attribution.
  • Working knowledge of LLM APIs for structured data enrichment and AI-assisted workflows.

Benefits

  • Paid time off including vacation, company holidays, personal days, and sick leave.
  • Medical, dental, and vision coverage.
  • Retirement savings plans (e.g., 401(k)).
  • Life and disability insurance.
  • Employee assistance programs.

About the Company

Capgemini is a global business and technology transformation partner, helping organizations to accelerate their dual transition to a digital and sustainable world. With over 340,000 team members in more than 50 countries, we deliver end-to-end services leveraging market-leading capabilities in AI, cloud, and data.

ScoutJobs Agent

Get matches like this delivered daily

Sign up free — we'll pull jobs that fit your CV from across the web and rank them for you.

Get started — it's free

Data Engineer

Capgemini · New York

Sign up to apply