
Posted 4 days ago
Senior Software Engineer (Machine Learning Platform Engineering)
TekionSenior Software Engineer (Machine learning Platform Engineering)
Requirements
5+ years building large-scale data/ML or platform systems, Proficiency in Python and Java, Scala, or Go, Experience with MLOps pipelines (Airflow, Kubeflow, MLflow), Cloud expertise in AWS and container orchestration (Docker, Kubernetes), Knowledge of LLM gateways and agentic orchestration, Experience with vector search and knowledge graphs
Skills
PythonAWSKubernetesMLOpsLLMDocker
About the role
Responsibilities
- Build and operate the LLM control plane and gateway, including smart routing, rate limiting, failover, and cost tracking.
- Develop unified APIs and SDKs (REST/gRPC) with normalized schemas, caching, and full observability.
- Own the agent runtime, managing tool registries, permissions, function calling, and grounding.
- Design orchestration patterns such as sequential, planner-executor, and streaming workflows.
- Enable platform components for training and scoring pipelines for both classical ML and deep learning models.
- Implement safety and privacy guardrails, including content filtering, prompt validation, and PII redaction.
- Evolve the domain graph and entity resolution to power hybrid retrieval (graph, vector, and keyword search).
- Define and maintain SLOs for latency, uptime, and cost, while providing templates and documentation for product teams.
Requirements
- 5+ years of experience building large-scale data, machine learning, or platform systems.
- Proficiency in Python and at least one of Java, Scala, or Go.
- Strong software engineering fundamentals in distributed systems, concurrency, and API design.
- Experience with MLOps pipelines and tools such as Airflow, Kubeflow, or MLflow.
- Expertise in cloud environments (AWS preferred) and container orchestration using Docker and Kubernetes.
- Practical knowledge of machine learning workflows, including feature engineering, training, and drift detection.
Preferred Qualifications
- Experience building or operating LLM gateways, provider adapters, or control planes.
- Hands-on experience with agentic systems, including tool use, orchestration frameworks, and human-in-the-loop workflows.
- Experience with knowledge graphs (e.g., Neo4j, Neptune) and vector search (e.g., pgvector, Qdrant, Milvus).
- Familiarity with real-time data processing using Spark, Flink, or Kafka.
About the Company
Tekion is disrupting the automotive industry with the first cloud-native automotive platform. By connecting the entire ecosystem—including OEMs, retailers, and consumers—through a seamless, AI-driven platform, Tekion is enabling the best automotive retail experiences ever. We employ close to 3,000 people across North America, Asia, and Europe.
ScoutJobs Agent
Get matches like this delivered daily
Sign up free — we'll pull jobs that fit your CV from across the web and rank them for you.
Get started — it's freeSenior Software Engineer (Machine Learning Platform Engineering)
Tekion · Bengaluru
