Data Engineer

ATG (Auction Technology Group)
City of London
1 month ago
Applications closed

Related Jobs

View all jobs

Data Engineer

Data Engineer

Data Engineer

Data Engineer

Data Engineer

Data Engineer

You have a passion for building scalable, reliable data systems that enable data scientists, ML engineers, and analysts to do their best work. You understand that great data products require more than just moving data; they need robust pipelines, data quality assurance, and thoughtful architecture. Not only do you put reliability and scalability at the heart of everything you do, but you are adept at enabling data-driven decisions through proper data modeling and pipeline design. You will be comfortable working cross-functionally with Product, Engineering, Data Science, Analytics, and MLOps teams to develop our products and improve the end-user experience. You should have a strong track record of successful prioritization, meeting critical deadlines, and enthusiastically tackling challenges with an eye toward problem solving.


Key Responsibilities

  • Data Pipeline Development & Management
  • Design, build, and maintain robust ETL/ELT pipelines that support analytics, ML models, and business intelligence
  • Develop scalable batch and streaming data pipelines to process millions of auction events, user interactions, and transactions daily
  • Implement workflow orchestration using Airflow, Dagster, or similar tools to manage complex data dependencies
  • Build data validation and quality monitoring frameworks to ensure data accuracy and reliability
  • ML & Analytics Infrastructure
  • Build feature engineering pipelines to support ML models for search, recommendations, and personalization
  • Integrate with feature stores to enable consistent feature computation across training and inference
  • Create datasets for model training, validation, and testing with proper versioning
  • Data Quality & Monitoring
  • Implement comprehensive data quality checks, anomaly detection, and alerting systems
  • Monitor pipeline health, data freshness, and SLA compliance
  • Create dashboards and reporting tools for data pipeline observability
  • Debug and resolve data quality issues and pipeline failures
  • Collaboration & Best Practices
  • Work closely with Data Scientists and ML Engineers to understand data requirements and deliver reliable datasets
  • Partner with Software Engineers to integrate data pipelines with application systems
  • Establish and document data engineering best practices, coding standards, and design patterns
  • Mentor junior engineers on data engineering principles and best practices

Key Requirements

  • Required Qualifications: BSc or MSc in Computer Science, Data Engineering, Software Engineering, or a related field, or equivalent practical experience
  • 5+ years of experience building and maintaining data pipelines and infrastructure in production environments
  • Strong programming skills in Python, with experience in data processing libraries (Pandas, PySpark)
  • Expert-level SQL skills with experience in query optimization and performance tuning
  • Proven experience with workflow orchestration tools (Airflow, Dagster, Prefect, or similar)
  • Hands‑on experience with cloud platforms (AWS preferred) including S3, Redshift, EMR, Glue, Lambda
  • Experience with data warehousing solutions (Redshift, Snowflake, BigQuery, or similar)
  • Experience with version control systems (Git) and CI/CD practices for data pipelines

Technical Skills

  • Experience with distributed computing frameworks (Apache Spark, Dask, or similar)
  • Knowledge of both batch and streaming data processing (Kafka, Kinesis, or similar)
  • Familiarity with data formats (Parquet, ORC, Avro, JSON) and their trade-offs
  • Understanding of data quality frameworks and testing strategies
  • Previous work with vector databases (Pinecone, Milvus, etc)
  • Experience with monitoring and observability tools (Prometheus, Grafana, CloudWatch)
  • Knowledge of infrastructure-as-code tools (Terraform, CloudFormation)
  • Understanding of containerization (Docker) and orchestration (Kubernetes) is a plus

Nice-to-Have

  • Familiarity with dbt (data build tool) for data transformation workflows
  • Knowledge of Elasticsearch or similar search technologies
  • Experience in eCommerce, marketplace, or auction platforms
  • Understanding of GDPR, data privacy, and compliance requirements
  • Experience with real-time analytics and event-driven architectures (Flink, Materialize)


#J-18808-Ljbffr

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

How to Write a Data Engineering Job Ad That Attracts the Right People

Data engineering is the backbone of modern data-driven organisations. From analytics and machine learning to business intelligence and real-time platforms, data engineers build the pipelines, platforms and infrastructure that make data usable at scale. Yet many employers struggle to attract the right data engineering candidates. Job adverts often generate high application volumes, but few applicants have the practical skills needed to build and maintain production-grade data systems. At the same time, experienced data engineers skip over adverts that feel vague, unrealistic or misaligned with real-world data engineering work. In most cases, the issue is not a shortage of talent — it is the quality and clarity of the job advert. Data engineers are pragmatic, technically rigorous and highly selective. A poorly written job ad signals immature data practices and unclear expectations. A well-written one signals strong engineering culture and serious intent. This guide explains how to write a data engineering job ad that attracts the right people, improves applicant quality and positions your organisation as a credible data employer.

Maths for Data Engineering Jobs: The Only Topics You Actually Need (& How to Learn Them)

If you are applying for data engineering jobs in the UK, maths can feel like a vague requirement hiding behind phrases like “strong analytical skills”, “performance mindset” or “ability to reason about systems”. Most of the time, hiring managers are not looking for advanced theory. They want confidence with the handful of maths topics that show up in real pipelines: Rates, units & estimation (throughput, cost, latency, storage growth) Statistics for data quality & observability (distributions, percentiles, outliers, variance) Probability for streaming, sampling & approximate results (sketches like HyperLogLog++ & the logic behind false positives) Discrete maths for DAGs, partitioning & systems thinking (graphs, complexity, hashing) Optimisation intuition for SQL plans & Spark performance (joins, shuffles, partition strategy, “what is the bottleneck”) This article is written for UK job seekers targeting roles like Data Engineer, Analytics Engineer, Platform Data Engineer, Data Warehouse Engineer, Streaming Data Engineer or DataOps Engineer.

Neurodiversity in Data Engineering Careers: Turning Different Thinking into a Superpower

Every modern organisation runs on data – but without good data engineering, even the best dashboards & machine learning models are built on sand. Data engineers design the pipelines, platforms & tools that make data accurate, accessible & reliable. Those pipelines need people who can think in systems, spot patterns in messy logs, notice what others overlook & design elegant solutions to complex problems. That is exactly why data engineering can be such a strong fit for many neurodivergent people, including those with ADHD, autism & dyslexia. If you’re neurodivergent & considering a data engineering career, you might have heard comments like “you’re too disorganised for engineering”, “too literal for stakeholder work” or “too distracted for complex systems”. In reality, the traits that can make traditional office environments hard often line up beautifully with data engineering work. This guide is written for data engineering job seekers in the UK. We’ll cover: What neurodiversity means in a data engineering context How ADHD, autism & dyslexia strengths map to common data engineering tasks Practical workplace adjustments you can request under UK law How to talk about your neurodivergence in applications & interviews By the end, you’ll have a clearer sense of where you might thrive in data engineering – & how to turn “different thinking” into a genuine professional superpower.