Data Engineer

ConnexAI
Manchester
4 months ago
Applications closed

Related Jobs

View all jobs

Data Engineer

Data Engineer

Data Engineer

Data Engineer

Data Engineer

Data Engineer

Build the Future of Conversational AI with ConnexAI

As a Speech Data Engineer, your work will power the data behind real-time speech systems used by millions worldwide, ensuring our AI learns from clean, accurate, and reliable datasets. By curating, analysing, and engineering the voice data that fuels our models, you’ll help shape products that transform how people and businesses communicate.

You’ll be part of the team that manages and scales our massive speech corpora, builds automated pipelines for cleaning and validation, and works closely with annotation and Machine Learning teams to keep our models at the cutting edge.

Core Responsibilities
  • Organise and maintain large-scale audio and text corpora, ensuring they are versioned correctly, catalogued, and easy to retrieve.
  • Build automated pipelines using Python, AWS, and Docker to clean, validate, and standardise speech data, detecting duplicates, transcription inconsistencies, or quality issues.
  • Develop and integrate APIs to streamline ingestion and processing of new datasets.
  • Analyse speech datasets to support ASR and TTS model development, performance evaluation, and linguistic research.
  • Implement and manage data version control tools to ensure dataset reproducibility and traceability.
  • Contribute to evaluation frameworks for ASR and TTS performance by analysing metrics such as Word Error Rate (WER), Speaker Similarity (SSim), and Mean Opinion Score (MOS) to generate data-driven insights.
  • Document data processes and tools, ensuring all datasets and analyses are well-documented, reproducible, and compliant with internal standards.
  • Collaborate closely with data scientists, ML engineers, and product teams to identify opportunities to improve data quality, balance, and diversity through targeted analysis and feedback loops.
Key Skills & Experience
  • Strong programming skills in Python for data processing, analysis, and automation.
  • Proficiency with SQL for developing and managing large-scale datasets.
  • Experience with AWS cloud services.
  • Hands‑on experience with Docker and containerised development environments.
  • Familiarity with data versioning tools (e.g., LakeFS, DVC) and dataset reproducibility principles.
  • Strong collaboration and communication skills.
  • Background in speech, audio, or NLP data processing is highly desirable.
Interview Process
  • 30‑minute video call with the team lead
  • Take‑home technical exercise
  • 90‑minute face‑to‑face interview
About ConnexAI

ConnexAI is an award‑winning Conversational AI platform designed by an elite engineering team. ConnexAI’s technology enables organisations to maximise profitability, increase revenue, and take productivity to new levels. ConnexAI provides cutting‑edge, enterprise‑grade AI applications, including AI Agent, AI Guru, AI Analytics, ASR, AI Voice, and AI Quality. We value growth both for our products and our people. As we scale, there will be clear opportunities to progress into senior data science, leadership, or principal research roles. Our high retention rate reflects our inclusive, supportive, and empowering environment.

Seniority level
  • Associate
Employment type
  • Full‑time
Job function
  • Information Technology and Research
Industries
  • Software Development, Data Infrastructure and Analytics, and Research Services


#J-18808-Ljbffr

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

How Many Data Engineering Tools Do You Need to Know to Get a Data Engineering Job?

If you’re aiming for a career in data engineering, it can feel like you’re staring at a never-ending list of tools and technologies — SQL, Python, Spark, Kafka, Airflow, dbt, Snowflake, Redshift, Terraform, Kubernetes, and the list goes on. Scroll job boards and LinkedIn, and it’s easy to conclude that unless you have experience with every modern tool in the data stack, you won’t even get a callback. Here’s the honest truth most data engineering hiring managers will quietly agree with: 👉 They don’t hire you because you know every tool — they hire you because you can solve real data problems with the tools you know. Tools matter. But only in service of outcomes. Jobs are won by candidates who know why a technology is used, when to use it, and how to explain their decisions. So how many data engineering tools do you actually need to know to get a job? For most job seekers, the answer is far fewer than you think — but you do need them in the right combination and order. This article breaks down what employers really expect, which tools are core, which are role-specific, and how to focus your learning so you look capable and employable rather than overwhelmed.

What Hiring Managers Look for First in Data Engineering Job Applications (UK Guide)

If you’re applying for data engineering jobs in the UK, the first thing to understand is this: Hiring managers don’t read every word of your CV. They scan it. They look for signals of relevance, credibility, delivery and collaboration — and if they don’t see the right signals quickly, your application may never get a second look. In data engineering, hiring managers are especially focused on whether you can build and operate reliable, scalable data systems, handle real-world data challenges and work effectively with analytics, BI, data science and engineering teams. This guide breaks down exactly what they look at first in your application — and how to shape your CV, portfolio and cover letter so you stand out.

The Skills Gap in Data Engineering Jobs: What Universities Aren’t Teaching

Data engineering has quietly become one of the most critical roles in the modern technology stack. While data science and AI often receive the spotlight, data engineers are the professionals who design, build and maintain the systems that make data usable at scale. Across the UK, demand for data engineers continues to rise. Organisations in finance, retail, healthcare, government, media and technology all report difficulty hiring candidates with the right skills. Salaries remain strong, and experienced professionals are in short supply. Yet despite this demand, many graduates with degrees in computer science, data science or related disciplines struggle to secure data engineering roles. The reason is not academic ability. It is a persistent skills gap between university education and real-world data engineering work. This article explores that gap in depth: what universities teach well, what they consistently miss, why the gap exists, what employers actually want, and how jobseekers can bridge the divide to build successful careers in data engineering.