Case study

CES achieves 85% cost reduction in speech-to-text with managed AI on AWS

Industry: Deep Tech & Manufacturing

About

CES is a Welsh company processing high volumes of customer service calls in both English and Welsh. As call volumes grew, the cost of transcription using AWS Transcribe became unsustainable. Firemind helped CES redesign its speech-to-text architecture using managed AI services on AWS, delivering the same accuracy and functionality at a fraction of the cost.

Challenge

CES relied on AWS Transcribe to convert 1,000–2,000 daily calls into text. While accurate, the service became increasingly expensive as call volumes scaled.

Monthly transcription costs ranged between $2,000 and $3,000, driven by AWS Transcribe’s per-minute pricing model.

CES needed to:

  • Dramatically reduce transcription costs

  • Maintain transcription accuracy and reliability

  • Continue supporting both English and Welsh languages

  • Preserve advanced capabilities such as speaker separation and PII reduction

  • Avoid disrupting existing batch-processing workflows

Firemind was engaged to design and deliver a cost-efficient, production-ready managed AI solution on AWS.

Solution

Firemind replaced AWS Transcribe with a custom, scalable speech-to-text architecture built on AWS managed services:

  • Deployed an open-source Whisper model (Turbo) on Amazon SageMaker after evaluating alternatives and selecting Whisper for superior Welsh language support

  • Implemented SageMaker async endpoints with auto-scaling, allowing the platform to scale from zero to active instances based on demand

  • Optimised infrastructure using ML.G4.XLarge GPU instances with NVIDIA L4 GPUs at approximately $2 per hour

  • Rebuilt Transcribe-native capabilities using LLMs, including:

    • Speaker separation

    • PII detection and reduction

  • Integrated serverless components, using AWS Lambda for payload processing and S3/SQS for orchestration

  • Maintained existing workflows, preserving the customer’s four daily batch runs and S3-based ingestion

  • Delivered full CI/CD automation using AWS CodeBuild for repeatable, production-grade deployments

Results

The new managed AI platform delivered substantial and measurable business impact:

  • Monthly costs reduced from $2,000–$3,000 to $300–$600
  • Overall cost savings of 80–85%, equivalent to $1,400–$2,400 per month
  • Maintained transcription accuracy and feature parity with AWS Transcribe
  • Continued full support for English and Welsh languages
  • Scalable, future-ready architecture with potential for an additional 40% cost reduction through further instance optimisation

CES achieved significant cost optimisation without compromising performance, compliance, or language coverage demonstrating how managed AI on AWS can outperform fully managed services at scale.

Outcome

See more case studies

View all case studies

CONTACT US

Start with a focused conversation about your environment.

We help you build, optimise and run AI that delivers measurable results.

Your benefits:

  • Outcome-driven - Measurable business impact
  • Expert-led - Hands-on delivery from senior practitioners
  • Secure by design - Your data and compliance requirements first
  • Fast to value - From discovery to production in weeks

What happens next?

Let's talk

A 20-minute focused session on your goals and current situation.

We propose

A clear plan and scope tailored to your priorities.

You decide

No obligation - move forward when the time is right.

No obligation - just a focused 20-minute discussion about your goals.