CES achieves 85% cost reduction in speech-to-text with AI operations on AWS

About

CES is a Welsh company that processes high volumes of customer service calls in both English and Welsh. As call volumes grew, the cost of transcription with Amazon Transcribe became unsustainable. Firemind helped CES redesign its speech-to-text architecture using AI operations on AWS, delivering the same accuracy and functionality at a fraction of the cost.

Challenge

CES relied on Amazon Transcribe to convert 1,000–2,000 calls per day into text. The service was accurate, but it became increasingly expensive as call volumes grew.

Monthly transcription costs ranged from $2,000 to $3,000, driven by Amazon Transcribe’s per-minute pricing model.

CES needed to:

Dramatically reduce transcription costs
Maintain transcription accuracy and reliability
Continue supporting both English and Welsh languages
Preserve advanced capabilities such as speaker separation and PII redaction
Avoid disrupting existing batch-processing workflows

Firemind was engaged to design and deliver a cost-efficient, production-ready AI operations solution on AWS.

Solution

Firemind replaced Amazon Transcribe with a custom, scalable speech-to-text architecture built on AWS managed services:

Deployed an open-source Whisper model (Turbo) on Amazon SageMaker, selected after evaluating alternatives because of its superior Welsh language support
Implemented SageMaker asynchronous inference endpoints with auto-scaling, allowing the platform to scale from zero to active instances based on demand
Optimised infrastructure using ML.G4.XLarge GPU instances with NVIDIA L4 GPUs at approximately $2 per hour
Rebuilt Transcribe-native capabilities using LLMs, including:
- Speaker separation
- PII detection and redaction
Integrated serverless components, using AWS Lambda for payload processing and S3/SQS for orchestration
Maintained existing workflows, preserving the customer’s four daily batch runs and S3-based ingestion
Delivered full CI/CD automation using AWS CodeBuild for repeatable, production-grade deployments
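
The scale-from-zero behaviour described above can be sketched as a pair of Application Auto Scaling requests. This is a minimal illustration, not Firemind's actual configuration: the endpoint name, variant name, capacity limits, backlog target and cooldowns are all hypothetical values. The function only builds the request parameters, so it runs without AWS credentials; the commented lines show where boto3 would consume them.

```python
def build_scaling_requests(endpoint_name, variant_name="AllTraffic",
                           max_instances=2, backlog_per_instance=5.0):
    """Build request parameters for scaling an async endpoint down to zero.

    All argument values are illustrative assumptions, not figures from the
    case study.
    """
    common = {
        "ServiceNamespace": "sagemaker",
        "ResourceId": f"endpoint/{endpoint_name}/variant/{variant_name}",
        "ScalableDimension": "sagemaker:variant:DesiredInstanceCount",
    }
    # MinCapacity=0 lets the endpoint release every instance when idle.
    target = {**common, "MinCapacity": 0, "MaxCapacity": max_instances}
    # Target tracking on the queue backlog per instance.
    policy = {
        **common,
        "PolicyName": f"{endpoint_name}-backlog-tracking",
        "PolicyType": "TargetTrackingScaling",
        "TargetTrackingScalingPolicyConfiguration": {
            "TargetValue": backlog_per_instance,
            "CustomizedMetricSpecification": {
                "MetricName": "ApproximateBacklogSizePerInstance",
                "Namespace": "AWS/SageMaker",
                "Dimensions": [
                    {"Name": "EndpointName", "Value": endpoint_name}
                ],
                "Statistic": "Average",
            },
            "ScaleInCooldown": 600,   # seconds, assumed
            "ScaleOutCooldown": 300,  # seconds, assumed
        },
    }
    return target, policy


# "whisper-async" is a hypothetical endpoint name.
target, policy = build_scaling_requests("whisper-async")

# With credentials configured, the requests would be sent like this:
# import boto3
# client = boto3.client("application-autoscaling")
# client.register_scalable_target(**target)
# client.put_scaling_policy(**policy)
```

A backlog-per-instance target on its own may not wake an endpoint that currently has zero instances. AWS documentation describes pairing it with a step-scaling policy driven by the HasBacklogWithoutCapacity metric, which is left out of this sketch.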

Results

The new AI operations platform delivered substantial and measurable business impact:

Monthly costs reduced from $2,000–$3,000 to $300–$600
Overall cost savings of 80–85%, equivalent to $1,400–$2,400 per month
Maintained transcription accuracy and feature parity with Amazon Transcribe
Continued full support for English and Welsh languages
Scalable, future-ready architecture with potential for an additional 40% cost reduction through further instance optimisation
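
The figures above can be cross-checked with a few lines of arithmetic. The sketch below uses only the published monthly ranges and the approximate $2 per hour instance price. The instance-hour estimate assumes, purely for illustration, that the whole new bill is GPU instance time, which the case study does not state.

```python
def savings(before, after):
    """Return (absolute monthly saving, percentage saving)."""
    saved = before - after
    return saved, 100 * saved / before


# Published monthly ranges, in US dollars.
before_low, before_high = 2000, 3000
after_low, after_high = 300, 600

# Pairing the low ends and the high ends gives the quoted percentages.
low_pair = savings(before_low, after_low)     # (1700, 85.0)
high_pair = savings(before_high, after_high)  # (2400, 80.0)

# Comparing both old bills with the $600 upper bound gives the quoted
# dollar range of $1,400 to $2,400.
conservative_low = savings(before_low, after_high)    # (1400, 70.0)
conservative_high = savings(before_high, after_high)  # (2400, 80.0)

# Upper bound on instance-hours per month, ASSUMING the entire new bill
# is GPU time at roughly $2 per hour.
hourly_rate = 2.0
gpu_hours = [cost / hourly_rate for cost in (after_low, after_high)]

for label, (saved, pct) in [("low ends", low_pair), ("high ends", high_pair)]:
    print(f"{label}: ${saved} saved per month ({pct:.0f}%)")
print(f"at most {gpu_hours[0]:.0f} to {gpu_hours[1]:.0f} instance-hours per month")
```

Under that assumption the new bill corresponds to at most 150 to 300 instance-hours per month, or roughly 5 to 10 hours per day spread across the four daily batch runs.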

CES achieved significant cost optimisation without compromising performance, compliance, or language coverage, demonstrating how AI operations on AWS can outperform fully managed services at scale.

