From Manual Metadata to Intelligent Video Search
Turning 2.5 million videos into searchable, time-addressable assets with a cost-efficient hybrid AI pipeline.
A video discovery platform with millions of high-value assets
Kurator is a video licensing and discovery platform within Nimia, serving major media and entertainment buyers with high-value archival and broadcast footage — including news, interviews, and historical content.
The platform's core value is helping customers find the right moment inside long video assets, then enabling easy purchase with confidence in rights management. But at scale, that promise depended entirely on metadata quality.
Search and tagging had become a serious bottleneck
With millions of videos in the catalog, finding and tagging content had become a serious bottleneck. Several cloud-first and vendor-based approaches were explored but rejected: their cost, accuracy, and data-transfer trade-offs made them impractical at Kurator's scale.
Manual tagging didn't scale
Teams spent hours per batch entering transcripts, metadata, keywords, and compliance flags — often with inconsistent results across the catalog.
A hybrid AI video intelligence pipeline built for scale
AccelOne designed and built a multi-model hybrid execution architecture that balances performance with economics — running heavy inference on-premises while using cloud services selectively and only when necessary.
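One way to picture "cloud selectively and only when necessary" is a confidence-gated router: every item goes through the on-premises model first, and only low-confidence results are escalated to a cloud service. The sketch below is purely illustrative. The `Result` type, the `route` function, the 0.8 threshold, and the stub models are all hypothetical, and the source does not say that confidence is the criterion AccelOne actually uses.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class Result:
    label: str
    confidence: float  # model's own score in [0, 1]
    source: str        # "local" or "cloud"


Model = Callable[[str], Result]


def route(item: str, local_model: Model, cloud_model: Model,
          min_confidence: float = 0.8) -> Result:
    """Run the on-prem model first; escalate to cloud only when unsure.

    The threshold is a hypothetical tuning knob, not a documented value.
    """
    local = local_model(item)
    if local.confidence >= min_confidence:
        return local            # no cloud call, no per-request fee
    return cloud_model(item)    # paid fallback for the hard cases


# Stub models standing in for real inference (assumptions for the demo).
def fake_local(item: str) -> Result:
    score = 0.95 if "clear" in item else 0.40
    return Result(label="interview", confidence=score, source="local")


def fake_cloud(item: str) -> Result:
    return Result(label="interview", confidence=0.90, source="cloud")


easy = route("clear_audio_clip", fake_local, fake_cloud)
hard = route("noisy_clip", fake_local, fake_cloud)
```

Under this kind of policy, cloud spend grows with the share of hard items rather than with the size of the catalog.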
Real outcomes, measurable impact
Manual tagging is nearly eliminated, results are reliable enough for day-to-day production use, and the cost is dramatically lower than any traditional cloud-based approach.
Compared to cloud-only or vendor pipelines. Throughput scales by adding low-cost GPU machines at $2.5K–$3K each — shifting video intelligence from a capital project into a repeatable operational capability.
Transcription accuracy consistently exceeds 95% in spot-checked samples, approaching human-level performance under good audio conditions. The transcripts power keyword search, time-based navigation, and downstream metadata extraction.
Manual metadata entry reduced from hours to minutes per batch. Teams perform a quick spot-check and add only information requiring human judgment, freeing them to focus on quality.
In specific high-volume processing paths, including a 50× reduction in external API calls to AWS Rekognition through mosaic batching.
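Mosaic batching can be pictured as tiling many frames into one grid image, sending that single image to the detection API, and then mapping each returned bounding box back to the frame it came from. The sketch below covers only the geometry and is an illustration under assumptions, not AccelOne's implementation. The `Box` type, the function names, and the row-major, roughly square grid are hypothetical. It assumes boxes arrive as ratios of the overall image size, which is the convention Rekognition uses for its `BoundingBox` fields. The number of tiles per mosaic is a free parameter, since the source does not state the value used.

```python
import math
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class Box:
    # All fields are ratios of the image size, in [0, 1].
    left: float
    top: float
    width: float
    height: float


def grid_shape(n_frames: int) -> Tuple[int, int]:
    """Rows and columns of a roughly square grid holding n_frames tiles."""
    cols = math.ceil(math.sqrt(n_frames))
    rows = math.ceil(n_frames / cols)
    return rows, cols


def locate(box: Box, n_frames: int) -> Optional[Tuple[int, Box]]:
    """Map a box on the mosaic to (frame index, box relative to that frame).

    The box is assigned to the tile containing its center. Returns None
    if the center falls on an empty padding tile.
    """
    rows, cols = grid_shape(n_frames)
    center_x = box.left + box.width / 2
    center_y = box.top + box.height / 2
    col = min(int(center_x * cols), cols - 1)
    row = min(int(center_y * rows), rows - 1)
    index = row * cols + col  # row-major tile order
    if index >= n_frames:
        return None
    local = Box(
        left=box.left * cols - col,
        top=box.top * rows - row,
        width=box.width * cols,
        height=box.height * rows,
    )
    return index, local


def calls_needed(n_frames: int, tiles_per_mosaic: int) -> int:
    """API calls required when each call carries one mosaic."""
    return math.ceil(n_frames / tiles_per_mosaic)


# A detection covering the top-right tile of a 2x2 mosaic of four frames.
hit = locate(Box(left=0.5, top=0.0, width=0.25, height=0.5), n_frames=4)
```

With a hypothetical 50 tiles per mosaic, `calls_needed(1000, 50)` is 20 instead of 1,000 one-frame calls. That matches the ratio of the reported 50× reduction, though whether the figure was reached with that tile count is an assumption. The trade-off is resolution: each frame occupies a smaller share of the image, so small objects may be harder to detect.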