Azure Cognitive Services provides pre-trained AI capabilities accessible through REST APIs. This building-block model lets application developers integrate vision, speech, language, and decision AI without machine learning expertise.
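As a rough illustration of what "accessible through REST APIs" means in practice, the sketch below builds, but does not send, a sentiment analysis request using only the Python standard library. The route (/text/analytics/v3.1/sentiment), the example endpoint, and the helper name build_sentiment_request are assumptions made for illustration; check the API reference for your own resource and API version before relying on them.

```python
import json
import urllib.request


def build_sentiment_request(endpoint: str, key: str, text: str) -> urllib.request.Request:
    """Build (but do not send) a POST request for a sentiment analysis call."""
    # Assumed route for Text Analytics v3.1; verify against your resource.
    url = endpoint.rstrip("/") + "/text/analytics/v3.1/sentiment"
    body = {"documents": [{"id": "1", "language": "en", "text": text}]}
    return urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={
            # Resource key header used by Cognitive Services REST calls.
            "Ocp-Apim-Subscription-Key": key,
            "Content-Type": "application/json",
        },
        method="POST",
    )


request = build_sentiment_request(
    "https://example-resource.cognitiveservices.azure.com",  # placeholder endpoint
    "<your-key>",  # placeholder key
    "The onboarding process was quick and painless.",
)
# To actually call the service: urllib.request.urlopen(request)
```

The point of the sketch is that the integration surface is an HTTP request with a key and a JSON body; no model training or ML tooling appears anywhere in the client code.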
The services landscape
Azure Cognitive Services covers five domains: Vision (Computer Vision, Face API, Form Recognizer), Speech (Speech-to-Text, Text-to-Speech, Speech Translation), Language (Text Analytics, LUIS, QnA Maker), Decision (Anomaly Detector, Personalizer, Content Moderator), and Applied AI Services (scenario-specific services such as document and form processing). The API surface grew significantly through 2020 and 2021, with new capabilities in each domain.
Form Recognizer for document processing
Azure Form Recognizer provides AI-powered document understanding: it extracts structured data from invoices, receipts, ID documents, tax forms, and custom document layouts. For back-office automation use cases (processing incoming invoices, extracting data from scanned forms, automating document classification), its prebuilt models provide production-ready capability without training a model. For documents the prebuilt models do not cover, the custom model capability allows fine-tuning on domain-specific documents.
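To make "extracting structured data" concrete, here is a minimal sketch of flattening an analysis result into name, text, and confidence values. The response shape (analyzeResult, documentResults, fields) is an assumption modeled on the v2.1 REST API and may differ in other versions; the sample payload is hand-written for illustration and is not real service output. The helper name extract_fields is hypothetical.

```python
from typing import Any, Dict, Tuple


def extract_fields(analyze_response: Dict[str, Any]) -> Dict[str, Tuple[str, float]]:
    """Map each extracted field name to (text, confidence) for the first document."""
    # Assumed v2.1-style layout: analyzeResult -> documentResults -> fields.
    documents = analyze_response.get("analyzeResult", {}).get("documentResults", [])
    if not documents:
        return {}
    fields = documents[0].get("fields", {})
    return {
        name: (field.get("text", ""), float(field.get("confidence", 0.0)))
        for name, field in fields.items()
        if field is not None  # skip fields the service could not find
    }


# Hand-written sample for illustration only; not real service output.
sample_response = {
    "status": "succeeded",
    "analyzeResult": {
        "documentResults": [
            {
                "docType": "prebuilt:invoice",
                "fields": {
                    "VendorName": {"type": "string", "text": "Contoso Ltd", "confidence": 0.97},
                    "InvoiceTotal": {"type": "number", "text": "1,250.00", "confidence": 0.62},
                    "PurchaseOrder": None,
                },
            }
        ]
    },
}

invoice_fields = extract_fields(sample_response)
```

Keeping the per-field confidence alongside the extracted text matters for back-office automation: a downstream step can accept high-confidence fields automatically and queue the rest for manual checking.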
The Azure OpenAI Service preview
Azure OpenAI Service entered private preview in 2021, providing enterprise access to GPT-3 models through the Azure API with Azure's compliance, security, and data residency guarantees. The private preview was intentionally limited while Microsoft assessed the responsible AI implications. For enterprises that need GPT-3 capability but cannot use the public OpenAI API due to data governance requirements, Azure OpenAI Service was the path.
Responsible AI and model limitations
Microsoft's Responsible AI Standard shapes how Cognitive Services capabilities are made available. Face recognition capabilities have usage restrictions. Content moderation requires human review integration. The principle is that AI capabilities with meaningful harm potential require usage documentation and accountability mechanisms. The engineering implication: when integrating Cognitive Services for high-stakes decisions (identity verification, content moderation), build human review workflows alongside the AI.
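One common way to build a human review workflow alongside the AI is to route low-confidence predictions to a reviewer queue instead of acting on them automatically. The sketch below is a generic pattern, not a Cognitive Services API: the Prediction type, the route_predictions helper, and the 0.9 threshold are all hypothetical and would need tuning for a real workload.

```python
from dataclasses import dataclass
from typing import Iterable, List, Tuple


@dataclass
class Prediction:
    item_id: str       # identifier of the document, image, or message
    label: str         # what the service predicted
    confidence: float  # confidence score in the range 0.0 to 1.0


def route_predictions(
    predictions: Iterable[Prediction], threshold: float = 0.9
) -> Tuple[List[Prediction], List[Prediction]]:
    """Split predictions into (auto_accepted, needs_human_review)."""
    auto_accepted: List[Prediction] = []
    needs_review: List[Prediction] = []
    for prediction in predictions:
        if prediction.confidence >= threshold:
            auto_accepted.append(prediction)
        else:
            needs_review.append(prediction)
    return auto_accepted, needs_review


# Illustrative values only.
batch = [
    Prediction("doc-1", "id_verified", 0.98),
    Prediction("doc-2", "id_verified", 0.71),
    Prediction("msg-7", "offensive", 0.55),
]
accepted, review_queue = route_predictions(batch)
```

For the highest-stakes decisions, a stricter variant would send every positive result to a reviewer regardless of confidence and use the threshold only to prioritise the queue.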