Meet the Omniagent API
The foundational layer powering multimodal coworkers across every surface your customer touches. Create an Omniagent โ an AI agent with a persistent identity, voice, and visual appearance that carries its knowledge, memory, and actions across website, apps, phone, and beyond. One API, one agent, every channel.
Add an AI agent to any web app in a day.
Edge MCP runs inside your web app and gives your agent the eyes and hands it needs to operate the site on the user's behalf โ along with the context of what the user is doing.
A second system, built and maintained by hand.
- Copied product knowledge to keep up to date
- Hand-wrapped tools for every action
- Manual sync that breaks on every release
- Weeks of engineering to build and maintain
One bridge, always in sync with the real app.
- The agent knows the user's context โ which page, what they're doing
- The agent takes real actions โ search products, book an appointment, answer questions
- One developer, one day
Edge MCP is in private preview. Reach out to get access.
Request accessOr build it yourself
Follow the path. Each step links to the in-depth resource you need.
Get an API key
Create a resource in the Azure Portal to set up your subscription, then generate an API key from the dashboard. Once you have a key, see Authentication for details on how to authenticate API calls.
Build
Pick a companion, add knowledge and tools, configure voice and settings โ then create your Omniagent.
Deploy
Choose a channel โ WebRTC, WebSocket, VoIP, or SIP โ and go live. Same agent, every surface.
Monitor
Track sessions, review transcripts, and refine your agent over time.
Explore by topic
Identity
Define your agent's role, persona, voice, and visual appearance. Consistent across every session and channel.
Knowledge
Give your agent domain expertise with files, FAQs, and structured data.
Memory
Your agent remembers who it's talking to โ across sessions and channels.
Actions
Connect your agent to real systems through tools and functions.
WebRTC
Browser-based voice and video via the Web SDK.
WebSockets
Audio-only streaming for apps and custom integrations.
VoIP
Put your agent on any phone number in minutes. No trunk credentials โ just a webhook URL for Twilio, Telnyx, Vonage, or any VoIP provider.
SIP
Answer phone calls with your own SIP trunk. Human handoff, custom routing, and enterprise telephony integration.
Capabilities
| Capability | Details |
|---|---|
| Real-time video avatars | 2D avatars with lip sync and natural expressions |
| Persistent identity and memory | Agents maintain a consistent persona and remember users across sessions and channels |
| Tool integration | Connect agents to real systems through custom functions to trigger workflows, generate content, and more |
| Multiple providers | Azure OpenAI (default) or HUMAIN's Allam for Arabic-language agents. Bring your own deployment or use a Napster-managed one |
| Web SDK | Works with React, Vue, Angular, or vanilla JS; mount a companion in any DOM container |
| Low latency | ~300ms response times |
| Multilingual | ~30 languages supported |
| 2,000+ companions | Pre-built expert agents, or wire your own knowledge base and tools |
| Built-in safety | Content moderation, crisis detection, topic restrictions, and auditability are always active |
Compliance and security
- SOC 2 Type II โ report available under NDA
- GDPR & CCPA โ US and EU data protection
- Encryption โ AES + TLS 1.2+ at rest and in transit
- Pen testing โ annual third-party security audits
- Data residency โ regional options for enterprise
- Tenant isolation โ your data stays in your environment; no cross-customer data mixing, no training on your data