Razer AIKit Now Sees, Hears, and Generates Images: What It Means for Project AVA
The SDK powering Project AVA just received its biggest update yet. Razer AIKit goes omni-modal — text, image, video, and audio — and adds support for ARM64 architectures. What used to be just chat can now see, listen, and create.

Razer AIKit is the free, open-source AI development toolkit that developers use to build Project AVA's avatars, behaviors, and capabilities. Until now, it worked primarily with text. With this update, the SDK takes a significant leap: it natively supports image, video, and audio models, and can run on a much wider range of hardware.
What Does "Omni-Modal" Actually Mean?
Until recently, AI models tended to specialize: text models processed text, image models processed images. An omni-modal system unifies those capabilities into a single consistent workflow. For Project AVA, this matters because the device already includes visual and audio sensing — the cameras and microphones that let AVA "see" your screen and "hear" you.
With omni-modal AIKit, ecosystem developers can now build avatars and behaviors that respond coherently to what AVA sees, hears, and reads — without having to manually wire together multiple separate models. Everything flows through one consistent toolkit.
Image Generation
AIKit can generate and manipulate images directly, without relying on external APIs. AVA Mini proved it: over 11,000 avatars generated in 4 days.
Audio Models
Native support for voice synthesis and recognition models. AVA can listen and respond with its own voice without additional integration layers.
Video Models
Ability to process and generate video sequences. Relevant for hologram animations and real-time visual responses.
ARM64 & Broader Hardware
AIKit now runs on ARM64 architectures, including NVIDIA DGX Spark platforms. Greater flexibility for both local development and cloud deployment.
The Real-World Proof: AVA Mini by the Numbers
Razer didn't present this update in the abstract. They already stress-tested it in production during the AVA Mini campaign (April 1–4, 2026), where the toolkit generated personalized pet avatars from user-uploaded photos.
11,000+
images generated
3.24s
average latency
15×
lower cost vs. standard cloud
The deployment ran on Akash Network's distributed GPU grid, bringing inference costs down to $0.01 per image — up to 15 times cheaper than standard cloud APIs. Razer was explicit: the campaign was also a real stress test of their AI infrastructure.
What This Means for the End User of Project AVA
As a device owner, you don't need to understand the SDK internals. What matters is what omni-modal AIKit makes possible in practice:
- More expressive avatars. The hologram can react to what it hears and sees in real time with greater coherence, because all models speak the same language.
- A larger developer ecosystem. By supporting ARM64 and reducing inference costs, more independent developers can build extensions and avatars for AVA.
- Today's experiments, tomorrow's features. AVA Mini was a prototype. Capabilities like generating personalized visual content in seconds have a clear path to the final product.

AIKit Is Free and Open Source
Razer maintains AIKit as an open-source project on GitHub, with active support from their engineering team. Any developer can download it, run it on their own hardware — no cloud subscription required — and contribute to the AVA ecosystem.
This open-source commitment is deliberate: Razer wants the ecosystem of avatars and behaviors for AVA to grow organically, much like what happened with mod-friendly gaming platforms. The technical foundation for that to happen just became considerably more powerful.
When Does Project AVA Launch?
The official Project AVA launch is scheduled for the second half of 2026. Check dates and available regions.