- Omni, O, has multimodal capabitlies, which means it can take text, voice or video as an input and serve audio/text/image output (there's no video output).
- It can respond to audio inputs in as little as 232 milliseconds, with an average of 320 milliseconds, which is similar to human response time(opens in a new window) in a conversation.
- It is faster and cheaper.
- Has multilingual capabilities.
Use Cases
- Comics / visual narration / : Sally the mailwoman
- You can create an image, upload that image with a name given to character (as an attachment), and then go on to storyboard the work.
No comments:
Post a Comment