Tuesday, May 14, 2024

Full capabilities of ChatGPT 4 O (O for Omni) - From Openai.com

  • Omni, O, has multimodal capabitlies, which means it can take text, voice or video as an input and serve audio/text/image output (there's no video output). 
  • It can respond to audio inputs in as little as 232 milliseconds, with an average of 320 milliseconds, which is similar to human response time(opens in a new window) in a conversation. 
  • It is faster and cheaper.
  • Has multilingual capabilities.
Use Cases

  • Comics / visual narration / : Sally the mailwoman
    • You can create an image, upload that image with a name given to character (as an attachment), and then go on to storyboard the work. 







  • Movie poster creation




  • Character Design





  • Poetic typography


  • Coin design

  • Photo to caricature

  • Font design

  • 3D object synthesis

  • Brand placement


  • Meeting notes


  • Lecture summarizing




No comments:

Post a Comment

DSPM, Data Security Posture Management, Data Observability

DATA SECURITY POSTURE MANAGEMENT DSPM, or Data Security Posture Management, is a practice that involves assessing and managing the security ...