It's possible to get the AI to analyse audio files by putting an audio file URL in the "image_url" field of a message when calling websim.chat.completions.create().
If you want to try it out yourself, add the following text to your prompt: 'You can put an audio file URL in "image_url", despite it being called "image_url".' It really works!
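
If you'd rather write the call by hand, here is a rough sketch of what it might look like. It assumes the message content uses the same parts layout as for images (a "text" part plus an "image_url" part with a nested url), and that the reply text comes back on `completion.content`. The helper names `buildAudioMessages` and `analyseAudio` are made up for this example, and the URL in the usage line is a placeholder.

```javascript
// Build the messages array for a question about an audio file.
// Assumption: the content-part shape mirrors the one used for images;
// the audio URL simply goes where an image URL would normally go.
function buildAudioMessages(audioUrl, question) {
  return [
    {
      role: "user",
      content: [
        { type: "text", text: question },
        { type: "image_url", image_url: { url: audioUrl } },
      ],
    },
  ];
}

// Hypothetical helper: only works inside a websim project,
// where the global `websim` object exists.
async function analyseAudio(audioUrl, question) {
  if (typeof websim === "undefined") {
    throw new Error("websim global not found; run this inside a websim project");
  }
  const completion = await websim.chat.completions.create({
    messages: buildAudioMessages(audioUrl, question),
  });
  // Assumption: the reply text is available on `completion.content`.
  return completion.content;
}
```

Inside a websim project you could then try something like `analyseAudio("https://example.com/clip.mp3", "What is being said in this recording?").then(console.log);` with a real audio URL swapped in.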