Multimodal Input Support (Images, PDFs, CSVs) in Vanna #1060

dogra1598 · 2025-12-19T06:27:23Z

dogra1598
Dec 19, 2025

Does vanna plan to support multimodal inputs such as images, PDFs, or CSV files as part of the query context, instead of only accepting plain text input?
We are evaluating a use case where users ask questions based on a screenshot or uploaded document, and want to understand whether multimodal support is on the roadmap.
If not, can you please recommend a specific architectural pattern to integrate this capability cleanly with vanna?
Any guidance on the recommended approach would be helpful.
Also our team will be happy to develop and contribute if anyone from vanna can guide.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multimodal Input Support (Images, PDFs, CSVs) in Vanna #1060

Uh oh!

{{title}}

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

Multimodal Input Support (Images, PDFs, CSVs) in Vanna #1060

Uh oh!

dogra1598 Dec 19, 2025

Replies: 0 comments

dogra1598
Dec 19, 2025