Vision API - pros and cons

Rating: 8 out of 10

May 1, 2026

Use Cases and Deployment Scope

we used Vision API to extract attribute values from medical devices images for internal purpose only. We have TBs of data to process in such and the response time was in within few seconds only. but for such a scale the processing time is coming out to be in months. then we had to introduce multiprocessing for speeding up.

Pros

Object detection is good
Get facial expressions across different faces from different part of the words
Used OCR for different languages in india

Cons

response Time is an issue
Google API is not available across different platforms as an integrated feature
Not able to add custom labels easily, the process is long for it

Likelihood to Recommend

Mostly Google Cloud Vision API is getting used for extracting values out of pdfs which are scanned images. We also tried using it in analyzing different images and their marketing campaign results based on different features it contains like color, objects, positioning, logos etc. Vision API could be used for in conjunction with LLM to provide a more detailed prompt for image generation.

Verified User

Consultant in Engineering (10,001+ employees)

Vetted Review

3 years of experience

Google Cloud Vision API Reviews & Insights