Vision API - pros and cons
Rating: 8 out of 10
IncentivizedUse Cases and Deployment Scope
we used Vision API to extract attribute values from medical devices images for internal purpose only. We have TBs of data to process in such and the response time was in within few seconds only. but for such a scale the processing time is coming out to be in months. then we had to introduce multiprocessing for speeding up.
Pros
- Object detection is good
- Get facial expressions across different faces from different part of the words
- Used OCR for different languages in india
Cons
- response Time is an issue
- Google API is not available across different platforms as an integrated feature
- Not able to add custom labels easily, the process is long for it
Likelihood to Recommend
Mostly Google Cloud Vision API is getting used for extracting values out of pdfs which are scanned images. We also tried using it in analyzing different images and their marketing campaign results based on different features it contains like color, objects, positioning, logos etc. Vision API could be used for in conjunction with LLM to provide a more detailed prompt for image generation.