Connect Google Cloud Vision to your AI agent

AI Tools 29 actions available

Google Cloud Vision API enables developers to integrate vision detection features into applications, including image labeling, face and landmark detection, optical character recognition (OCR), and explicit content tagging.

We set up the connection using your own Google Cloud Vision account, with keys you control, and keep it running. Your agent picks it up and starts doing the work.

What your agent can do in Google Cloud Vision

Each one is a real action the agent can take on its own, the same things a person clicking around Google Cloud Vision could do. Read-only by default; write actions are confirmed against your policy.

  • Annotate Files with Vision API Tool to perform image detection and annotation for batch files in Google Cloud Vision. Supports PDF, TIFF, and GIF files. Extracts up to 5 frames (GIF) or pages (PDF/TIFF) from each file and performs detection for each…
  • Async Batch Annotate Files Tool to run asynchronous image detection and annotation for a list of generic files (PDF, TIFF, GIF). Use when processing multi-page documents that may contain multiple images per page. Results are written to Google Clo…
  • Annotate Images Run image detection and annotation for a batch of images using Google Cloud Vision API. Performs various types of image analysis including face detection, landmark detection, logo detection, label detection, text detect…
  • Annotate Images Async Batch Tool to run asynchronous image detection and annotation for a batch of images. Use when processing multiple images or large images that require longer processing time. Results are written to Google Cloud Storage as JSON…
  • Annotate Location Images Tool to run image detection and annotation for a batch of images scoped to a specific project and location. Performs various types of image analysis including label detection, face detection, landmark detection, logo de…
  • Create Vision Product Creates a new Product resource in Google Cloud Vision Product Search. A Product represents a physical item that can be visually searched using reference images. After creating a product, you can add reference images to…
  • Create Product Set Creates a new ProductSet resource in Google Cloud Vision Product Search. A ProductSet is a container for grouping related products together for visual search. After creating a product set, you can add products to it usi…
  • Create ReferenceImage Tool to create a ReferenceImage under a product. Use when adding a new image to a product for detection.
  • Delete Product Permanently deletes a Product and its associated reference images from Google Cloud Vision API. This is a destructive operation that cannot be undone. The product metadata and all images are deleted immediately, though…
  • Get Product Tool to get information associated with a Product. Use when you have the product resource name and need its details.
  • Get Product Set Tool to get a ProductSet. Use when you need metadata details of an existing ProductSet by its full resource name. Use after obtaining the resource name.
  • Import Product Sets Asynchronously imports product sets and reference images from a CSV file stored in Google Cloud Storage. This bulk import operation creates ProductSets, Products, and ReferenceImages from a properly formatted CSV file.…
  • List Vision AI IndexEndpoints Lists IndexEndpoints in Vertex AI Vision for a given project and location. IndexEndpoints are deployed instances of image indexes used for visual search and retrieval in Vision AI's media warehouse. Use this tool to dis…
  • List Locations Tool to list available Vision AI service locations for a project. Use when you need to discover supported regions before making region-specific API calls.
  • List Vision API Operations Tool to list operations that match the specified filter. Use when you need to retrieve all operations under a specific project and location.
  • Purge Products Tool to asynchronously delete products in a ProductSet or orphan products. Use when you need to clean up products at scale; ensure `force` is true to execute.
  • Update Product Tool to update a Product's mutable fields: displayName, description, and productLabels. Use after confirming the product resource name.
  • Update Product Set Tool to update a ProductSet resource. Use when you need to modify the displayName of an existing ProductSet.
  • Add Product to ProductSet Add a Product to a ProductSet in Google Cloud Vision Product Search. This action associates a Product with a ProductSet, enabling the product to be included in product search queries against that set. Both resources mus…
  • Cancel Vision Operation Starts asynchronous cancellation of a long-running Vision API operation. Returns an empty response on successful cancellation request. Note that the server makes a best effort to cancel the operation, but success is not…
  • Delete Vision API Operation Tool to delete a long-running Vision API operation. Use after confirming the operation name.
  • Delete Product Set Tool to permanently delete a ProductSet. Use after confirming the ProductSet's resource name.
  • Delete Reference Image Permanently removes a reference image from a product in Google Cloud Vision Product Search. This action deletes the reference image association from the specified product. The image will be marked for deletion and remov…
  • Get Vision API Operation Retrieves the latest state of a long-running Vision API operation. Use this to poll the status of asynchronous operations like importProductSets or purgeProducts. The operation name is returned when you start an async o…
  • Get Reference Image Tool to get information associated with a ReferenceImage. Use when you have the full resource name and need its metadata.
  • List Products in ProductSet Tool to list Products in a specified ProductSet. Use when you need to retrieve Products associated with a ProductSet after confirming it exists, with optional pagination.
  • List Projects List Google Cloud projects accessible to the authenticated user via Cloud Resource Manager API. This action queries the Cloud Resource Manager API (not Vision API directly) to enumerate projects. It requires OAuth 2.0 a…
  • List Reference Images Tool to list reference images for a product. Use when you need to retrieve stored reference images under a specified product resource name, with optional pagination.
  • Remove Product from ProductSet Removes a Product from a specified ProductSet in Google Cloud Vision API. This operation unlinks a product from a product set but does not delete either resource. Both the product and product set must exist in the same…

How we connect it

  1. 1

    Connect your account

    You create a key in Google Cloud Vision, a key you create and control, and paste it in once. It lives in a secrets store on your server, not with us.

  2. 2

    Set the guardrails

    Read-only by default. You choose which write actions the agent may take, and anything outside that policy gets confirmed with you first.

  3. 3

    We keep it running

    Health checks on every connection, updates handled for you, and we watch the first week of activity to make sure the work lands.

Google Cloud Vision questions, answered.

With a key you create and control. You paste it in once, it is stored in a secrets store on your server, permissions are scoped to the minimum the agent needs, and you can revoke it at any time.
The actions Google Cloud Vision's API allows, the same things a person clicking around the app could do. Connections start read-only by default; write actions are confirmed against the policy you set before the agent takes them.
Connections are priced per tool on top of the base plan. Some are included, some are premium. See pricing for how connection charges work.
Standard tools are ready inside 7 business days of the setup call. We test the connection end to end, walk you through how the agent uses it, and watch the first week of activity.

Ready to put Google Cloud Vision to work?

Tell us what your team runs on. We set up the connection, secure it, and your agent takes it from there.

All product names, logos, and brands are property of their respective owners; used for identification only. ZeroToClaw is not affiliated with or endorsed by Google Cloud Vision.