Extract Image Text with Cloud Vision AI
Made by Google Cloud
Extracts text from images and saves to Firestore using Cloud Vision API.
- Works with
- Cloud Storage and Cloud Firestore
- Version
- 0.1.5 | Source code
- Tags
- ai, image, text-extraction, cloud-vision, image-recognition, image-classification, text-recognition, character-recognition, ocr, google-ai, cloud-vision-api
- License
- Apache-2.0
- Publisher
- Google Cloud
- Report
- Bug
- Abuse
How this extension works
This extension extracts text from jpg or png images uploaded to Cloud Storage and writes the extracted text to Firestore, using the Cloud Vision API.
On install, you will be asked to provide a Cloud Storage bucket where files will be uploaded, and a Firestore collection to write extracted text back to.
When a JPG or PNG file is uploaded to the designated bucket, the extension automatically triggers a Cloud Function. This function then employs the Google Cloud Vision API to process the image, effectively extracting any discernible text. Following this extraction process, the resulting text is saved in a specific Firestore collection. In this collection, each document will have a field named ‘file’ whose value matches the full storage path of the uploaded image.
Use Cases
- Optical character recognition (OCR) for scanned documents: Extract and store text from scanned documents to make them searchable and accessible in your application.
- Reading text from images in a social media app: Automatically extract and analyze text from user images to identify trends or inappropriate content.
- Extracting menu items from restaurant images: Create a food ordering app that allows users to upload images of menus and automatically extracts and displays the menu items in a structured format.
Including Cloud Storage paths
The extension provides a include path list
parameter. Setting this parameter will restrict the extension to only extract text from images in specific locations in your Storage bucket.
If this parameter is not set, then images uploaded to all paths from the given bucket will trigger text extraction.
For example, specifying the paths /users/pictures,/restaurants/menuItems
will extract text from any images found in any subdirectories of /users/pictures
and /restaurants/menuItems
. You may also use wildcard notation for directories in the path.
Excluding Cloud Storage paths
Alternatively, the extension also provides a exclude path list
parameter. This parameter is a list of absolute paths not included for extract text from images.
Setting is will ensure the extension does not extract text from images in the specific locations.
For example, to exclude the images stored in the /foo/alpha
and its subdirectories and /bar/beta
and its subdirectories, specify the paths /foo/alpha,/bar/beta
.
Detail
The extension also allows you to control whether you would like to store just extracted text, or all returned metadata from Cloud Vision AI. If set to “basic” the extension will simply write the extracted text as a string field. If set to “full” then the full information returned from Cloud Vision AI will be written to the Firestore document.
Additional Setup
Ensure you have a Cloud Firestore database and Cloud Storage bucket set up in your Firebase project.
Billing
This extension uses other Firebase or Google Cloud Platform services which may have associated charges:
- Cloud Vision AI
- Cloud Storage
- Cloud Firestore
- Cloud Functions (See FAQs)
When you use Firebase Extensions, you’re only charged for the underlying resources that you use. A paid-tier (Blaze) billing plan is required because the extension uses Cloud Vision API.