
fix: Update Patents Understanding Notebook to use Batch Prediction #1609

Merged · 6 commits merged into main from patents-batch on Jan 9, 2025

Conversation

@holtskinner (Collaborator) commented on Jan 8, 2025

Description

Use Batch Prediction to save on latency/processing costs and load data directly to/from BigQuery.

NOTE: Will switch to Gemini 2.0 once it supports Batch Prediction, to take advantage of its higher-quality object detection capabilities.
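The flow the PR describes — writing per-document requests to BigQuery and submitting them as one Vertex AI batch job — can be sketched as follows. This is a hypothetical sketch, not the notebook's exact code: the project, dataset, and table names are placeholders, and the commented-out submission uses the `vertexai.batch_prediction` module under the assumption it matches the notebook's SDK version.

```python
import json

# A Gemini batch prediction input table needs a "request" column holding a
# GenerateContentRequest serialized as JSON (one row per document).
def build_batch_request(prompt: str, temperature: float = 0.0) -> str:
    request = {
        "contents": [{"role": "user", "parts": [{"text": prompt}]}],
        "generationConfig": {"temperature": temperature},
    }
    return json.dumps(request)

# Submitting the job (requires GCP credentials; all names below are
# placeholders, not the notebook's actual resources):
# import vertexai
# from vertexai.batch_prediction import BatchPredictionJob
#
# vertexai.init(project="my-project", location="us-central1")
# job = BatchPredictionJob.submit(
#     source_model="gemini-1.5-flash-002",
#     input_dataset="bq://my-project.my_dataset.batch_requests",
#     output_uri_prefix="bq://my-project.my_dataset",
# )
```

Keeping requests in BigQuery means the same table can feed the batch job directly, with no intermediate GCS files.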


@code-review-assist bot (Contributor) left a comment

Hi @holtskinner and reviewers,

I'm currently reviewing this pull request and will post my feedback in a few minutes. In the meantime, I'm providing this summary of the pull request's content to help you and other reviewers quickly understand the changes and intent.

This pull request, titled "fix: Update Patents Understanding Notebook to use Batch Prediction", aims to improve the efficiency and cost-effectiveness of the patents understanding notebook by switching from per-document predictions to batch prediction with Vertex AI. The description notes that the notebook will move to Gemini 2.0 once it supports batch prediction, to take advantage of its higher-quality object detection.

Here's a summary of the changes:

  • gemini/use-cases/document-processing/patents_understanding.ipynb: This Jupyter Notebook has undergone several modifications:
    • Lines 103-106: Added hyperlinks to Michael Munn's and Michael Sherman's GitHub profiles in the overview section.
    • Lines 114-117: Updated the description to explicitly mention using "Batch Prediction for Gemini 2.0 Flash in Vertex AI" and provided a link to the relevant documentation.
    • Lines 142, 202: Execution counts were updated (1 -> null, null -> 3).
    • Lines 227-243: Removed unnecessary imports (import json, from google.genai.types import GenerateContentConfig, Part, import pandas as pd).
    • Line 288: Changed the model ID from gemini-2.0-flash-exp to gemini-1.5-flash-002.
    • Lines 317-323: Added a note indicating that the BigQuery query is limited to 5 documents for tutorial purposes.
    • Lines 342-347: Added LIMIT 5 to the BigQuery query to reduce processing time.
    • Line 363: Execution count updated (102 -> 43).
    • Lines 478-827: The notebook was significantly restructured to incorporate batch prediction. This involved creating a BigQuery dataset and table for batch prediction requests, creating JSON requests for batch processing, submitting the batch job to Vertex AI, polling for job completion, and loading and processing the results. The original individual prediction processing was removed.
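The batch-prediction steps summarized above (submit, poll for completion, load and process results) can be sketched as below. This is an illustrative sketch rather than the notebook's code: the polling attributes (`has_ended`, `refresh`, `has_succeeded`) are assumed from the `vertexai` SDK, and the result-parsing helper assumes each output row's `response` column holds a `GenerateContentResponse` serialized as JSON.

```python
import json

def extract_text(response_json: str) -> str:
    """Pull the generated text out of one batch output row's "response" column."""
    response = json.loads(response_json)
    parts = response["candidates"][0]["content"]["parts"]
    return "".join(part.get("text", "") for part in parts)

# Polling sketch (attribute names assumed; requires a submitted `job`):
# import time
# while not job.has_ended:
#     time.sleep(30)
#     job.refresh()
# assert job.has_succeeded

# A minimal response row, shaped like the batch output:
sample = json.dumps(
    {"candidates": [{"content": {"parts": [{"text": "US1234567: a widget."}]}}]}
)
print(extract_text(sample))  # -> US1234567: a widget.
```

Writing the outputs back to BigQuery lets the notebook read results with an ordinary query instead of iterating over per-request responses.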

I hope this helps!


Here's a little haiku to brighten your day:

Batch jobs hum along,
Data flows, a silent stream,
Efficiency reigns.


Gemini encountered an error creating the review. You can try again by commenting @code-review-assist review.

@holtskinner marked this pull request as ready for review on January 9, 2025, 20:30
@holtskinner requested a review from a team as a code owner on January 9, 2025, 20:30
@holtskinner merged commit 86220f1 into main on Jan 9, 2025 (10 checks passed)
@holtskinner deleted the patents-batch branch on January 9, 2025, 20:30