fix(genai): resolve single aggregated embedding bug for gemini-embedding models by saitejabandaru-in · Pull Request #1817 · langchain-ai/langchain-google

Sai Teja Bandaru (saitejabandaru-in) · 2026-05-29T18:02:43Z

Description

This Pull Request resolves a critical integration issue (#37728) where calling GoogleGenerativeAIEmbeddings.embed_documents (or aembed_documents) with gemini-embedding-2 always returns a list of exactly 1 vector, regardless of how many documents are passed.

Root Cause

Unlike traditional text embeddings (e.g., text-embedding-004), multimodal gemini-embedding models (such as gemini-embedding-2) treat list inputs in standard embed_content calls as parts of a single aggregated multimodal document (designed for cross-modal retrieval, combining text/images/video/etc.). Consequently, they return exactly one merged vector for the entire batch.

Solution

Parallelized Individual Embedding: Checks if the target model is a gemini-embedding model. If so, it embeds each document individually in parallel to prevent aggregation:
- Synchronous path uses a standard ThreadPoolExecutor for concurrent network requests.
- Asynchronous path uses asyncio.gather for non-blocking concurrent awaits.
Backward Compatibility: Retains standard sequential/prepared batching for non-Gemini models (like text-embedding-004) to maximize network efficiency for those models.
Comprehensive Unit Tests:
- Updates legacy tests to use text-embedding-004 to preserve test coverage of the traditional batching logic.
- Adds new test_embed_documents_gemini_embedding_2 and test_aembed_documents_gemini_embedding_2 unit tests targeting gemini-embedding-2-preview to verify correct multi-call dispatching and output reconstruction.

…ing models Unlike traditional text embeddings (e.g. text-embedding-004), multimodal gemini-embedding models (such as gemini-embedding-2) treat list inputs in embed_content as parts of a single aggregated multimodal document, returning exactly one vector regardless of how many strings are passed. This change checks if the target model is a gemini-embedding model, and if so, runs individual embeds in parallel using a ThreadPoolExecutor (sync path) and asyncio.gather (async path) to correctly return a distinct embedding for each document in the input list, aligning with the LangChain Embeddings interface spec.

Changes standard unit test MODEL_NAME to text-embedding-004 to maintain coverage for standard list batching. Adds dedicated sync and async tests targeting gemini-embedding-2-preview to verify the new parallel ThreadPoolExecutor and asyncio.gather execution paths and ensure regression safety.

…ing models Unlike traditional text embeddings (e.g. text-embedding-004), multimodal gemini-embedding models (such as gemini-embedding-2) treat list inputs in embed_content as parts of a single aggregated multimodal document, returning exactly one vector regardless of how many strings are passed. This change checks if the target model is a gemini-embedding model, and if so, runs individual embeds in parallel using a ThreadPoolExecutor (sync path) and asyncio.gather (async path) to correctly return a distinct embedding for each document in the input list, aligning with the LangChain Embeddings interface spec.

Changes standard unit test MODEL_NAME to text-embedding-004 to maintain coverage for standard list batching. Adds dedicated sync and async tests targeting gemini-embedding-2-preview to verify the new parallel ThreadPoolExecutor and asyncio.gather execution paths and ensure regression safety.

…ing models Unlike traditional text embeddings (e.g. text-embedding-004), multimodal gemini-embedding models (such as gemini-embedding-2) treat list inputs in embed_content as parts of a single aggregated multimodal document, returning exactly one vector regardless of how many strings are passed. This change checks if the target model is a gemini-embedding model, and if so, runs individual embeds in parallel using a ThreadPoolExecutor (sync path) and asyncio.gather (async path) to correctly return a distinct embedding for each document in the input list, aligning with the LangChain Embeddings interface spec.

Changes standard unit test MODEL_NAME to text-embedding-004 to maintain coverage for standard list batching. Adds dedicated sync and async tests targeting gemini-embedding-2-preview to verify the new parallel ThreadPoolExecutor and asyncio.gather execution paths and ensure regression safety.

Sai Teja Bandaru (saitejabandaru-in) added 6 commits May 29, 2026 20:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(genai): resolve single aggregated embedding bug for gemini-embedding models#1817

fix(genai): resolve single aggregated embedding bug for gemini-embedding models#1817
Sai Teja Bandaru (saitejabandaru-in) wants to merge 6 commits into
langchain-ai:mainfrom
saitejabandaru-in:feature-fix-gemini-embedding-batch

Sai Teja Bandaru (saitejabandaru-in) commented May 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Sai Teja Bandaru (saitejabandaru-in) commented May 29, 2026

Description

Root Cause

Solution

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant