This pull request introduces a GPU-accelerated embedding microservice and a fast QA retrieval engine, enabling efficient semantic search for question answering. The main changes include adding a Dockerized FastAPI service for generating embeddings using a SentenceTransformer model on NVIDIA GPUs, and a Python module that leverages this service alongside a FAISS index for high-speed QA retrieval.
Embedding microservice (Dockerized FastAPI application):
- `embedding_server.py`, a FastAPI server that loads the `intfloat/e5-small-v2` SentenceTransformer model onto the GPU, exposes `/embed`, `/embed_fast`, `/embed_batch`, and `/health` endpoints, and supports both JSON and base64-encoded embedding responses for optimal performance. A hedged sketch of this shape follows the list below.
- `Dockerfile` that builds the embedding service on top of NVIDIA's PyTorch container, installs dependencies, downloads the model, and sets up the server to run on port 8100.
- `docker-compose.yml` to orchestrate the embedding service with GPU support, ensuring it runs with the NVIDIA runtime and exposes the necessary port. A compose sketch also follows below.
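For orientation, here is a minimal sketch of what a server with this shape might look like. The request/response field names (`texts`, `data`, `dtype`, `shape`) are assumptions for illustration, not the PR's exact implementation, and the `/embed_batch` endpoint is omitted for brevity:

```python
# embedding_server_sketch.py -- hedged sketch; payload field names are
# assumptions, not the PR's actual schema.
import base64

import numpy as np
from fastapi import FastAPI
from pydantic import BaseModel
from sentence_transformers import SentenceTransformer

app = FastAPI()
# Load the model onto the GPU once at startup (requires a CUDA device).
model = SentenceTransformer("intfloat/e5-small-v2", device="cuda")

class EmbedRequest(BaseModel):
    texts: list[str]

@app.post("/embed")
def embed(req: EmbedRequest):
    # Plain-JSON response: nested lists of floats.
    vecs = model.encode(req.texts, normalize_embeddings=True)
    return {"embeddings": vecs.tolist()}

@app.post("/embed_fast")
def embed_fast(req: EmbedRequest):
    # Base64-encoded raw float32 bytes avoid per-float JSON serialization,
    # which is where the "optimal performance" claim would come from.
    vecs = model.encode(req.texts, normalize_embeddings=True).astype(np.float32)
    return {
        "dtype": "float32",
        "shape": list(vecs.shape),
        "data": base64.b64encode(vecs.tobytes()).decode("ascii"),
    }

@app.get("/health")
def health():
    return {"status": "ok"}
```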
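And a sketch of the compose file, under the assumption that the image is built from the local `Dockerfile` and the service is named `embedding`; only the NVIDIA runtime and the port 8100 mapping are stated in the PR description:

```yaml
# docker-compose.yml -- sketch; service name and env var are assumptions.
services:
  embedding:
    build: .
    runtime: nvidia            # requires the NVIDIA container runtime
    ports:
      - "8100:8100"            # port stated in the PR description
    environment:
      - NVIDIA_VISIBLE_DEVICES=all
```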
QA retrieval engine:
- `qa_engine.py`, a Python module that loads a FAISS index and QA data, queries the embedding microservice for vector representations, and returns the best-matching answer with latency and similarity metrics. It includes a test harness for quick validation; a sketch of the flow follows below.
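A hedged sketch of that retrieval flow is below. The index and data file names, the service URL, and the base64 response decoding are assumptions that mirror the server sketch above, not the module's actual internals:

```python
# qa_engine_sketch.py -- hedged sketch; file names, endpoint URL, and
# payload shape are assumptions.
import base64
import json
import time

import faiss
import numpy as np
import requests

EMBED_URL = "http://localhost:8100/embed_fast"  # assumed service endpoint

index = faiss.read_index("qa.index")            # hypothetical index file
with open("qa_data.json") as f:                 # hypothetical QA pairs
    qa_pairs = json.load(f)

def embed(text: str) -> np.ndarray:
    """Fetch a query vector from the embedding microservice."""
    resp = requests.post(EMBED_URL, json={"texts": [text]}, timeout=5)
    resp.raise_for_status()
    body = resp.json()
    vec = np.frombuffer(base64.b64decode(body["data"]), dtype=np.float32)
    return vec.reshape(1, -1)

def answer(question: str) -> dict:
    """Return the best-matching answer with latency and similarity metrics."""
    t0 = time.perf_counter()
    query = embed(question)
    scores, ids = index.search(query, k=1)      # top-1 nearest neighbor
    latency_ms = (time.perf_counter() - t0) * 1000
    best = qa_pairs[ids[0][0]]
    return {
        "answer": best["answer"],
        "similarity": float(scores[0][0]),
        "latency_ms": latency_ms,
    }

if __name__ == "__main__":
    # Quick validation, in the spirit of the module's test harness.
    print(answer("How do I reset my password?"))
```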