HF · Text Embeddings · Apache 2.0

Qwen3-Embedding-0.6B

by Qwen

Compact multilingual text embedding with 100+ language support

2.1M downloads/month

0.6B params

Identifiers

Model ID

Qwen/Qwen3-Embedding-0.6B

Feature URI

mixpeek://text_extractor@v1/qwen3_embedding_06b_v1

Overview

Qwen3-Embedding-0.6B is the smallest model in the Qwen3 Embedding family, at 600 million parameters, and reports an average MTEB Multilingual score of 64.33 (see Benchmarks). It supports 100+ languages, context lengths up to 32K tokens, and embedding dimensions from 32 to 1024. Because the model is trained with Matryoshka representation learning, a full 1024-dim embedding can be truncated to a shorter prefix and still be used for similarity search.
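To make the Matryoshka dimensions concrete, here is a minimal sketch of client-side truncation: keep the first `dim` components of an embedding and re-normalize to unit length. The helper name `truncateEmbedding` and the toy 8-dim vector are illustrative assumptions, not part of the Mixpeek SDK; in practice the input would be a 1024-dim vector and `dim` would fall in the 32 to 1024 range the model supports.

```typescript
// Truncate a Matryoshka-style embedding to its first `dim` components and
// re-normalize to unit length so cosine and dot-product scores stay comparable.
// Hypothetical helper for illustration; not a Mixpeek SDK function.
function truncateEmbedding(embedding: number[], dim: number): number[] {
  if (!Number.isInteger(dim) || dim < 1 || dim > embedding.length) {
    throw new RangeError(`dim must be an integer in [1, ${embedding.length}]`);
  }
  const head = embedding.slice(0, dim);
  const norm = Math.sqrt(head.reduce((sum, x) => sum + x * x, 0));
  if (norm === 0) return head; // all-zero prefix: nothing to normalize
  return head.map((x) => x / norm);
}

// Toy 8-dim vector standing in for a 1024-dim model output.
const full = [0.5, 0.5, 0.5, 0.5, 0.1, 0.1, 0.1, 0.1];
const small = truncateEmbedding(full, 4);
console.log(small.length); // 4
```

Shorter vectors cut storage and speed up distance computation, usually at some cost in retrieval quality, so the target dimension is worth validating on your own data.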

On Mixpeek, Qwen3-Embedding-0.6B suits high-throughput multilingual text indexing, where fast embedding generation across many languages matters more than the last few points of quality. It is particularly effective for indexing transcripts, captions, and extracted text.

Architecture

Dense transformer built on the Qwen3 0.6B foundation model, trained with a three-stage pipeline: large-scale unsupervised pre-training for foundational semantic understanding, supervised fine-tuning on high-quality labeled datasets, and model merging for optimal generalization. Supports instruction-aware embedding with task prefixes.
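As a rough illustration of instruction-aware embedding, the sketch below prepends a task description to a query before it is embedded. The `Instruct: ...\nQuery:...` template is an assumption based on the upstream Qwen3-Embedding model card, and `formatQuery` is a hypothetical helper; check the upstream card and how Mixpeek applies prefixes before relying on either.

```typescript
// Build an instruction-prefixed query string.
// Assumption: this template mirrors the upstream Qwen3-Embedding model card.
function formatQuery(task: string, query: string): string {
  return `Instruct: ${task}\nQuery:${query}`;
}

const task =
  "Given a web search query, retrieve relevant passages that answer the query";
const queryText = formatQuery(task, "Which talks cover multimodal retrieval?");

// Documents are typically embedded as-is, without an instruction prefix.
const documentText = "This session introduces multimodal retrieval pipelines.";

console.log(queryText);
console.log(documentText);
```

Only the query side carries the instruction in this sketch, so the same document index can serve several tasks by changing the task description at query time.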

Mixpeek SDK Integration

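import { Mixpeek } from "mixpeek";

// Replace "API_KEY" with your Mixpeek API key.
const mx = new Mixpeek({ apiKey: "API_KEY" });

// Ingest a document and embed its text with Qwen3-Embedding-0.6B.
await mx.collections.ingest({
  collection_id: "my-collection",
  source: { url: "https://example.com/document.pdf" },
  feature_extractors: [{
    name: "text_embedding",
    version: "v1",
    params: {
      // HuggingFace model ID, as listed under Identifiers
      model_id: "Qwen/Qwen3-Embedding-0.6B"
    }
  }]
});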

Capabilities

100+ language support with strong multilingual transfer
Flexible embedding dimensions from 32 to 1024
32K token context window
Instruction-aware embedding for task-specific optimization
Compact 0.6B parameter footprint

Use Cases on Mixpeek

High-throughput multilingual text indexing for transcripts and captions

Edge deployment for text search with minimal compute requirements

Real-time semantic search where sub-5ms embedding latency is critical (about 1.5ms per passage on an A100)

Benchmarks

Dataset	Metric	Score	Source
MTEB Multilingual	Avg Score	64.33	Qwen3-Embedding paper, June 2025
MTEB Retrieval	nDCG@10	Competitive with BGE-M3	Qwen3-Embedding paper, June 2025
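For readers unfamiliar with the nDCG@10 metric in the table, here is a minimal sketch of how it can be computed for one query from graded relevance labels listed in ranked order. It uses the linear-gain variant and takes the ideal ordering from the retrieved list only, both simplifying assumptions; benchmark harnesses compute the ideal ranking from all judged documents.

```typescript
// Discounted cumulative gain over the top-k results (linear gain variant).
function dcgAtK(relevances: number[], k: number): number {
  return relevances
    .slice(0, k)
    .reduce((sum, rel, i) => sum + rel / Math.log2(i + 2), 0);
}

// nDCG@k: DCG of the actual ranking divided by DCG of the ideal ranking.
// Simplification: the ideal ranking is built from the same list of labels.
function ndcgAtK(relevances: number[], k: number): number {
  const ideal = [...relevances].sort((a, b) => b - a);
  const idcg = dcgAtK(ideal, k);
  return idcg === 0 ? 0 : dcgAtK(relevances, k) / idcg;
}

// Relevance of each returned passage, in ranked order (1 = relevant).
const ranked = [1, 0, 1, 0, 0];
console.log(ndcgAtK(ranked, 10).toFixed(3)); // "0.920"
```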

Performance

Input Size: 32K tokens max

Embedding Dim: 1024 (Matryoshka: 32-1024)

GPU Latency: ~1.5ms / passage (A100)

CPU Latency: ~12ms / passage

GPU Throughput: ~660 passages/sec (A100)

GPU Memory: ~1.2 GB

0.6B params — smallest Qwen3 embedding model, ideal for high-throughput indexing
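The figures above translate into capacity estimates with simple arithmetic. The sketch below is a back-of-the-envelope calculation, assuming the approximate A100 throughput of 660 passages/sec holds for your passage lengths and that vectors are stored as uncompressed float32 (4 bytes per component); the function names are illustrative, and real numbers will vary with batching, hardware, and index overhead.

```typescript
const GPU_PASSAGES_PER_SEC = 660; // approximate A100 figure quoted above
const BYTES_PER_FLOAT32 = 4; // assumption: uncompressed float32 storage

// Seconds needed to embed `passages` at a given sustained throughput.
function indexingSeconds(
  passages: number,
  passagesPerSec: number = GPU_PASSAGES_PER_SEC
): number {
  return passages / passagesPerSec;
}

// Raw vector storage in bytes, ignoring index structures and metadata.
function rawVectorBytes(passages: number, dim: number): number {
  return passages * dim * BYTES_PER_FLOAT32;
}

const corpusSize = 1000000;
console.log((indexingSeconds(corpusSize) / 60).toFixed(1)); // "25.3" minutes
console.log((rawVectorBytes(corpusSize, 1024) / 1e9).toFixed(2)); // "4.10" GB
console.log((rawVectorBytes(corpusSize, 256) / 1e9).toFixed(2)); // "1.02" GB
```

Under these assumptions, dropping from 1024 to 256 dimensions cuts raw vector storage by 4x, which is the main practical payoff of the Matryoshka dimension range.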

Common Pipeline Companions

distil-whisper/distil-large-v3

Fast transcription → text embedding pipeline

openai/clip-vit-large-patch14

Visual embedding for cross-modal search

Specification

Framework: HF

Organization: Qwen

Feature: Text Embeddings

Output: 1024-dim vector

Modalities: document, audio

Retriever: Text Similarity

Parameters: 0.6B

License: Apache 2.0

Downloads/mo: 2.1M

Research Paper

Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

Build a pipeline with Qwen3-Embedding-0.6B

Add this model to a processing pipeline alongside other extractors. Combine with retrieval stages for end-to-end search.
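To show what a text-similarity retrieval stage does with these embeddings, here is a minimal brute-force sketch that ranks stored vectors by cosine similarity to a query vector. It is a local illustration with toy 3-dim vectors, not the Mixpeek retriever API; `cosine`, `topK`, and the document IDs are hypothetical names, and a production system would use an approximate nearest-neighbor index instead of a full scan.

```typescript
interface Scored {
  id: string;
  score: number;
}

// Cosine similarity between two vectors of equal length.
function cosine(a: number[], b: number[]): number {
  if (a.length !== b.length) throw new Error("dimension mismatch");
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    const x = a[i] ?? 0;
    const y = b[i] ?? 0;
    dot += x * y;
    normA += x * x;
    normB += y * y;
  }
  const denom = Math.sqrt(normA) * Math.sqrt(normB);
  return denom === 0 ? 0 : dot / denom;
}

// Score every stored vector against the query and keep the k best matches.
function topK(query: number[], docs: Map<string, number[]>, k: number): Scored[] {
  const scored: Scored[] = [];
  docs.forEach((vec, id) => scored.push({ id, score: cosine(query, vec) }));
  return scored.sort((x, y) => y.score - x.score).slice(0, k);
}

// Toy vectors standing in for 1024-dim embeddings.
const docs = new Map<string, number[]>([
  ["transcript-1", [0.9, 0.1, 0.0]],
  ["caption-7", [0.0, 1.0, 0.0]],
  ["ocr-3", [0.7, 0.7, 0.1]],
]);
const results = topK([1, 0, 0], docs, 2);
console.log(results.map((r) => r.id));
```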

Alternative Models

BAAI/bge-large-en-v1.5

Text Embeddings

sentence-transformers/all-MiniLM-L6-v2

Text Embeddings

nomic-ai/nomic-embed-text-v2-moe

Text Embeddings

Qwen/Qwen3-VL-Embedding-2B

Text Embeddings

Related in Embeddings

openai/clip-vit-large-patch14

Visual Embeddings

google/siglip-base-patch16-224

Visual Embeddings

google/siglip2-giant-opt-patch16-384

Visual Embeddings

facebook/dinov2-large

Visual Embeddings