marqo-fashionSigLIP

by Marqo

Fashion-domain visual embedding model fine-tuned with Generalised Contrastive Learning

464Kdl/month

83likes

203Mparams

HuggingFace Run on your data, free

Identifiers

Model ID

Marqo/marqo-fashionSigLIP

Feature URI

mixpeek://image_extractor@v1/marqo_fashionsiglip_v1

Overview

Marqo FashionSigLIP is a ViT-B/16-SigLIP model fine-tuned on over 1M fashion products using Generalised Contrastive Learning (GCL). Unlike generic CLIP models, it trains on rich fashion metadata including categories, styles, colors, materials, and fine-grained product details, delivering up to 57% improvement in MRR and recall over previous fashion-specific models.

On Mixpeek, FashionSigLIP powers domain-specific visual search for e-commerce and retail, where generic embeddings miss style nuances like fabric texture, color palette, and silhouette that are critical for product discovery and recommendation.

Architecture

ViT-B/16-SigLIP (webli) backbone fine-tuned with Generalised Contrastive Learning on fashion-specific metadata (categories, styles, colors, materials, keywords). Sigmoid contrastive loss for efficient pairwise training. 768-dimensional shared image-text embedding space.

Mixpeek SDK Integration

import { Mixpeek } from "mixpeek";

const mx = new Mixpeek({ apiKey: "API_KEY" });

// Managed: create a collection over a bucket; Mixpeek runs this model's extractor
const collection = await mx.collections.create({
  namespace_id: "my-namespace",
  collection_name: "my-collection",
  source: { type: "bucket", bucket_ids: ["bkt_your_bucket"] },
  feature_extractor: {
    feature_extractor_name: "visual_embedding",
    version: "v1",
    parameters: { model_id: "Marqo/marqo-fashionSigLIP" },
  },
});

Capabilities

Fashion-optimized 768-dim visual embeddings
Text-to-image and image-to-image product search
Fine-grained attribute awareness (color, material, style, silhouette)
57% MRR improvement over FashionCLIP 2.0

Use Cases on Mixpeek

Visual product search for fashion e-commerce (find similar garments by style)

Automated product categorization and attribute tagging for catalogs

Recommendation engines that understand fashion-specific visual similarity

Benchmarks

Dataset	Metric	Score	Source
Fashion Product Retrieval	MRR improvement vs FashionCLIP 2.0	+57%	Marqo, 2024 — marqo-FashionCLIP GitHub
Fashion Category Classification	Recall improvement vs FashionCLIP 2.0	+57%	Marqo, 2024 — marqo-FashionCLIP GitHub