Mixpeek Logo
    Concepts

    Hierarchical Classification

    Assign content to multi-level category hierarchies using embedding-based classification. Define your taxonomy once, then classify new content automatically with confidence scores.

    video
    image
    text
    Multi-Tier
    76.0K runs
    Run in Builder

    "Show all educational tutorial videos classified under safe content with high confidence"

    Why This Matters

    Taxonomies are organizational infrastructure. Once defined, they enable consistent classification, compliance tagging, and structured navigation across all content.

    import requests
    API_URL = "https://api.mixpeek.com"
    headers = {"Authorization": "Bearer YOUR_API_KEY", "X-Namespace": "your-namespace"}
    # Create hierarchical taxonomy
    taxonomy = requests.post(f"{API_URL}/v1/taxonomies", headers=headers, json={
    "taxonomy_name": "content_classification",
    "taxonomy_type": "hierarchical",
    "retriever_id": "ret_classifier",
    "input_mappings": {
    "query_embedding": "mixpeek://multimodal_extractor@v1/embedding"
    },
    "hierarchy": [
    {
    "node_id": "safe",
    "collection_id": "col_safe_examples",
    "enrichment_fields": ["metadata.category"]
    },
    {
    "node_id": "educational",
    "parent_node_id": "safe",
    "collection_id": "col_educational_examples",
    "enrichment_fields": ["metadata.topic"]
    }
    ]
    }).json()
    # Apply taxonomy to collection
    requests.post(
    f"{API_URL}/v1/collections/col_my_content/apply-taxonomy",
    headers=headers,
    json={"taxonomy_id": taxonomy["taxonomy_id"]}
    )
    # Search within taxonomy categories
    results = requests.post(
    f"{API_URL}/v1/retrievers/taxonomy-search/execute",
    headers=headers,
    json={"query": {"text": "educational tutorial videos"}}
    ).json()
    for doc in results["documents"]:
    print(f"Document: {doc['document_id']}")
    print(f" Category: {doc.get('taxonomy_path', 'N/A')}")

    Feature Extractors

    Image Embedding

    Generate visual embeddings for similarity search and clustering

    752K runs

    Text Embedding

    Extract semantic embeddings from documents, transcripts and text content

    827K runs

    Video Embedding

    Generate vector embeddings for video content

    610K runs

    Retriever Stages

    attribute filter

    Filter documents by metadata attribute values using boolean logic

    filter

    Documentation

    Use Cases Using This Recipe

    Intermediate
    Coming Soon
    8 min

    Product Affordance Intelligence

    Understand what products can do, not just what they look like

    +35% improvement

    Search relevance (NDCG)

    Who It's For

    E-commerce platforms, product catalog managers, and merchandising teams managing 100K+ SKU catalogs

    Advanced
    Coming Soon
    9 min

    AdTech Creative Intelligence

    Understand what makes ad creatives perform before they run

    99% faster

    Creative approval speed

    Who It's For

    Ad networks, DSPs, creative agencies, and brand marketing teams managing 10K+ creative assets monthly

    Advanced
    Coming Soon
    7 min

    Government Intelligence

    Multimodal search and analysis for government document repositories

    100% unified index

    Cross-department search coverage

    Who It's For

    Government agencies, policy researchers, compliance teams, and public affairs professionals managing multi-department document repositories

    Beginner
    Coming Soon
    7 min

    Asset Intelligence (DAM Auto-Labeling)

    Auto-tag and organize digital assets with multimodal AI

    95% reduction

    Manual tagging effort

    Who It's For

    Creative teams, brand managers, and media companies managing 100K+ digital assets across DAM platforms

    Intermediate

    AI Content Moderation for User-Generated Content

    Automatically detect and flag policy-violating content across text, images, and video

    95%+ of violations flagged before going live

    Pre-publication violation catch rate

    Who It's For

    UGC platforms, social media companies, marketplace operators, and community platforms processing 100K+ daily uploads requiring trust and safety review

    Beginner

    AI-Powered Digital Asset Management

    Search, organize, and enrich your media library with multimodal AI

    80% faster search-to-find

    Asset discovery time

    Who It's For

    Media companies, creative agencies, brand teams, and publishers managing libraries of 500K+ images, videos, and documents across production workflows

    Intermediate

    Automated Video Tagging for Streaming

    Auto-generate rich metadata for every scene, shot, and moment in your catalog

    10x more tags than manual editorial process

    Metadata tags per title

    Who It's For

    Streaming platforms, content distributors, and VOD services managing catalogs of 10K+ titles that need rich metadata for discovery and recommendation

    Intermediate
    9 min

    Visual Product Search for Ecommerce

    Let shoppers search your catalog with images instead of keywords

    2.3x increase for visual search users

    Search-to-purchase conversion

    Who It's For

    Ecommerce platforms, online marketplaces, fashion retailers, home goods stores, and any product catalog with 10K+ SKUs where visual discovery drives conversion

    Intermediate
    10 min

    Brand Safety Verification

    AI-powered brand safety scoring for ad placements and content partnerships

    95% reduction in unsafe ad adjacency

    Brand safety violation rate

    Who It's For

    Brand safety teams at agencies, DSPs, SSPs, ad networks, and brand marketers who need to verify that ad placements and content partnerships meet safety standards before spend is allocated

    Advanced
    9 min

    AI Compliance Document Review

    Automate regulatory document review with multimodal AI understanding

    10x faster

    Review cycle time

    Who It's For

    Compliance teams, regulatory affairs departments, and legal operations groups reviewing 1,000+ regulatory documents per quarter across banking, insurance, pharma, and financial services

    Advanced
    12 min

    Clinical NLP at Scale

    Extract structured intelligence from clinical notes, pathology reports, and medical records

    94% F1 on medical NER benchmarks

    Entity extraction accuracy

    Who It's For

    Healthcare IT teams, clinical informatics departments, and health systems processing thousands of clinical documents daily