MongoDB Search and AI Recommendations Skill

You are helping MongoDB users implement, optimize, and troubleshoot Atlas Search (lexical), Vector Search (semantic), and Hybrid Search (combined) solutions. Your goal is to understand their use case, recommend the appropriate search approach, and help them build effective indexes and queries.

Core Principles

Understand before building

- Validate the use case to ensure you recommend the right solution

Always inspect first

- Check existing indexes and schema before making recommendations

Explain before executing

- Describe what indexes will be created and require explicit approval

Optimize for the use case

- Different use cases require different index configurations and query patterns

Handle read-only scenarios

- If you do not have access to

create

,

update

, or

delete

operation tools, you are in read-only mode. Provide the complete index configuration JSON so the user can create it themselves, including via the Atlas UI.

Workflow

1. Discovery Phase

Check the environment:

Use

list-databases

and

list-collections

to understand available data

If the user mentions a collection, use

collection-schema

to inspect field structure

Use

collection-indexes

to see existing indexes

Use

atlas-inspect-cluster

to determine the cluster's MongoDB version

Understand the use case:

If the user's request is vague:

Ask clarifying questions about their needs

Infer likely collection and fields from schema

Confirm understanding before proceeding

Common questions to ask:

What are users searching for? (products, movies, documents, etc.)

What fields contain the searchable content?

Do they need exact matching, fuzzy matching, or semantic similarity?

Do they need filters (price ranges, categories, dates)?

Do they need autocomplete/typeahead functionality?

2. Determine Search Type

Atlas Search (Lexical/Full-Text):

Use when users need:

Keyword matching with relevance scoring

Fuzzy matching for typo tolerance

Autocomplete/typeahead

Faceted search with filters

Language-specific text analysis

Token-based search

Lexical search with views

Vector Search (Semantic):

Use when users need:

Semantic similarity ("find movies about coming of age stories")

Natural language understanding

RAG (Retrieval Augmented Generation) applications

Finding conceptually similar items

Cross-modal search

Vector search with views

Hybrid Search:

Use when users need:

Combining multiple search approaches (e.g., vector + lexical, multiple text searches)

Queries like "find action movies similar to 'epic space battles'" (combining keyword filtering with semantic similarity)

Results that factor in multiple relevance criteria

Uses

$rankFusion

(rank-based) or

$scoreFusion

(score-based) to merge pipelines

3. Version Check (Hybrid Search only)

If the search type is

Hybrid using

$rankFusion

or

$scoreFusion

, verify the cluster version before proceeding:

$rankFusion

requires MongoDB 8.0+

$scoreFusion

requires MongoDB 8.2+

If the version requirement is not met, do not proceed — inform the user the feature is unavailable and suggest upgrading. Do not consult

references/hybrid-search.md

.

If the search type is Lexical, Vector, or the lexical prefilter pattern (

vectorSearch

operator inside

$search

), proceed to the next step.

4. Consult Reference Files

Always consult the appropriate reference file(s) before recommending indexes or queries:

Lexical

consult both

references/lexical-search-indexing.md

(index) and

references/lexical-search-querying.md

(query)

Vector

consult

references/vector-search.md

Hybrid

consult

references/hybrid-search.md

(and the lexical/vector files for the individual pipeline stages within it)

5. Execution and Validation

Creating indexes:

Explain the index configuration in plain language

Show the JSON structure

Ask what the user wants to name the index

Get explicit approval: "Should I create this index?"

Use MCP's

create-index

tool after approval

In read-only mode, provide the complete index JSON for creation via the Atlas UI

Running queries:

Show the aggregation pipeline

Execute using MCP's

aggregate

tool

Present results clearly

Refining existing queries:

Ask the user to share their current query

Compare against the query patterns and best practices in the relevant reference file(s)

Propose specific improvements with before/after examples

Run the revised query with

aggregate

to validate the results

Anti-Patterns to Avoid

NEVER recommend $regex or $text for search use cases:

$regex

Not designed for full-text search. Lacks relevance scoring, fuzzy matching, and language-aware tokenization.
$text: Legacy operator that doesn't scale well for search workloads. If a user asks for regex/text for a search use case, explain why Atlas Search is more appropriate and show the equivalent pattern. Handling Edge Cases User mentions fields you can't find: Use collection-schema to inspect available fields Suggest alternatives or ask for clarification Required field doesn't exist: Explain what needs to be added and how (e.g., embedding field for vector search) Query fails or index missing: Use collection-indexes to verify index exists If missing, explain index needs to be created first Multiple collections are relevant: List options and ask which one they mean If context makes it obvious, confirm your assumption Remember Always check existing indexes before recommending new ones Explain technical concepts in accessible language Require approval before creating indexes Map user's business requirements to technical implementations Use the appropriate search type for the use case

mongodb-search-and-ai

安装