1 of 25

Using MongoDB Atlas Vector Search for AI Semantic Search

Leonardo Gomes - Certified MongoDB Developer

Jan., 2025

2 of 25

What is the first thing that comes to mind when you hear the word AI?

3 of 25

Artificial Intelligence (AI)

A field in computer science that trains computers to simulate human intelligence.

https://www.mongodb.com/resources/basics/what-is-artificial-intelligence

Summary of AI concepts

4 of 25

Machine Learning

Supervised learning is a machine learning model that maps a specific input to an output using structured data.
Unsupervised learning is a machine learning model that learns patterns based on unstructured data.
Reinforcement learning is a machine learning model that can be broadly described as “learn by doing.”

https://cloud.google.com/learn/what-is-artificial-intelligence

Machine learning is a subset of artificial intelligence that uses algorithms to train data to obtain results.

5 of 25

Deep Learning

Deep learning models consist of artificial deep neural networks — i.e., interconnected neurons (or nodes) — and have many layers, enabling them to process more complex data patterns than machine learning algorithms.

https://www.mongodb.com/resources/basics/what-is-artificial-intelligence

Deep learning is a subset of machine learning that resembles human intelligence.

6 of 25

Large language model (LLM) and Generative AI

Generative AI refers to the use of AI to create new content, like text, images, music, audio, and videos.�E.g.: ChatGPT, DALL-E, Llama Live.

https://cloud.google.com/ai/llms�https://cloud.google.com/use-cases/generative-ai

A LLM is a statistical language model that can be used to generate and translate text and other content.

LLMs and generative AI are subsets of deep learning.

7 of 25

Natural language processing (NLP)

https://www.ibm.com/think/topics/natural-language-processing

A subfield of computer science and artificial intelligence (AI) that uses machine learning to enable computers to understand and communicate with human language.�E.g.: ChatBot

8 of 25

Summary of concepts:

Artificial Intelligence: A field of Computer Science that trains computers to simulate human intelligence

Machine Learning: Uses algorithms to train models based on data

Deep Learning: Uses artificial neural networks to simulate how the brain works

Generative AI: Leverages Machine Learning Models to generate media based on a given prompt. E.g.: ChatGPT

Natural Language Processing: Gives computer the ability to understand human language. E.g.: ChatBot

Artificial Intelligence

Machine Learning

Deep Learning

Generative AI

Natural Language�Processing

https://learn.mongodb.com/learn/course/introduction-to-ai-and-vector-search-ict-learnathon/lesson-1-introduction-to-ai

9 of 25

Vectors

A vector has magnitude and direction, and can represent complex data in data science through numerical features. Vector databases store these representations, enabling efficient similarity searches in a multi-dimensional space.

https://www.mongodb.com/resources/basics/databases/vector-databases

10 of 25

Embeddings

An embedding model converts diverse data types like text, images, and audio into vectors, positioning them in a multi-dimensional space.

https://causewriter.ai/courses/ai-explainers/lessons/vector-embedding

11 of 25

Vector Databases

https://www.mongodb.com/resources/basics/databases/vector-databases

12 of 25

Vector search and the cosine algorithm

Cosine Similarity calculates the cosine of the angle between two vectors, revealing how closely the vectors are aligned.

For instance, words like "cat" and "dog" will have a higher cosine similarity than "cat" and "banana."

https://www.timescale.com/learn/understanding-cosine-similarity

https://ubiai.tools/how-vector-similarity-search-functions

13 of 25

When was vector search created?

MongoDB Atlas Vector Search currently provides 3 approaches to calculate vector similarity:

euclidean distance
cosine product
dot product

https://www.mongodb.com/blog/post/vector-search-llm-essentials-what-when-why

14 of 25

Calculating the cosine similarity

We define cosine similarity mathematically as the dot product of the vectors divided by their magnitude.

https://www.learndatasci.com/glossary/cosine-similarity

import numpy as np

def cosine_similarity(x, y):

# Ensure length of x and y are the same

if len(x) != len(y) :

return None

# Compute the dot product between x and y

dot_product = np.dot(x, y)

# Compute the L2 norms (magnitudes) of x and y

magnitude_x = np.sqrt(np.sum(x**2))

magnitude_y = np.sqrt(np.sum(y**2))

# Compute the cosine similarity

cosine_similarity = dot_product / (magnitude_x * magnitude_y)

return cosine_similarity

15 of 25

Using the cosine to find similarities

https://www.timescale.com/learn/understanding-cosine-similarity

Using cosine, the closest words are those nearest to the search term and in the same direction.

✅

❌

16 of 25

MongoDB Atlas

MongoDB Atlas is a multi-cloud database service provided by MongoDB.

Atlas simplifies deploying and managing databases while offering versatility to build resilient and performant global applications on cloud providers.

https://www.mongodb.com/docs/atlas

17 of 25

MongoDB Atlas Vector Search

Streamlined simplicity: Keep operational and vector data together for ease of management.
Enhanced querying: Conduct advanced searches by blending vector queries with metadata filters, graph lookups, and more, all within one database.
Optimized scaling: MongoDB's architecture uniquely scales vector searches, ensuring isolated workloads and high performance at scale.

https://www.mongodb.com/products/platform/atlas-vector-search

18 of 25

The combined power of vectors and MongoDB

https://www.mongodb.com/resources/basics/databases/vector-databases

19 of 25

OpenAI

Example request:

Response:

OpenAI specializes in AI for natural language processing and offers the Embedding API, a tool for generating document embeddings.

https://platform.openai.com/docs/api-reference/embeddings

curl https://api.openai.com/v1/embeddings \

-H "Authorization: Bearer $OPENAI_API_KEY" \

-H "Content-Type: application/json" \

-d '{

"input": "The food was delicious and the waiter...",

"model": "text-embedding-ada-002",

"encoding_format": "float"

}'

{

"object": "list",

"data": [

{

"object": "embedding",

"embedding": [

0.0023064255,

-0.009327292,

.... (1536 floats total for ada-002)

-0.0028842222,

],

"index": 0

}

],

"model": "text-embedding-ada-002",

"usage": {

"prompt_tokens": 8,

"total_tokens": 8

}

20 of 25

Pricing | OpenAI

Embedding models: Build advanced search, clustering, topic modeling, and classification functionality with our embeddings offering.

https://openai.com/api/pricing��

*Batch API pricing requires requests to be submitted as a batch.

Model	Pricing	Pricing with Batch API*
text-embedding-3-small	$0.020 / 1M tokens	$0.010 / 1M tokens
text-embedding-3-large	$0.130 / 1M tokens	$0.065 / 1M tokens
ada v2	$0.100 / 1M tokens	$0.050 / 1M tokens

21 of 25

What are tokens in the OpenAI and how to count them?

Tokens are pieces of words. Before processing, the input is split into tokens, which may include trailing spaces or sub-words.�Here are some examples:

Input	~# of tokens	Pricing with ada v2
4 chars in English	1 token	$0.0000001
¾ words	1 token	$0.0000001
¾ words	100 tokens	$0.00001
1-2 sentence	30 tokens	$0.000003
1 paragraph	100 tokens	$0.00001
1,500 words	2048 tokens	$0.0002048

https://help.openai.com/en/articles/4936856-what-are-tokens-and-how-to-count-them

22 of 25

Pricing | MongoDB Atlas Vector Search

For this demo, we will use Atlas Search M0 (Free Cluster).

Maximum allowed vector search indexes: 3 *
Storage: 512MB
RAM, vCPU: Shared
Ops/Sec: 0-100
Features:

Free forever 💚

* Vector Search Indexes are different from Database indexes.�There is no limited number for standard database indexes.

https://www.mongodb.com/docs/atlas/atlas-search/shared-tier-limitations

23 of 25

Part 2: Hands-on!

https://github.com/leogomesdev/moviesflix

Creating a Free MongoDB Atlas Cluster
Using a sample dataset
Creating an Atlas Search Index
Exploring the data with MongoDB Compass
Using the Open AI embeddings API
Testing the semantic search

Steps:

1. MongoDB Atlas: https://account.mongodb.com/account/login

1. Show how to create an index and edit the existing one

2. MongoDB Compass: Explore the database from the cluster above

3. OpenAI Quick Start: https://platform.openai.com/docs/quickstart

4. OpenAI current usage dashboard: https://platform.openai.com/settings/organization/usage

5. PostMAN: Show sample requests using the model text-embedding-ada-002

6. Back to MongoDB Compass:

{

queryVector: [dimension1, dimension2, ...],

path: 'embeddings',

numCandidates: 10,

index: 'vectorsearch',

limit: 10,

}

7. Showcase the Node.js app, suggested inputs:

- Canines doing stuff

- Fluffy animals

- European history

- Animal who is a huge fan of lasagna

- flying cats

https://github.com/leogomesdev/moviesflix

https://us-east-1.console.aws.amazon.com/apprunner/home?region=us-east-1#/services

https://82vi9ghpzw.us-east-1.awsapprunner.com

24 of 25

Curious in learning more?

MongoDB University - free courses and training

Learning Path: Introduction to MongoDB, AI, and Vector Search

25 of 25

Curious in learning more?

Toronto MongoDB User Group: https://www.meetup.com/toronto-mongodb-usergroup

Connect with MUGs From All Over the World: https://mdb.link/mug