1. Cohere vectorizer
Cohere vectorizer uses the Cohere API to create Embeddings (vectors) from data. Vectorizing from large language models is CPU intensive, often requires GPUs, and can be very ineffective without proper optimisations. This is the purpose of this module, which calls Cohere servers with their Embeddings API.
Create a docker compose file from Weaviate wizzard : select the Text transformers vectorizer, with a specific transformer model among the list. Then start the docker container.
- (1) Select the “Text” vectorizer type
- (2) Select the “Cohere” vectorizer
- (3) Leave it. The API key will be set later, in WPSOLR’s settings.
Download and execute the docker-compose file generated by the wizard (docker-compose up -d):
Let’s setup our model on Cohere dashboard:
(Please check the Cohere Embed pricing first)
a) Login: https://dashboard.cohere.ai/welcome/login (signup first if necessary)
b) Select the API Keys menu
- (2) or (3) Copy your OpenAI API key (production or trial) for later during your WPSOLR index configuration
Now, let’s create our index in WPSOLR:
- (1) Select the text2vec-cohere vectorizer module
- (2) Copy your Cohere API access token described earlier
- (3) Set a name for you index, visible in WPSOLR admin
- (4) Set a name for your Weaviate class (index)
- (5) Set the url of your Weaviate docker container
- (6) Set the Cohere model name, “multilingual-22-12“
- (5) Create the index. Done!
Connect to the Weaviate GraphQL console at https://console.semi.technology/console with url https://localhost:8080, to check our new index (class):
2 Select your data
- (1) (2) (3) select the index you just created
- (4) Choose a filter: “Near Text” to perform a vector search (search on concepts), or “Where” to perform a keywords search (classic search that works with words)
- (5) Set a similarity for your “Near Text” search. The closer to “1”, the more precise is the vector search.
3 Index your data