The best open source AI on demand in a sovereign cloud
Discover the best open source alternatives to ChatGPT, Gemini, Midjourney or Claude for processing sensitive data in full compliance with European and Swiss law.
LLM↓
Embeddings↓
Audio↓
Image↓
Large language models (LLM)
The best open source alternatives to ChatGPT, Gemini and Microsoft Copilot for interacting, analysing and generating content with AI.
LLama 3.2
The most powerful
- ●
Optimised to handle large amounts of text ensuring consistency across multiple sources
- ●
Excellent in development, programming and academic research tasks
- ●
High multilingual flexibility with more than 30 languages supported
- ●
Suitable for artists and content creation, including storytelling
Max. input tokens
100’000
Max. output token
8’000
Languages
EN, ES, FR, DE, IT...
Training
2024/07
Functions call
No
- ●
Optimised to handle large amounts of text ensuring consistency across multiple sources
- ●
Excellent in development, programming and academic research tasks
- ●
High multilingual flexibility with more than 30 languages supported
- ●
Suitable for artists and content creation, including storytelling
Max. input tokens
100’000
Max. output token
8’000
Languages
EN, ES, FR, DE, IT...
Training
2024/07
Functions call
No
Mixtral 8x22B
The most versatile
- ●
Larger training corpus than Mixtral 8x7B for more complex tasks
- ●
Able to analyse unstructured data to support decision making and generate content
- ●
Management of conversational subtleties to feed complex discussions
- ●
Optimised for logical exploration (combining complex information) and generating ideas (scenarios, etc.)
Max. input tokens
23’000
Max. output token
23’000
Languages
FR, EN, DE, ES, IT
Training
2024/07
Functions call
Yes
- ●
Larger training corpus than Mixtral 8x7B for more complex tasks
- ●
Able to analyse unstructured data to support decision making and generate content
- ●
Management of conversational subtleties to feed complex discussions
- ●
Optimised for logical exploration (combining complex information) and generating ideas (scenarios, etc.)
Max. input tokens
23’000
Max. output token
23’000
Languages
FR, EN, DE, ES, IT
Training
2024/07
Functions call
Yes
Mixtral 8x7B
The fastest and most economical
- ●
Economical and very fast for many common tasks
- ●
Ideal for summarising, moderating, calculating, coding and extracting data from unstructured sources
- ●
Suitable for real-time interpretation of data and for logical reasoning
- ●
Easy to adjust and contextualise in order to limit undesirable outcomes
Max. input tokens
30’000
Max. output token
30’000
Languages
EN, ES, FR, DE, IT...
Training
2024/07
Functions call
No
- ●
Economical and very fast for many common tasks
- ●
Ideal for summarising, moderating, calculating, coding and extracting data from unstructured sources
- ●
Suitable for real-time interpretation of data and for logical reasoning
- ●
Easy to adjust and contextualise in order to limit undesirable outcomes
Max. input tokens
30’000
Max. output token
30’000
Languages
EN, ES, FR, DE, IT...
Training
2024/07
Functions call
No
Embedding models
The best open source embedding models to transform your data into intelligent vectors. Improve search accuracy, personalize recommendations, simplify data analysis, explore semantic links and easily classify text.
Bge Multilingual Gemma2
The highest quality
- ●
The most powerful open source embedding model on the market
- ●
The reference for semantic search and augmented search (ASR) tasks
- ●
Ideal for advanced use of embedding vectors in a variety of applications
- ●
Outstanding performance, whatever the language of the text (100 languages)
Max. input tokens
8192
Parameters
9.2 B
Dimensions
3584
Languages
EN, ES, FR, DE, IT...
Type
Text
- ●
The most powerful open source embedding model on the market
- ●
The reference for semantic search and augmented search (ASR) tasks
- ●
Ideal for advanced use of embedding vectors in a variety of applications
- ●
Outstanding performance, whatever the language of the text (100 languages)
Max. input tokens
8192
Parameters
9.2 B
Dimensions
3584
Languages
EN, ES, FR, DE, IT...
Type
Text
All MiniLM L12 v2
The best value for money
- ●
This model is the result of community work based on a model published by Microsoft.
- ●
Excellent value for money, ideal for prototyping and simple tasks with limited resources
- ●
Interesting performance for relatively simple tasks, whatever the language of the text
- ●
Extreme speed for indexing huge databases or real-time processing
- ●
High energy efficiency to reduce environmental impact
Max. input tokens
512
Parameters
33 M
Dimensions
384
Languages
EN, ES, FR, DE, IT...
Type
Text
- ●
This model is the result of community work based on a model published by Microsoft.
- ●
Excellent value for money, ideal for prototyping and simple tasks with limited resources
- ●
Interesting performance for relatively simple tasks, whatever the language of the text
- ●
Extreme speed for indexing huge databases or real-time processing
- ●
High energy efficiency to reduce environmental impact
Max. input tokens
512
Parameters
33 M
Dimensions
384
Languages
EN, ES, FR, DE, IT...
Type
Text
Voice recognition
The best open source AI for transcribing audio files into text or generating realistic human voices.
Whisper V3
For complex transcriptions
- ●
Model trained on over 1 million hours of data
- ●
Transcription errors reduced by up to 20% compared with Whisper V2
- ●
Better handling of accents, background noise and complex speech (e.g., calls or videoconferences)
- ●
Enhanced multilingual support and translation of transcriptions into languages other than English
Maximum file size
25 MB
Formats supported
mp3, mp4, aac, wav, flac, ogg, opus, wma, m4a
- ●
Model trained on over 1 million hours of data
- ●
Transcription errors reduced by up to 20% compared with Whisper V2
- ●
Better handling of accents, background noise and complex speech (e.g., calls or videoconferences)
- ●
Enhanced multilingual support and translation of transcriptions into languages other than English
Maximum file size
25 MB
Formats supported
mp3, mp4, aac, wav, flac, ogg, opus, wma, m4a
Whisper V2
For most transcriptions
- ●
Audio transcription in over 57 languages and translation of transcribed text into English
- ●
Model trained on 680,000 hours of data in 98 languages
- ●
Automatic identification of the original language
Maximum file size
25 MB
Formats supported
mp3, mp4, aac, wav, flac, ogg, opus, wma, m4a
- ●
Audio transcription in over 57 languages and translation of transcribed text into English
- ●
Model trained on 680,000 hours of data in 98 languages
- ●
Automatic identification of the original language
Maximum file size
25 MB
Formats supported
mp3, mp4, aac, wav, flac, ogg, opus, wma, m4a
Image generation and processing
The best open source alternatives to Midjourney, Microsoft Copilot Designer and Gemini for generating, merging or interpreting images.
SDXL-Lightning
Ideal for generating images
- ●
The best combination of quality and speed in generative AI image creation
- ●
Fast generation of photo-realistic images in 1, 2, 4 or 8 steps based on prompts
- ●
Operates by distillation, which increases energy efficiency and ensures excellent quality
- ●
Optimised for English, with limited knowledge of other languages (FR, DE, ES, IT, etc.)
Max. input tokens
77
Max. image output
5
Languages
EN
Maximum resolution
1024x1024, 1792x1024, 1024x1792
- ●
The best combination of quality and speed in generative AI image creation
- ●
Fast generation of photo-realistic images in 1, 2, 4 or 8 steps based on prompts
- ●
Operates by distillation, which increases energy efficiency and ensures excellent quality
- ●
Optimised for English, with limited knowledge of other languages (FR, DE, ES, IT, etc.)
Max. input tokens
77
Max. image output
5
Languages
EN
Maximum resolution
1024x1024, 1792x1024, 1024x1792
Photomaker V2
Ideal for modifying and merging portraits of people
- ●
Create photos in multiple styles from one or more profile photos
- ●
Powerful and flexible: recontextualisation, colourisation, age and gender change, mix of identities, etc.
Max. input tokens
77
Max. image input
6
Max. image output
5
Languages
EN
Maximum resolution
1024x1024, 1792x1024, 1024x1792
- ●
Create photos in multiple styles from one or more profile photos
- ●
Powerful and flexible: recontextualisation, colourisation, age and gender change, mix of identities, etc.
Max. input tokens
77
Max. image input
6
Max. image output
5
Languages
EN
Maximum resolution
1024x1024, 1792x1024, 1024x1792
Flux schnell
To generate high-quality images
- ●
Outstanding image quality, surpassing DALL-E 3 and MidJourney in some areas
- ●
Prompt fidelity and precise interpretation of complex scenes
- ●
A wide range of styles
Max. input tokens
76
Max. image output
5
Languages
EN
Maximum resolution
1024x1024, 1792x1024, 1024x1792
- ●
Outstanding image quality, surpassing DALL-E 3 and MidJourney in some areas
- ●
Prompt fidelity and precise interpretation of complex scenes
- ●
A wide range of styles
Max. input tokens
76
Max. image output
5
Languages
EN
Maximum resolution
1024x1024, 1792x1024, 1024x1792