Ozeki AI Server is the best AI Server Software you can use on Windows or Linux. It allows you to run multiple AI models on the same GPUs/NPUs simultaneously. This way you can utilize your AI hardware much more efficiently and you can setup multiple AI services with a single system.
What is Ozeki AI
Ozeki AI Server is the software you need if you want to build a Local AI system. To run AI models locally, you need a computer with a GPU or NPU and you need a software that will run the AI Models. Ozeki AI Server is the software that runs the AI models on your computer. It provides a chat user interface to ask questions from the AI models, and it offers ready to use AI services and APIs to use these models.
Why is it better
Because other AI execution frameworks can only run one AI model at a time. If you use Ozeki, you can run multiple models at the same time. Runing multiple AI models gives you a huge advantage, due to the fact that specialized AI models do a better job than general AI models. For example the Qwen coder model is way much better in writing computer software, the Flux model is better in image generation, and the NVidia trained Nemotrom LLm is better then ChatGPT for standard AI chat. With Ozeki AI server you can run all of these on the same computer alongside each other.
Get started
Step 1. Download Ozeki AI Chat
Download and install Ozeki AI Chat server on your Windows or Linux pc. This AI server will allow you
to run AI models on your GPUs locally and will allow you to setup AI services, such as
AI chat,
AI phone,
AI e-mail,
AI SQL or
AI coding support.
Step 2. Download AI Models
You can download AI models from the huggingface.co website, where over a million
models are available. When you download a model, please select the GGUF format.
Click on the following button to learn more about how to download AI models.
Step 3. Setup AI chat bots
Once your AI models are downloaded, you can setup AI chat bots and use your local AI
from mobile devices (Android or iPhone), from desktop PCs and laptops (Windows, Mac)
or from any device that can run a webbrowser.
How can I use Ozeki AI
The two most common ways are a chat inteface and an API. Human users interact with their AI models through the chat interface. The chat interface can be acccessed through a webbrowser (Chrome, Edge, Safari, Firefox, etx) or through a downloaded chat client. Any user can download an Ozeki chat client to enjoy faster access and more productivity. Software developers use the Ozeki API, which is compatible with other AI APIs, such as the ChatGPT API or CoPilot API. This makes switching easier, and development costs can be significantly reduced if developers build AI apps on a local AI systems powered by the Ozeki AI Server software.
Benefits
The greatest benefit of Ozeki AI is that it makes it possible to run multiple AI models on the same hardware simulatanously. This way you can utilize your AI hardware to the fullest extent and you can save on hardware costs. The Ozeki AI software will make sure that the AI models are loaded and unloaded when needed and the hardware resources are utilized to their best ability. This is achieved by combining a time sharing and parallel execution approach with a state of the art resource allocation algorithm.
Ozeki AI server offers other benefits as well:
Private
Chats and knowledge stay privateA Local AI server with Ozeki AI installed will make sure everything stays on your own system and only you will have access to your sensitive data.
Cost efficient
Your own system will cost lessRenting AI hardware or subscribing to online AI services will add up over the long run. Owning your own AI system will cost less especially if you have multiple users.
Future proof
Add models as they become availableYou can always download new AI models and add them to your system as they become available to take advanage of new developments in the field of AI.
Reliable
Ozeki AI is designed to run 24/7By running your own servers you can be sure, that your AI will always be available. You can setup a single PC system or a redundant system with Ozeki Cluster.
High performance
Performance optimized codeHardware acceleration is used and the code is optimized to provide the best performance. e.g. the Ozeki tokenizer is 3x faster on the same hardware.
Comfortable
Ergonomic chat clients on mobileOzeki AI comes with an easy to use, convenient and ergonomic chat clients for mobile phones, laptops, and desktop PCs. The users love it.
What is in the package
When you download Ozeki AI Server, the package will include a set of apps that work together to bring you a complete AI experience. The most useful item is the AI gateway, because it allows you to create amazing AI services for your users.
AI services
AI services take the power of AI models and connect it to real-life systems that are used by organizations on a daily basis. For example the Ozeki AI Server allows you to connect your AI models to communication channels, such as E-mail, Phone or SMS. It also makes it possible to interact with databases in human language. With Ozeki AI, custom AI apps can be written to create amazing AI services for any business. Check out some of the AI services you can create:
Run Large Language Models (LLMs) with local knowledge
Ozeki AI allows you to setup AI chat bots to support your team with local knowledge. This means, that you can run GPT chats locally with your own data. If you introduce Ozeki AI technology in your business, you can make your team more efficient by helping them with smart AI chat bots trained on information specific to their jobs. You can setup AI chat bots with custom prompts that define their behaviour for different organization units, such as Customer Support, HR, Sales or to support the productive work of the Marketing Department.
Ozeki AI Quick Start GuideUse local and on-line AI models
Ozeki AI is a local AI chat system, but it also allows you to use on-line AI models such as ChatGPT, CoPilot, Anthropic, Perplexity AI and others. If you find an On-line AI service, that performs better for a certain task, you can simply set it up using an API key and use it as if it was a local AI model. You can use On-line AI models to answer e-mails, make phone calls or provide AI voice assitance the same way you can use local AI models.
Mixing the power of local AI models and on-line AI models and using the one that offers the best results for a given task is the best way to work with AI.
On-line AI models are also a good choice if you don't have the required hardware capacity to run AI models locally.
What is an AI pipeline
The AI pipeline can also be called as AI automation. It is a technology that allows you to create a flow chart that can be used to create automated workflow or to build more intelligent chat bots.
The Ozeki AI pipeline can also be pictured as a chain of prompts and commands. It can be setup to achieve better results when automating tasks with AI. When a question is forwarded to the AI piplene, prompts are executed one after another to provide a better response at the end. You can design your custom AI pipelines by writing multiple prompts and passing the output of one prompt to the next. The AI pipeline can also include automation nodes, that lookup informatoin or trigger communication through various channels.
A simple AI piplene we use at Ozeki can be pictured as follows:
Use multiple channels
An AI chat bot created in Ozeki AI, can be accessed through multiple communication channels. The same chat bot can answer through chat, SMS, WhatsApp, E-mail, Voice calls, or through the API. This is useful because customers can get in touch through different channels, and often the same intelligence is required to serve them properly accross all channels.
Prepare task execution for your employees
The Ozeki AI chatbot configuration form, allows you to setup preparation prompts. This feature makes it possible for management to design custom chat bots for specific jobs. These chat bots can be prepared with chat history, knowledge bases, response prepration prompts and instruction prompts. The prepared chat bots can be used by employees during task exectuion and they will get better results to the prior knowledge added to the system by management.
Custom bots for custom jobs
A single chat bot cannot serve all the needs of an organization. Custom tasks require different prepration, different instructions and often different AI models (Figure 5). Ozeki Chat can run multiple chat bots simultaneously. Team members can use these chatbots from their mobile devices, laptops and desktop computers. As each chatbot is a friend in the contact list in Ozeki chat, a single person can communicate with multiple chat bots at the same time as their task demands it. For example one chatbot can generate appropriate images, and another can generate text.