Ebbot GPT API
Unlock the Power of Advanced Language Models with Ebbot GPT API
Welcome to the Ebbot GPT API documentation. Our API offers seamless integration of service-optimized Large Language Models (LLMs) and Knowledge Management tools into your applications. With Ebbot GPT, you can harness sophisticated text generation capabilities and enhance your systems with cutting-edge AI technology. Explore our documentation to learn how to integrate and leverage the full potential of Ebbot GPT in your projects.
1. API Key
To make requests to the API, you will need an API key. To obtain one, please request access by filling out this form.
2. Tenant
A tenant is required for many requests. Tenants help separate token usage and datasets between different users. Typically, you’ll only need one tenant, but multiple tenants can be used if your business case requires it. Tenants are tied to your API key, ensuring your data remains secure.
3. Making a Request
ExternalId: The ID you chose when creating the tenant.
Auth: Authentication is done with a bearer token in the authorization header. Example:
"Authorization: Bearer MyAPIKey123"
3.1 List Models
To list the available models:
3.2 Create a tenant
Tenants are automatically created when a request with an unknown tenant is made. If you want to manually create a tenant then use this:
(Replace externalId with the id you want to add)
3.3 List tenants
To list the tenants you have created:
A response could look like this:
3.4 Creating a chat completion
This is where the magic happens. This is where you get a response from the bot. Here is a simple request with the bare necessities:
A response might look like this:
Request parameters
The following table describes the various parameters you can use when configuring your request. These parameters control different aspects of the response generation and the overall behavior of the API. Each parameter has specific functionality, ranging from controlling randomness in responses to enabling advanced retrieval mechanisms. Understanding these parameters will help you tailor the API's output to better meet your needs and ensure that you can leverage all available features effectively.
Field | Type | Description |
---|---|---|
| float | Top-p filtering selects words with the highest probability that cumulatively reach a set threshold, ensuring diverse and unpredictable text generation. (Alternative to temperature) |
| float | Controls the randomness of the response. Higher values make the output more random. |
| integer | Attempts to make the answer deterministic. |
| integer | Maximum number of tokens used in the answer. |
| boolean | Flags inappropriate content by user and bot (e.g., sexual content, violence). |
| object | Retrieval Augmented Generation (embedder) |
| boolean | Indicates if the embedder is enabled. |
| string | Placeholder for the word that will be replaced by the response from the embedder in the system prompt (e.g., |
| string (UUID) | The ID of the dataset that should be used. |
| boolean | If the sources of the datasets should be returned. |
| array | List of search definitions. |
| integer | Retrieve and rerank involves first pulling relevant information from a large dataset and then sorting it by importance or relevance. (Experimental) |
| number | Selects the top 'K' items from a list based on the highest scores or most relevance. |
| integer | Number of messages to use from the conversation. |
| string | Filters based on specific roles (user, assistant, both). |
| string | Specifies the inference engine to be used (e.g., |
| string | The model to be used for the request. |
| array | The conversation history. |
| string | The content of the message. |
| enum | Who sent the message (e.g., |
When you have gotten a completion you can add the response from the bot to the array of messages and then continue the conversation with your user input. It could look like this:
4. Get token usage
To get the token usage between two dates:
The response might look something like this:
Last updated