Request to GPT Model#

This block allows you to send requests to classic generative models from Open AI prior to ChatGPT. With it, you can create new texts on demand, perform various tasks:

  • classification,
  • summarization,
  • translation and rewriting of texts,
  • writing code prototypes in different programming languages,
  • parsing semi-structured and unstructured data,
  • extracting and processing facts,
  • maintaining a dialogue on various topics, and much more.

Payment for the use of this functionality is deducted from the client's account on the platform. For testing purposes, each new User is given the opportunity to test this functionality free of charge. After the development of the Robot is completed, payment is required for using this functionality.

Request[Text] Enter your request in natural language.
ModelSelect a model for text generation. The "text-davinci-003" model offers the highest quality generation, but its use is the most expensive and slowest. Other models may provide cheaper and faster results at the cost of some loss in generation quality. Use trial and error to find the model that best suits your tasks.
Your model name[Text] Specify your model name supported by Sherpa AI Server.
Stop words[Text/List] A string or list containing no more than 4 lines that will stop further generation. The returned text will not contain these words.
Temperature

[Number] A decimal number from 0 to 1 indicating the degree of "randomness" or "creativity" of the result, where:

  • 0 - least creative result,
  • 1 - most random.

For most creative tasks, a value of 0.7 is more suitable, and if you want to receive the same answer for the same request every time, set the value to 0.

Maximum length

[Number] The maximum length of the result, expressed in conditional tokens.

For English, 1 token is 4 characters, for most other languages, 1 token is 1 character.

Reduce this number if you want to receive shorter responses on average, increase it for longer responses. Note that this number limits the length of the response, but the response may not necessarily be the length you specified - depending on the content of the request, it may be shorter.

Auto-length limit

When enabled, the specified maximum length will be automatically adjusted.

This is calculated based on the number of tokens in the request and the maximum possible number of tokens for the selected model.

Timeout

[Number] The maximum wait time for a response in seconds. The actual wait time depends on the selected model, the length of your request, and the expected length of the response, as well as the current load on the servers.

If the set timeout limit is exceeded, an error occurs.

Number of attempts[Number] The number of attempts in case of a connection error with the server.
Response[Text] The response of the generative model to the request.
Final length[Number] The final length of the request and result (combined), expressed in conditional tokens.
Error handling level

Select the error handling level. Possible values:

  • "Default" - default;
  • "Ignore" - errors are ignored;
  • "Handle" - errors are handled.

If "Default" is selected, the value from the "Start" block of this diagram will be used.

Message level

Select the message level that the blocks will output during operation. Possible values:

  • "Default" - default;
  • "Release" - output disabled;
  • "Debug" - main information output;
  • "Detailed" - detailed information output.

If "Default" is selected, the value from the "Start" block of this diagram will be used.

Error text[Text] Returns detailed information about the error in case of incorrect execution of the block's work.