Tools

This page lists specific tools provided by NovelAI.

= Module Training =

Consult Datasetting for AI Modules for more information on how to prepare files for Module training.

To access AI Module training, open the Story Library, click the Tools button (🧪), then select Module Training.

The left part of the interface contains the dataset. Only raw text files encoded in UTF-8 are supported.
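Because only UTF-8 text is accepted, it can help to check a dataset folder before uploading. The following is a minimal sketch using only the Python standard library; the helper names and the `*.txt` pattern are assumptions made for illustration and are not part of NovelAI.

```python
from pathlib import Path
from typing import List


def is_valid_utf8(data: bytes) -> bool:
    """Return True if the bytes decode cleanly as UTF-8."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


def find_non_utf8_files(folder: str) -> List[str]:
    """List the .txt files in `folder` that are not valid UTF-8.

    The *.txt pattern is an assumption; adjust it to match your dataset.
    """
    return sorted(
        str(path)
        for path in Path(folder).glob("*.txt")
        if not is_valid_utf8(path.read_bytes())
    )
```

Any file reported by `find_non_utf8_files` would need to be re-saved as UTF-8 in a text editor before it is uploaded.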

Upload all the files necessary for your module with the Select File button. Give your module a Name and Description on the right-hand side.

Choose the model that your module is for. Modules for Sigurd are not compatible with Euterpe!

The Total # of steps needed to train: field displays the estimated number of steps for 100% coverage of all text files. You do not have to set the number of training steps to that number. A good soft limit is ~3000 steps. You can also overfit your module by using more steps than the estimate. This can have problematic or useful effects, depending on what you want to achieve.
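As a rough illustration of how a chosen step count relates to the estimate, the sketch below computes coverage as a percentage. It assumes that coverage scales linearly with the number of steps, which the interface does not state explicitly; the function names are hypothetical.

```python
SOFT_LIMIT_STEPS = 3000  # the soft limit suggested above


def coverage_percent(chosen_steps: int, estimated_total_steps: int) -> float:
    """Coverage of the dataset, assuming it scales linearly with steps."""
    if estimated_total_steps <= 0:
        raise ValueError("estimated_total_steps must be positive")
    return 100.0 * chosen_steps / estimated_total_steps


def describe_choice(chosen_steps: int, estimated_total_steps: int) -> str:
    """Summarize a step choice relative to the estimate and the soft limit."""
    percent = coverage_percent(chosen_steps, estimated_total_steps)
    notes = [f"{percent:.0f}% coverage"]
    if percent > 100:
        notes.append("more steps than the estimate (may overfit)")
    if chosen_steps > SOFT_LIMIT_STEPS:
        notes.append("above the ~3000 step soft limit")
    return ", ".join(notes)
```

For example, `describe_choice(1500, 3000)` reports 50% coverage, while `describe_choice(4500, 3000)` flags both the overfitting risk and the soft limit.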

Select the number of steps using the slider, then click Train! to start the training. It will take several minutes before the module is produced. Once done, you can save it and import it like any other story, scenario or generation set.

== Steps ==

You get a free allotment of Anlas each subscription period: 10,000 Anlas for the Opus tier and 500 for the other tiers. The free allotment renews every month and does not accumulate.

You can purchase Anlas to train large modules or to train beyond your monthly free allotment. Purchased Anlas do not expire and remain available until spent.
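The arithmetic behind the allotment can be sketched as follows. The allotment figures come from the text above; the cost of one Anla per training step is an assumption exposed as the hypothetical `anlas_per_step` parameter, so check the actual cost shown in the interface before relying on the result.

```python
import math

FREE_ANLAS_OPUS = 10_000   # Opus tier allotment, from the text above
FREE_ANLAS_OTHER = 500     # all other tiers, from the text above


def anlas_to_purchase(tier: str, steps: int,
                      anlas_per_step: float = 1.0,
                      already_spent: int = 0) -> int:
    """Anlas to buy for a training run once the free allotment is used.

    anlas_per_step is an assumed rate, not a documented one.
    """
    free = FREE_ANLAS_OPUS if tier.lower() == "opus" else FREE_ANLAS_OTHER
    remaining_free = max(0, free - already_spent)
    cost = math.ceil(steps * anlas_per_step)
    return max(0, cost - remaining_free)
```

Under the assumed rate, a 3000-step module fits within the Opus allotment, while a non-Opus tier would need to purchase the difference.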


= Tokenizer =



The Tokenizer is a feature that allows you to check how your text is split into tokens before being sent to the AI. It can be accessed by pressing ALT+T or clicking on the Tools menu button (🧪) then "Tokenizer".

The Text tab highlights each token in sequence, giving neighboring tokens different colors so that the boundaries between them are easy to see.

The Token IDs tab displays the numeric ID that the AI uses for each token. This is very useful for banning one exact token with Ban Token ID, without also banning its variants with an added leading space or different capitalization.
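To illustrate why spacing and capitalization matter, here is a toy sketch. The vocabulary and its IDs are invented for this example and are not real NovelAI token IDs, and the greedy longest-match strategy is a simplification of what real tokenizers do.

```python
from typing import Dict, List

# Invented vocabulary for illustration only; real token IDs differ.
TOY_VOCAB: Dict[str, int] = {
    "cat": 101, " cat": 102, "Cat": 103, " Cat": 104,
    "The": 105, " sat": 106,
}


def tokenize(text: str) -> List[int]:
    """Greedy longest-match tokenization against TOY_VOCAB."""
    ids: List[int] = []
    pos = 0
    while pos < len(text):
        # Try the longest remaining piece first, then shrink it.
        for end in range(len(text), pos, -1):
            piece = text[pos:end]
            if piece in TOY_VOCAB:
                ids.append(TOY_VOCAB[piece])
                pos = end
                break
        else:
            raise ValueError(f"no token matches text at position {pos}")
    return ids


def variant_ids(word: str) -> Dict[str, int]:
    """IDs of the spacing and capitalization variants found in TOY_VOCAB."""
    base = word.strip().lower()
    candidates = [base, base.capitalize(), " " + base, " " + base.capitalize()]
    return {c: TOY_VOCAB[c] for c in candidates if c in TOY_VOCAB}
```

In this toy setup, `tokenize("The cat sat")` uses the ID for `" cat"` (with the leading space), so banning only that ID would leave the three other variants returned by `variant_ids("cat")` available.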


= Token Probability Viewer =



'''To use this feature, you must first enable it in the AI Settings panel of the Account Settings window. This adds a 🧠 button, which opens the Probability Viewer.'''

The Token Probability Viewer is a powerful diagnostic tool that lets you see which choices the AI considered before committing to a generation. This is very useful for fine-tuning generation settings and for checking why the AI keeps outputting the same response.

The left side of the interface displays the response text. You can switch between displaying the text proper, or the Token IDs. Cooler colors represent tokens with a low likelihood to appear, while hotter colors represent tokens highly likely to appear.

If you click a token, the right panel updates to show the tokens that the AI considered, both before and after generation settings, biases, bans, and modules are applied.

Only the 10 most likely tokens are shown. The remaining tokens are grouped into a single entry to save on processing time.
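The before and after views can be pictured with a small sketch. This is not NovelAI's implementation: the function names, the order in which bias and temperature are applied, and the sample values are all assumptions chosen for illustration. It converts raw scores (logits) into probabilities, applies a ban, a bias and a temperature, and then splits the result into the most likely tokens and a grouped remainder.

```python
import math
from typing import Dict, FrozenSet, List, Optional, Tuple


def softmax(logits: Dict[str, float]) -> Dict[str, float]:
    """Turn raw scores into probabilities that sum to 1."""
    peak = max(logits.values())  # subtract the peak for numerical stability
    exps = {tok: math.exp(val - peak) for tok, val in logits.items()}
    total = sum(exps.values())
    return {tok: e / total for tok, e in exps.items()}


def apply_settings(logits: Dict[str, float],
                   temperature: float = 1.0,
                   banned: FrozenSet[str] = frozenset(),
                   bias: Optional[Dict[str, float]] = None) -> Dict[str, float]:
    """Drop banned tokens, add biases, then divide by temperature (assumed order)."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    bias = bias or {}
    return {tok: (val + bias.get(tok, 0.0)) / temperature
            for tok, val in logits.items() if tok not in banned}


def top_with_remainder(probs: Dict[str, float],
                       shown: int = 10) -> Tuple[List[Tuple[str, float]], float]:
    """Return the `shown` most likely tokens and the summed probability of the rest."""
    ranked = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)
    remainder = sum(p for _, p in ranked[shown:])
    return ranked[:shown], remainder
```

Calling `top_with_remainder(softmax(logits))` corresponds to the "before" view, and `top_with_remainder(softmax(apply_settings(logits, ...)))` to the "after" view. A banned token disappears from the second list entirely, which is one way to spot why a response changed.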
