📕 Node [[the ubuntu training corpus]]
📄 The Ubuntu training corpus.md by @KGBicheno

The Ubuntu training corpus

From [[Main Library - Chatterbot]]

The Ubuntu training corpus is an enormous (3 GB) collection of conversational text from Ubuntu's tech-support system.

It’s heavily biased and very noisy, but it should make a decent starting point.

Locations

Structure

Usage

Outputs

Benchmarks

Notes

See [[Why I didn’t use Chatterbot]]
