This Open Source ChatGPT Alternative isn’t for Everyone

Developing usable artificial intelligence

PaLM + RLHF does not come pre-trained, but the Reinforcement Learning from Human Feedback (RLHF) technique it uses is intended to provide a more intuitive user experience.

As explained by TechCrunch, RLHF trains a language model by having it produce a wide range of responses to a human prompt, which human volunteers then rank. Those rankings are used to train a “reward model”, which sorts responses in order of preference.
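To make the ranking step concrete, here is a minimal sketch of how human rankings could be turned into a reward signal. It is not the PaLM + RLHF code: it assumes a toy setup in which each response gets a single learned score, where a real reward model would be a neural network scoring the response text. The function names (`ranking_to_pairs`, `pairwise_loss`, `train_reward_scores`) and the pairwise logistic loss are illustrative assumptions, chosen because pairwise comparison is a common way to learn from rankings.

```python
import math
from itertools import combinations


def ranking_to_pairs(ranked_responses):
    """Expand a best-to-worst ranking into (preferred, rejected) pairs."""
    # combinations() keeps input order, so the first item of each pair
    # is always the one the human ranked higher.
    return list(combinations(ranked_responses, 2))


def pairwise_loss(score_preferred, score_rejected):
    """Negative log-probability that the preferred response outscores the other."""
    diff = score_preferred - score_rejected
    return -math.log(1.0 / (1.0 + math.exp(-diff)))


def train_reward_scores(rankings, epochs=200, lr=0.1):
    """Fit one scalar score per response by gradient descent on pairwise_loss.

    Toy stand-in for a reward model (assumption): a real one would
    generalise to unseen responses instead of memorising a score per string.
    """
    scores = {r: 0.0 for ranking in rankings for r in ranking}
    for _ in range(epochs):
        for ranking in rankings:
            for preferred, rejected in ranking_to_pairs(ranking):
                diff = scores[preferred] - scores[rejected]
                prob = 1.0 / (1.0 + math.exp(-diff))
                grad = prob - 1.0  # d(loss) / d(score of preferred)
                scores[preferred] -= lr * grad
                scores[rejected] += lr * grad
    return scores


# One volunteer ranked three hypothetical responses from best to worst.
rankings = [["helpful answer", "vague answer", "off-topic answer"]]
scores = train_reward_scores(rankings)
ordered = sorted(scores, key=scores.get, reverse=True)
```

After training, sorting the responses by score reproduces the volunteer's order, which is the "sorts responses in order of preference" behaviour described above. In full RLHF, a reward signal of this kind is then used to fine-tune the language model itself.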

This is not a cheap process, which puts training the model out of reach for all but the wealthiest AI enthusiasts. PaLM has 540 billion parameters (the components of the language model that must be trained on data), and a 2020 study found that training a model with only 1.5 billion parameters would cost anywhere from $80,000 to $1.6 million.

Right now, it seems we are relying on a wealthy benefactor to step in, train the model, and release it to the public. Such reliance has not ended well before, but other companies are already working to replicate ChatGPT’s capabilities and release them as free software.