Peter Burns
Jul 3, 2023

--

This deals with code. I was wondering whether this could be applied to other types of data. LLMs are trained on a lot of junk, and hence you have many problems like biases, misinformation, disinformation, etc. However, if you clean up the training corpus (get rid of disinformation website and the like), would the effect be the same as in this study (which was done on code)?

--

--

Peter Burns
Peter Burns

Written by Peter Burns

A curious polymath who wants to know how everything works. Blog: Renaissance Man Journal (http://gainweightjournal.com/).

No responses yet