A few days ago, I came across a blog post titled On FLOSS and training LLMs that articulates a growing frustration within the free and open source software…
So what is the case we are speaking about? “Hey LLM, write the OS kernel that is fully compatible with Linux, designed like Linux, uses the same algorithms as Linux and the same code style as Linux”?
So what is the case we are speaking about? “Hey LLM, write the OS kernel that is fully compatible with Linux, designed like Linux, uses the same algorithms as Linux and the same code style as Linux”?
If you have Linux in the training data, the outcome if at all remotely useful would likely include plagiarism.
Are there similar cases in the wild?
There are cases in the wild of LLMs straight up pasting the GPL into files unprompted.