Even though it’s magnitudes lower than comparable models, Deepseek still cost millions to train. Unless someone’s willing to invest this just to retrain it from scratch, you’re left with the alignment of its trainers.
It’s baked into the training. It’s not a simple thing to take it out. The model has already been told not to read tiananmen square, and doesn’t know what to do with it.
Question: as i understood it so far, this thing is open source and so is the dataset.
With that, why would it still obey Chinese censorship?
Even though it’s magnitudes lower than comparable models, Deepseek still cost millions to train. Unless someone’s willing to invest this just to retrain it from scratch, you’re left with the alignment of its trainers.
It’s baked into the training. It’s not a simple thing to take it out. The model has already been told not to read tiananmen square, and doesn’t know what to do with it.