Message 86873 - Python tracker
Here is a patch that implements this proposed approach. It introduces a "surrogates" error handler, which is useful only with the utf-8 codec. If this is accepted, the implementation of PEP 383 can be simplified significantly: a separate utf-8b codec would essentially no longer be needed, since its behavior could be provided by an error handler, as is done for the other codecs.
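
To illustrate the idea of handling lone surrogates in an error handler rather than in a separate codec, here is a minimal pure-Python sketch. It is not the code from the patch: the handler name "surrogates_sketch" and the function below are hypothetical, and the sketch assumes a Python 3 interpreter in which an encode error handler may return bytes. It only shows how a handler registered with codecs.register_error could let the utf-8 codec round-trip lone surrogates.

```python
import codecs


def surrogates_sketch(exc):
    """Hypothetical handler: pass lone surrogates through the utf-8 codec."""
    if isinstance(exc, UnicodeEncodeError):
        out = bytearray()
        for ch in exc.object[exc.start:exc.end]:
            cp = ord(ch)
            if not 0xD800 <= cp <= 0xDFFF:
                raise exc  # only lone surrogates are handled here
            # Emit the ordinary 3-byte UTF-8 pattern for this code point.
            out += bytes((0xE0 | (cp >> 12),
                          0x80 | ((cp >> 6) & 0x3F),
                          0x80 | (cp & 0x3F)))
        return bytes(out), exc.end
    if isinstance(exc, UnicodeDecodeError):
        chunk = exc.object[exc.start:exc.start + 3]
        # An encoded surrogate is 0xED followed by 0xA0..0xBF and 0x80..0xBF.
        if (len(chunk) == 3 and chunk[0] == 0xED
                and 0xA0 <= chunk[1] <= 0xBF and 0x80 <= chunk[2] <= 0xBF):
            cp = (((chunk[0] & 0x0F) << 12)
                  | ((chunk[1] & 0x3F) << 6)
                  | (chunk[2] & 0x3F))
            return chr(cp), exc.start + 3
    raise exc


# Register under a made-up name so it cannot clash with built-in handlers.
codecs.register_error("surrogates_sketch", surrogates_sketch)

encoded = "a\udc80b".encode("utf-8", "surrogates_sketch")
decoded = encoded.decode("utf-8", "surrogates_sketch")
```

With the default "strict" handler, the same encode call raises UnicodeEncodeError. Because the handler is selected per call, the codec itself stays unchanged, which is the simplification the message argues for.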