I used the script from this post that extracts text from RTF files.
When I output the result into a text file I see a lot of "NEL" when viewing it in notepad++.
How can I replace "NEL"? For "NUL" I managed to remove it using replace(u'\x00', u' ')
I used the script from this post that extracts text from RTF files.
When I output the result into a text file I see a lot of "NEL" when viewing it in notepad++.
How can I replace "NEL"? For "NUL" I managed to remove it using replace(u'\x00', u' ')
NEL is Newline. It should be U+0085.
If your editor is showing all of your text on one line with NEL in the middle of text, you could replace(u'\x85', u'\n') (or u'\r\n' depending on your system).