If you can thoroughly define what a junk character is, it's possible to come up with a good regular expression that can find the junk characters and remove them (or even just parse anything read in...